While searching for historical Hong Kong Land Auction data I realised there is no publicly available dataset to download for analysis. The data is public info, but not published in a machine readable format. It is published on the website of the Lands Department as HTML tables (archives) or in PDF for more recent historical records.
Data extraction
Not a particularly elegant solution in this case as I ended up mass copy-pasting the data into MS excel to cleanup which got me a clean CSV file. Converted to json and used the very useful jq to manipulate the json file.
The dataset is at mikehiggins/hk-govt-land-sales
and pull requests to update the purchaser
field are most welcome.
Let me know if you've done anything interesting with the data and I'll post a link here.