Guy Freeman, 20th February 2019
IN GENERAL, GETTING ENOUGH CLEAN DATA IS DAMN HARD
In 2014, the New York Times exposed the truth 😳
“Open data and content can be freely used, modified, and shared by anyone for any purpose”
From https://opendefinition.org/:
From http://opendatahandbook.org/guide/en/what-is-open-data/:
However, data.gov.hk is missing the juiciest datasets:
Even the data on data.gov.hk isn't as open as can be; much of the data is in Excel spreadsheets at best, or PDF at worst!
If open data isn't available via an easy method, we can go and create our own.
Using web scraping, I have collected publicly available data and cleaned it up for clients and the public to easily access via API:
A similar project is webb-site.com, which doesn't have an API, and is focused on HKEX matters.
Open Data is a democratic tool for understanding social phenomena. On Data Guru's blog I have revealed some truths that were otherwise hidden:
Now that I've started to open some Hong Kong data, after analysing it, I can create ML solutions. I once had horse race tipper, but I'll say no more about that... Today I will show my latest product, truehome.hk
We gathered the transaction amounts for 1.9 million transactions for 1.7 million "units" (flats or houses) from over 43,000 buildings, and I am building statistical models for predicting the value of any given flat with this data.
Open Data, by its very name, is not proprietary. Once it is collected and disseminated, it is a net win for everyone. Get in touch to: