Geolocation

How to geo-reference your dataset

@JacopoOttaviani

Geocoding is the process of converting addresses (like "Puerta del Sol, Madrid") into geographic coordinates (like latitude 40.416980 and longitude -3.704429), which you can use to place markers on a map.

Scenarios

  • Location columns are lat/lon

  • Location column is a city

  • Location column is a postal code

  • Location column is a province, region or country

  • Location column is an address

Scenarios, i

  • Location columns are lat/lon (free on CartoDB)

  • Location column is a city (free on CartoDB)

  • Location column is a postal code (free on CartoDB)

  • Location column is a province, region or country

  • Location column is an address

Scenarios, ii

  • Location columns are lat/lon (free on CartoDB)

  • Location column is a city (free on CartoDB)

  • Location column is a postal code (free on CartoDB)

  • Location column is a province, region or country (choropleth: shapefile merging required)

  • Location column is an address

Scenarios, iii

  • Location columns are lat/lon (free on CartoDB)

  • Location column is a city (free on CartoDB)

  • Location column is a postal code (free on CartoDB)

  • Location column is a province, region or country (choropleth: shapefile merging required)

  • Location column is an address (100 for free on CartoDB. Otherwise: workaround required). 

When for free:

More info: http://docs.cartodb.com/tutorials/how_to_georeference.html

When it is not for free, or it doesn't work... 

1st option (easy):

  • Use Google Maps API in Google Spreadsheet with a macro script

 

2nd option (nerdy): 

  • Use OpenRefine to geocode your data with Google and OpenStreetMap API
  • Open the dataset you want to georeference on OpenRefine
  • Fetch the geographic information from a geographic API
  • Fill it in the dataset and re-import in CartoDB

https://github.com/nuket/google-sheets-geocoding-macro

  • Open the Google Spreadsheet you want to geolocate
  • Tools -> Script Editor

  • Copy and paste the code into the editor 

  • Save and reload your sheet 

  • a new “Geocode” menu should appear after the reload

What's an API?

Application Programming Interface

 

It's just a different way to query a web application, such as Google Maps.

 

How? With a URL:

 

http://maps.googleapis.com/maps/api/geocode/json?sensor=false&address=the_address_im_looking_for

 

for example:

http://maps.googleapis.com/maps/api/geocode/json?sensor=false&address=puerta del sol madrid

JSON result returned by the API

(It's like a blob of data with geographic info)

Let's be a little bit more nerdish with OpenRefine

www.openrefine.org

OpenRefine

Open the dataset: 90_non_geolocated_libya_body_count

Coordinates extraction

Once queried the Google Maps API, we can extract the coordinates from the API blobs

Finally you have your coordinates

  • Check empty/error rows
  • Remove the columns you don't need
  • Export the dataset in CSV or XLS format
  • Open your new dataset on CartoDB
  • Map it!

When the API doesn't find the coordinates, go manual

Geolocation via OpenRefine

Step-by-step guide:

http://bit.ly/openrefine-geolocation

 

Questions?

Exercise #4. Geolocation with OpenRefine

  • Download dataset: 90_libya_body_count from http://bit.ly/cartography-madrid

  • Georeference it with OpenRefine

    • Use Google Maps Geolocation API
    • Extract coordinates from blobs
    • Remove blobs
  • Export the new dataset

  • Import it on CartoDB and check the geometry

#4 - Geolocation

By jottaviani

#4 - Geolocation

  • 758