Comparison of Four Geocoders: Nominatim, GoogleV3, ArcGis and AzureMaps

Geocoding services (via Geopy):

Geocoding refers to the maping of an address to its geographic coordinates. Specifically, the geocoding of a location specified by a query string can be achieved using calls to geocoding services (APIs) directly in a browser address box, or with a wrapping library such as Geopy.

Why I setup this comparison:

In another application, I tried using the New York City boroughs bounding boxes to impute missing borough names for records with geolocation information (via set operations) as a way to prevent over 85,000 requests. I trashed the box ‘solution’ once I found out about the official territorial boundaries (shapefiles). Yet, in the intervening time I had checked several services for speed and limits and I found that some geocoders were not returning the same geolocation or bounding boxes for the same query, so I investigated!

To date, April 2019, Geopy has 21 traditional geocoders (excluding What3Words), so I picked only four of them. Following are the links to the geocoders documentation and their respective service providers via Geopy:

Shapefile sources:

I used two shapefiles: that of New York City (with water extent), and that of Boston, without water extent as it was the only one I found.

New York City: https://data.cityofnewyork.us/City-Government/Borough-Boundaries-Water-Areas-Included-/tv64-9x69
Boston: https://data.boston.gov/dataset/city-of-boston-boundary2

Places queried:

“New York City, NY, USA”
“Cleopatra’s needle, Central Park, New York, NY, USA”
“Bronx county, NY, USA”
“Kings county, NY, USA”
“New York county, NY, USA”
“Queens county, NY, USA”
“Richmond county, NY, USA”
“Boston, MA, USA”

Update: Report on temporal differences:

Because I noticed that the results from this April were different from those from last September (2018), I put together an HTML report highlighting the differences. A persistent oddity is that half of the geocoders tested cannot distinguish between “New York City” and “New York county” (Manhattan): GoogleV3 and ArcGis return identical bounding boxes.

Slides in this updated report:

Intro (this slide)
Heatmaps of pairwise distance differences:
-> How large are the distance differences for each geocoder in this comparison?
Old heatmaps
New heatmaps
Relationship between center of bounding box and queried location (NYC):
Comparison tables
Maps:
New York City
New York county (Manhattan)
Cleopatra’s Needle (a monument in Central Park)
Bronx county
Brooklyn county
Queens county
Richmond county (Staten Island)
Boston

The main conclusions from this comparison:

Which is the best of these four?

Nominatim: ⭐ of the glorious open-source community that is OpenStreetMap;
GoogleV3: similar results to Nominatim; requires an account;
ArcGis: the least wrong of the worse two
AzureMaps: Oh come on! 😖

No hedging!
I presume that it is very unlikely that an application would use different geolocating services, but in the case some ‘hedging’ is involved (e.g. on limits, time-outs), the geolocation for the same query will be different.

Mind the box!
Additionally, my non-exhaustive comparison of four Geopy geocoders (out of 21), reveals that the boxing of a location is not always principled. For instance, Nominatim and GoogleV3 most often use the shapefile with water extent for boxing, whereas ArcGis and AzureMaps do not; moreover, ArcGis boxes typically extend further North than warranted by the existing shapefiles by at most 10 miles.

Final notes on accuracy in the geolocation context:

I found this excellent resource on precision for geolocations. Here are the main points:

Decimal place (ordinal)	Maximal resolved distance	Comments on precision
1	11.1 km	can distinguish the position of one large city from a neighboring large city
2	1.1 km	can separate one village from the next
3	110 m	can identify a large agricultural field or institutional campus
4	11 m	can identify a parcel of land. It is comparable to the typical accuracy of an uncorrected GPS unit with no interference
5	1.1 m	can distinguish trees from each other. Accuracy to this level with commercial GPS units can only be achieved with differential correction
6	0.11 m	can be used for laying out structures in detail, for designing landscapes, building roads. It should be more than good enough for tracking movements of glaciers and rivers. This can be achieved by taking painstaking measures with GPS, such as differentially corrected GPS
7	11 mm	good for surveying and is near the limit of what GPS-based techniques can achieve
8	1.1 mm	good for charting motions of tectonic plates and movements of volcanoes. Permanent, corrected, constantly-running GPS base stations might be able to achieve this level of accuracy
9	110 microns	we are getting into the range of microscopy. For almost any conceivable application with earth positions, this is overkill and will be more precise than the accuracy of any surveying device
> 9	useless	indicates a computer or calculator was used and that no attention was paid to the fact that the extra decimals are useless. Be careful, because unless you are the one reading these numbers off the device, this can indicate low quality processing!

Cat Chenal