Geocoder
Complete geocoding solution for Ruby.
Key features:
- Forward and reverse geocoding.
- IP address geocoding.
- Connects to more than 40 APIs worldwide.
- Performance-enhancing features like caching.
- Integrates with ActiveRecord and Mongoid.
- Basic geospatial queries: search within radius (or rectangle, or ring).
Compatibility:
- Ruby versions: 2.1+, and JRuby.
- Databases: MySQL, PostgreSQL, SQLite, and MongoDB.
- Rails: 5.x, 6.x, and 7.x.
- Works outside of Rails with the
json
(for MRI) or json_pure
(for JRuby) gem.
Table of Contents
Basic Features:
Advanced Features:
The Rest:
See Also:
Basic Search
In its simplest form, Geocoder takes an address and searches for its latitude/longitude coordinates:
results = Geocoder.search("Paris")
results.first.coordinates
The reverse is possible too. Given coordinates, it finds an address:
results = Geocoder.search([48.856614, 2.3522219])
results.first.address
You can also look up the location of an IP address:
results = Geocoder.search("172.56.21.89")
results.first.coordinates
results.first.country
The success and accuracy of geocoding depends entirely on the API being used to do these lookups. Most queries work fairly well with the default configuration, but every application has different needs and every API has its particular strengths and weaknesses. If you need better coverage for your application you'll want to get familiar with the large number of supported APIs, listed in the API Guide.
Geocoding Objects
To automatically geocode your objects:
1. Your model must provide a method that returns an address to geocode. This can be a single attribute, but it can also be a method that returns a string assembled from different attributes (eg: city
, state
, and country
). For example, if your model has street
, city
, state
, and country
attributes you might do something like this:
def address
[street, city, state, country].compact.join(', ')
end
2. Your model must have a way to store latitude/longitude coordinates. With ActiveRecord, add two attributes/columns (of type float or decimal) called latitude
and longitude
. For MongoDB, use a single field (of type Array) called coordinates
(i.e., field :coordinates, type: Array
). (See Advanced Model Configuration for using different attribute names.)
3. In your model, tell geocoder where to find the object's address:
geocoded_by :address
This adds a geocode
method which you can invoke via callback:
after_validation :geocode
Reverse geocoding (given lat/lon coordinates, find an address) is similar:
reverse_geocoded_by :latitude, :longitude
after_validation :reverse_geocode
With any geocoded objects, you can do the following:
obj.distance_to([43.9,-98.6])
obj.bearing_to([43.9,-98.6])
obj.bearing_from(obj2)
The bearing_from/to
methods take a single argument which can be: a [lat,lon]
array, a geocoded object, or a geocodable address (string). The distance_from/to
methods also take a units argument (:mi
, :km
, or :nm
for nautical miles). See Distance and Bearing below for more info.
One More Thing for MongoDB!
Before you can call geocoded_by
you'll need to include the necessary module using one of the following:
include Geocoder::Model::Mongoid
include Geocoder::Model::MongoMapper
Latitude/Longitude Order in MongoDB
Everywhere coordinates are passed to methods as two-element arrays, Geocoder expects them to be in the order: [lat, lon]
. However, as per the GeoJSON spec, MongoDB requires that coordinates be stored longitude-first ([lon, lat]
), so internally they are stored "backwards." Geocoder's methods attempt to hide this, so calling obj.to_coordinates
(a method added to the object by Geocoder via geocoded_by
) returns coordinates in the conventional order:
obj.to_coordinates
whereas calling the object's coordinates attribute directly (obj.coordinates
by default) returns the internal representation which is probably the reverse of what you want:
obj.coordinates
So, be careful.
Use Outside of Rails
To use Geocoder with ActiveRecord and a framework other than Rails (like Sinatra or Padrino), you will need to add this in your model before calling Geocoder methods:
extend Geocoder::Model::ActiveRecord
Geospatial Database Queries
For ActiveRecord models:
To find objects by location, use the following scopes:
Venue.near('Omaha, NE, US')
Venue.near([40.71, -100.23], 50)
Venue.near([40.71, -100.23], 50, units: :km)
Venue.geocoded
Venue.not_geocoded
With geocoded objects you can do things like this:
if obj.geocoded?
obj.nearbys(30)
obj.distance_from([40.714,-100.234])
obj.bearing_to("Paris, France")
end
For MongoDB-backed models:
Please do not use Geocoder's near
method. Instead use MongoDB's built-in geospatial query language, which is faster. Mongoid also provides a DSL for geospatial queries.
Geocoding HTTP Requests
Geocoder adds location
and safe_location
methods to the standard Rack::Request
object so you can easily look up the location of any HTTP request by IP address. For example, in a Rails controller or a Sinatra app:
result = request.location
The location
method is vulnerable to trivial IP address spoofing via HTTP headers. If that's a problem for your application, use safe_location
instead, but be aware that safe_location
will not try to trace a request's originating IP through proxy headers; you will instead get the location of the last proxy the request passed through, if any (excepting any proxies you have explicitly whitelisted in your Rack config).
Note that these methods will usually return nil
in test and development environments because things like "localhost" and "0.0.0.0" are not geocodable IP addresses.
Geocoding Service ("Lookup") Configuration
Geocoder supports a variety of street and IP address geocoding services. The default lookups are :nominatim
for street addresses and :ipinfo_io
for IP addresses. Please see the API Guide for details on specific geocoding services (not all settings are supported by all services).
To create a Rails initializer with sample configuration:
rails generate geocoder:config
Some common options are:
Geocoder.configure(
lookup: :yandex,
ip_lookup: :maxmind,
api_key: "...",
timeout: 5,
units: :km,
cache: Redis.new,
cache_options: {
expiration: 1.day,
prefix: "another_key:"
}
)
Please see lib/geocoder/configuration.rb
for a complete list of configuration options. Additionally, some lookups have their own special configuration options which are directly supported by Geocoder. For example, to specify a value for Google's bounds
parameter:
Geocoder.search("Middletown", bounds: [[40.6,-77.9], [39.9,-75.9]])
Please see the source code for each lookup to learn about directly supported parameters. Parameters which are not directly supported can be specified using the :params
option, which appends options to the query string of the geocoding request. For example:
Geocoder.search("Rome", params: {countrycodes: "us,ca"})
Geocoder.search("Rome", params: {region: "..."})
Configuring Multiple Services
You can configure multiple geocoding services at once by using the service's name as a key for a sub-configuration hash, like this:
Geocoder.configure(
timeout: 2,
cache: Redis.new,
yandex: {
api_key: "...",
timeout: 5
},
baidu: {
api_key: "..."
},
maxmind: {
api_key: "...",
service: :omni
}
)
Lookup-specific settings override global settings so, in this example, the timeout for all lookups is 2 seconds, except for Yandex which is 5.
Performance and Optimization
Database Indices
In MySQL and Postgres, queries use a bounding box to limit the number of points over which a more precise distance calculation needs to be done. To take advantage of this optimisation, you need to add a composite index on latitude and longitude. In your Rails migration:
add_index :table, [:latitude, :longitude]
In MongoDB, by default, the methods geocoded_by
and reverse_geocoded_by
create a geospatial index. You can avoid index creation with the :skip_index option
, for example:
include Geocoder::Model::Mongoid
geocoded_by :address, skip_index: true
Avoiding Unnecessary API Requests
Geocoding only needs to be performed under certain conditions. To avoid unnecessary work (and quota usage) you will probably want to geocode an object only when:
- an address is present
- the address has been changed since last save (or it has never been saved)
The exact code will vary depending on the method you use for your geocodable string, but it would be something like this:
after_validation :geocode, if: ->(obj){ obj.address.present? and obj.address_changed? }
Caching
When relying on any external service, it's always a good idea to cache retrieved data. When implemented correctly, it improves your app's response time and stability. It's easy to cache geocoding results with Geocoder -- just configure a cache store:
Geocoder.configure(cache: Redis.new)
This example uses Redis, but the cache store can be any object that supports these methods:
store#[](key)
or #get
or #read
- retrieves a valuestore#[]=(key, value)
or #set
or #write
- stores a valuestore#del(url)
- deletes a valuestore#keys
- (Optional) Returns array of keys. Used if you wish to expire the entire cache (see below).
Even a plain Ruby hash will work, though it's not a great choice (cleared out when app is restarted, not shared between app instances, etc).
When using Rails use the Generic cache store as an adapter around Rails.cache
:
Geocoder.configure(cache: Geocoder::CacheStore::Generic.new(Rails.cache, {}))
You can also set a custom prefix to be used for cache keys:
Geocoder.configure(cache_options: { prefix: "..." })
By default the prefix is geocoder:
If you need to expire cached content:
Geocoder::Lookup.get(Geocoder.config[:lookup]).cache.expire(:all)
Geocoder::Lookup.get(:nominatim).cache.expire("http://...")
Geocoder::Lookup.get(:nominatim).cache.expire(:all)
Geocoder::Lookup.all_services.each{|service| Geocoder::Lookup.get(service).cache.expire(:all)}
Do not include the prefix when passing a URL to be expired. Expiring :all
will only expire keys with the configured prefix -- it will not expire every entry in your key/value store.
In addition to conventional cache stores like Redis, it's possible to keep your cache in the database using ActiveRecord
. For example see this gist.
Before you implement caching in your app please be sure that doing so does not violate the Terms of Service for your geocoding service.
Not all services support caching, check the service limitations in the API guide for more information.
Advanced Model Configuration
You are not stuck with the latitude
and longitude
database column names (with ActiveRecord) or the coordinates
array (Mongo) for storing coordinates. For example:
geocoded_by :address, latitude: :lat, longitude: :lon
geocoded_by :address, coordinates: :coords
For reverse geocoding, you can specify the attribute where the address will be stored. For example:
reverse_geocoded_by :latitude, :longitude, address: :loc
reverse_geocoded_by :coordinates, address: :street_address
To specify geocoding parameters in your model:
geocoded_by :address, params: {region: "..."}
Supported parameters: :lookup
, :ip_lookup
, :language
, and :params
. You can specify an anonymous function if you want to set these on a per-request basis. For example, to use different lookups for objects in different regions:
geocoded_by :address, lookup: lambda{ |obj| obj.geocoder_lookup }
def geocoder_lookup
if country_code == "RU"
:yandex
elsif country_code == "CN"
:baidu
else
:nominatim
end
end
Custom Result Handling
So far we have seen examples where geocoding results are assigned automatically to predefined object attributes. However, you can skip the auto-assignment by providing a block which handles the parsed geocoding results any way you like, for example:
reverse_geocoded_by :latitude, :longitude do |obj,results|
if geo = results.first
obj.city = geo.city
obj.zipcode = geo.postal_code
obj.country = geo.country_code
end
end
after_validation :reverse_geocode
Every Geocoder::Result
object, result
, provides the following data:
result.latitude
- floatresult.longitude
- floatresult.coordinates
- array of the above two in the form of [lat,lon]
result.address
- stringresult.city
- stringresult.state
- stringresult.state_code
- stringresult.postal_code
- stringresult.country
- stringresult.country_code
- string
Most APIs return other data in addition to these globally-supported attributes. To directly access the full response, call the #data
method of any Geocoder::Result object. See the API Guide for links to documentation for all geocoding services.
Forward and Reverse Geocoding in the Same Model
You can apply both forward and reverse geocoding to the same model (i.e. users can supply an address or coordinates and Geocoder fills in whatever's missing) but you'll need to provide two different address methods:
- one for storing the fetched address (when reverse geocoding)
- one for providing an address to use when fetching coordinates (forward geocoding)
For example:
class Venue
geocoded_by :address_from_components
reverse_geocoded_by :latitude, :longitude, address: :full_address
end
The same goes for latitude/longitude. However, for purposes of querying the database, there can be only one authoritative set of latitude/longitude attributes for use in database queries. This is whichever you specify last. For example, here the attributes without the fetched_
prefix will be authoritative:
class Venue
geocoded_by :address,
latitude: :fetched_latitude,
longitude: :fetched_longitude
reverse_geocoded_by :latitude, :longitude
end
Advanced Database Queries
The following apply to ActiveRecord only. For MongoDB, please use the built-in geospatial features.
The default near
search looks for objects within a circle. To search within a doughnut or ring use the :min_radius
option:
Venue.near("Austin, TX", 200, min_radius: 40)
To search within a rectangle (note that results will not include distance
and bearing
attributes):
sw_corner = [40.71, 100.23]
ne_corner = [36.12, 88.65]
Venue.within_bounding_box(sw_corner, ne_corner)
To search for objects near a certain point where each object has a different distance requirement (which is defined in the database), you can pass a column name for the radius:
Venue.near([40.71, 99.23], :effective_radius)
If you store multiple sets of coordinates for each object, you can specify latitude and longitude columns to use for a search:
Venue.near("Paris", 50, latitude: :secondary_latitude, longitude: :secondary_longitude)
Distance and Bearing
When you run a geospatial query, the returned objects have two attributes added:
obj.distance
- number of miles from the search point to this objectobj.bearing
- direction from the search point to this object
Results are automatically sorted by distance from the search point, closest to farthest. Bearing is given as a number of degrees clockwise from due north, for example:
0
- due north180
- due south90
- due east270
- due west230.1
- southwest359.9
- almost due north
You can convert these to compass point names via provided method:
Geocoder::Calculations.compass_point(355)
Geocoder::Calculations.compass_point(45)
Geocoder::Calculations.compass_point(208)
Note: when running queries on SQLite, distance
and bearing
are provided for consistency only. They are not very accurate.
For more advanced geospatial querying, please see the rgeo gem.
Geospatial Calculations
The Geocoder::Calculations
module contains some useful methods:
Geocoder::Calculations.distance_between([47.858205,2.294359], [40.748433,-73.985655])
=> 3619.77359999382
Geocoder::Calculations.geographic_center([city1, city2, [40.22,-73.99], city4])
=> [35.14968, -90.048929]
See the code for more!
Batch Geocoding
If you have just added geocoding to an existing application with a lot of objects, you can use this Rake task to geocode them all:
rake geocode:all CLASS=YourModel
If you need reverse geocoding instead, call the task with REVERSE=true:
rake geocode:all CLASS=YourModel REVERSE=true
In either case, it won't try to geocode objects that are already geocoded. The task will print warnings if you exceed the rate limit for your geocoding service. Some services enforce a per-second limit in addition to a per-day limit. To avoid exceeding the per-second limit, you can add a SLEEP
option to pause between requests for a given amount of time. You can also load objects in batches to save memory, for example:
rake geocode:all CLASS=YourModel SLEEP=0.25 BATCH=100
To avoid exceeding per-day limits you can add a LIMIT
option. However, this will ignore the BATCH
value, if provided.
rake geocode:all CLASS=YourModel LIMIT=1000
Testing
When writing tests for an app that uses Geocoder it may be useful to avoid network calls and have Geocoder return consistent, configurable results. To do this, configure the :test
lookup and/or :ip_lookup
Geocoder.configure(lookup: :test, ip_lookup: :test)
Add stubs to define the results that will be returned:
Geocoder::Lookup::Test.add_stub(
"New York, NY", [
{
'coordinates' => [40.7143528, -74.0059731],
'address' => 'New York, NY, USA',
'state' => 'New York',
'state_code' => 'NY',
'country' => 'United States',
'country_code' => 'US'
}
]
)
With the above stub defined, any query for "New York, NY" will return the results array that follows. You can also set a default stub, to be returned when no other stub matches a given query:
Geocoder::Lookup::Test.set_default_stub(
[
{
'coordinates' => [40.7143528, -74.0059731],
'address' => 'New York, NY, USA',
'state' => 'New York',
'state_code' => 'NY',
'country' => 'United States',
'country_code' => 'US'
}
]
)
You may also delete a single stub, or reset all stubs including the default stub:
Geocoder::Lookup::Test.delete_stub('New York, NY')
Geocoder::Lookup::Test.reset
Notes:
- Keys must be strings (not symbols) when calling
add_stub
or set_default_stub
. For example 'country' =>
not :country =>
. - The stubbed result objects returned by the Test lookup do not support all the methods real result objects do. If you need to test interaction with real results it may be better to use an external stubbing tool and something like WebMock or VCR to prevent network calls.
Error Handling
By default Geocoder will rescue any exceptions raised by calls to a geocoding service and return an empty array. You can override this on a per-exception basis, and also have Geocoder raise its own exceptions for certain events (eg: API quota exceeded) by using the :always_raise
option:
Geocoder.configure(always_raise: [SocketError, Timeout::Error])
You can also do this to raise all exceptions:
Geocoder.configure(always_raise: :all)
The raise-able exceptions are:
SocketError
Timeout::Error
Geocoder::OverQueryLimitError
Geocoder::RequestDenied
Geocoder::InvalidRequest
Geocoder::InvalidApiKey
Geocoder::ServiceUnavailable
Note that only a few of the above exceptions are raised by any given lookup, so there's no guarantee if you configure Geocoder to raise ServiceUnavailable
that it will actually be raised under those conditions (because most APIs don't return 503 when they should; you may get a Timeout::Error
instead). Please see the source code for your particular lookup for details.
Command Line Interface
When you install the Geocoder gem it adds a geocode
command to your shell. You can search for a street address, IP address, postal code, coordinates, etc just like you can with the Geocoder.search method for example:
$ geocode 29.951,-90.081
Latitude: 29.952211
Longitude: -90.080563
Full address: 1500 Sugar Bowl Dr, New Orleans, LA 70112, USA
City: New Orleans
State/province: Louisiana
Postal code: 70112
Country: United States
Map: http://maps.google.com/maps?q=29.952211,-90.080563
There are also a number of options for setting the geocoding API, key, and language, viewing the raw JSON response, and more. Please run geocode -h
for details.
Technical Discussions
Distance Queries in SQLite
SQLite's lack of trigonometric functions requires an alternate implementation of the near
scope. When using SQLite, Geocoder will automatically use a less accurate algorithm for finding objects near a given point. Results of this algorithm should not be trusted too much as it will return objects that are outside the given radius, along with inaccurate distance and bearing calculations.
There are few options for finding objects near a given point in SQLite without installing extensions:
-
Use a square instead of a circle for finding nearby points. For example, if you want to find points near 40.71, 100.23, search for objects with latitude between 39.71 and 41.71 and longitude between 99.23 and 101.23. One degree of latitude or longitude is at most 69 miles so divide your radius (in miles) by 69.0 to get the amount to add and subtract from your center coordinates to get the upper and lower bounds. The results will not be very accurate (you'll get points outside the desired radius), but you will get all the points within the required radius.
-
Load all objects into memory and compute distances between them using the Geocoder::Calculations.distance_between
method. This will produce accurate results but will be very slow (and use a lot of memory) if you have a lot of objects in your database.
-
If you have a large number of objects (so you can't use approach #2) and you need accurate results (better than approach #1 will give), you can use a combination of the two. Get all the objects within a square around your center point, and then eliminate the ones that are too far away using Geocoder::Calculations.distance_between
.
Because Geocoder needs to provide this functionality as a scope, we must go with option #1, but feel free to implement #2 or #3 if you need more accuracy.
Numeric Data Types and Precision
Geocoder works with any numeric data type (e.g. float, double, decimal) on which trig (and other mathematical) functions can be performed.
A summary of the relationship between geographic precision and the number of decimal places in latitude and longitude degree values is available on Wikipedia. As an example: at the equator, latitude/longitude values with 4 decimal places give about 11 metres precision, whereas 5 decimal places gives roughly 1 metre precision.
Troubleshooting
Mongoid
If you get one of these errors:
uninitialized constant Geocoder::Model::Mongoid
uninitialized constant Geocoder::Model::Mongoid::Mongo
you should check your Gemfile to make sure the Mongoid gem is listed before Geocoder. If Mongoid isn't loaded when Geocoder is initialized, Geocoder will not load support for Mongoid.
ActiveRecord
A lot of debugging time can be saved by understanding how Geocoder works with ActiveRecord. When you use the near
scope or the nearbys
method of a geocoded object, Geocoder creates an ActiveModel::Relation object which adds some attributes (eg: distance, bearing) to the SELECT clause. It also adds a condition to the WHERE clause to check that distance is within the given radius. Because the SELECT clause is modified, anything else that modifies the SELECT clause may produce strange results, for example:
If you get an error in the above cases, try the following:
City.near("Omaha, NE", 20, select: "id, name")
City.near("Omaha, NE", 20, select: "id, name").to_a.pluck(:id, :name)
City.near("Omaha, NE", 20, select: "id").to_a.pluck(:id)
City.near("Omaha, NE", 20, select: "cities.*, venues.*").joins(:venues)
Hotel.near("London, UK", 50).joins(:administrator).preload(:administrator)
Geocoding is Slow
With most lookups, addresses are translated into coordinates via an API that must be accessed through the Internet. These requests are subject to the same bandwidth constraints as every other HTTP request, and will vary in speed depending on network conditions. Furthermore, many of the services supported by Geocoder are free and thus very popular. Often they cannot keep up with demand and their response times become quite bad.
If your application requires quick geocoding responses you will probably need to pay for a non-free service, or--if you're doing IP address geocoding--use a lookup that doesn't require an external (network-accessed) service.
For IP address lookups in Rails applications, it is generally NOT a good idea to run request.location
during a synchronous page load without understanding the speed/behavior of your configured lookup. If the lookup becomes slow, so will your website.
For the most part, the speed of geocoding requests has little to do with the Geocoder gem. Please take the time to learn about your configured lookup before posting performance-related issues.
Unexpected Responses from Geocoding Services
Take a look at the server's raw response. You can do this by getting the request URL in an app console:
Geocoder::Lookup.get(:nominatim).query_url(Geocoder::Query.new("..."))
Replace :nominatim
with the lookup you are using and replace ...
with the address you are trying to geocode. Then visit the returned URL in your web browser. Often the API will return an error message that helps you resolve the problem. If, after reading the raw response, you believe there is a problem with Geocoder, please post an issue and include both the URL and raw response body.
You can also fetch the response in the console:
Geocoder::Lookup.get(:nominatim).send(:fetch_raw_data, Geocoder::Query.new("..."))
Known Issues
Using count
with Rails 4.1+
Due to a change in ActiveRecord's count
method you will need to use count(:all)
to explicitly count all columns ("*") when using a near
scope. Using near
and calling count
with no argument will cause exceptions in many cases.
Using near
with includes
You cannot use the near
scope with another scope that provides an includes
option because the SELECT
clause generated by near
will overwrite it (or vice versa).
Instead of using includes
to reduce the number of database queries, try using joins
with either the :select
option or a call to preload
. For example:
City.near("Omaha, NE", 20, select: "cities.*, venues.*").joins(:venues)
Hotel.near("London, UK", 50).joins(:administrator).preload(:administrator)
If anyone has a more elegant solution to this problem I am very interested in seeing it.
Using near
with objects close to the 180th meridian
The near
method will not look across the 180th meridian to find objects close to a given point. In practice this is rarely an issue outside of New Zealand and certain surrounding islands. This problem does not exist with the zero-meridian. The problem is due to a shortcoming of the Haversine formula which Geocoder uses to calculate distances.
Copyright :copyright: 2009-2021 Alex Reisner, released under the MIT license.