What does Geotagging do?

Geotagging finds the geographic coordinates of place names mentioned in unstructured text. For example, consider the following unstructured text:

“Abraham Lincoln lived in Springfield before he became president”

  • Springfield (in Massachusetts)
  • Springfield (in Illinois)
  • etc.

Geotagging provides the capability of deciding which is the right candidate by analyzing other place names and indicators in the surrounding context to rank the coordinate candidates.

Why is Geotagging Challenging?

Geotagging may sound straightforward, but it’s actually quite challenging for the following reasons:

  • Place names can be spelled differently than the names in the gazetteer, so the matching has to be fuzzy. Middle Eastern or Central Asian place names in English frequently exhibit varying spellings:
    • Mazar-e Sharif vs. Mazir-i Sharif (city in Afghanistan)
  • Place names can be ambiguous. For instance, many person names are also place names: Paris, Austin, Jordan, Washington, Lincoln, Clinton, etc. You don’t want to geotag a person name as a place!
  • Locations may be described in relative terms (e.g., 100km northeast of Paris). Geotagging should be capable of calculating the precise geographic location even if it’s not named.
  • Gazetteers may not be available for some languages. It is possible to use English language gazetteers for foreign names, but it requires a machine translation of names in foreign languages to English.

Advanced Geotagging

Geotagging has usually been limited to place names, but by utilizing relationship extraction and event extraction, events as well as other types of entities that are linked to places can be analyzed geospatially.

  • Examples:
    • Locations of a person who traveled to places:
      • “John Robertson traveled to Beirut”
    • Attack events in particular region
      • “Three terrorists attacked a convoy in Kabul”
    • Locations of outbreaks of diseases
      • “Virginia reported 996 new cases of Corona virus yesterday”
    • Locations of the offices of an organization
      • “XYZ Corporation has its headquarters in Brussels”

To take the first example, Event Extraction will produce the following structured representation for it:

Event Type: travel
Person: John Robertson
Destination: Beirut

As with simple place names, geotagging can now geoenable this now-structured data by placing a person icon for John Roberson on Beirut (or any other places he traveled to) on a map. Events such as an individual’s movements can now be tracked and analyzed in an entirely visual way.

Summary

In sum, Geotagging is a critical technology for maximizing the value of geospatial information found in your unstructured text sources. Crucial information is increasingly found in unstructured, natural language data, and Geotagging allows it to be combined with already structured data for more complete analysis.

Recent Posts

  • Two silhouetted figures stand facing a large window in a modern office building.

    How Entity Extraction Identifies Adverse Information on PEPs

    Doing business in a global economy carries unique compliance risks that require collecting information from millions of news articles and…

    View Post

  • The complexities of building your own name matching system

    The Complexities of Building Your Own Name Matching System

    Here are some pitfalls to consider if you are weighing whether to build your own Name Matching system

    View Post

  • name matching for border security

    Name Matching for Border Security

    The stakes are especially high in border security. It requires accurate, real-time, scalable, multi-ethnicity name matching.

    View Post