Changes in our Global Places Data – Q3 2014

We know it’s important to our customers that Factual data is always the best possible representation of places in the physical world. To keep up with the ceaselessly changing environment of businesses around the globe, we are always refreshing and improving our data — stripping out old listings that have gone out of businesses, adding [...]

Emergent Behaviors in Factual’s Geopulse Audience Profiles

Factual’s Geopulse Audience product assembles real-world profiles for millions of smart-phone users around the world. A suite of sophisticated geo-fencing, machine-learning, and heuristic methods are used to convert the user input, a set of lat/long records for a particular device, into a colorful description of the user. This description includes demographic, behavioral, and geographic information, [...]

Expanding Restaurants Extended Attributes Data Coverage to France, Germany, and Australia

Today we are excited to announce the expansion of our Restaurants Extended Attributes data to include France, Germany, and Australia. While our Global Places data has always covered restaurants around the world, the 43 additional restaurant specific attributes in our Restaurants Extended Attributes data were previously only available in the United States and the United [...]

Ten years of OpenStreetMap

Note: This was originally posted on O’Reilly Radar on 8/15/14, available here. Next to GPS, the most significant development in the Open Geo Data movement is OpenStreetMap (OSM), a community-driven mapping project whose goal is to create the most detailed, correct, and current open map of the world. This week, OSM celebrates its 10th birthday, [...]

Profiling Hadoop jobs with Riemann

Factual processes nontrivial amounts of data. Our analyses may range over 1011 records, reading hundreds of gigabytes to hundreds of terabytes of source data and intermediate representations. At this scale, performance optimizations can save us significant time and money. We use VisualVM, jhat, and Yourkit for memory and CPU profiling, and the excellent Criterium for microbenchmarks [...]