Ten years of OpenStreetMap

Note: This was originally posted on O’Reilly Radar on 8/15/14, available here. Next to GPS, the most significant development in the Open Geo Data movement is OpenStreetMap (OSM), a community-driven mapping project whose goal is to create the most detailed, correct, and current open map of the world. This week, OSM celebrates its 10th birthday, [...]

Profiling Hadoop jobs with Riemann

Factual processes nontrivial amounts of data. Our analyses may range over 1011 records, reading hundreds of gigabytes to hundreds of terabytes of source data and intermediate representations. At this scale, performance optimizations can save us significant time and money. We use VisualVM, jhat, and Yourkit for memory and CPU profiling, and the excellent Criterium for microbenchmarks [...]

Announcing the Trusted Data Contributor Program

We are very pleased to launch today the Factual Trusted Data Contributor Program. These organizations work directly with businesses and brands, and in-turn ensure that their data is represented accurately in Factual. We have selected a limited number of partners who provide high-quality data to Factual and equally excellent service to their customers. These organizations [...]

​Introducing: Multi-Categorization and Hours of Operation

We have officially released Multiple Categories and the Hours of Operation attribute in our Global Places data! Multiple Categories Category_ids and category_labels can now support multiple values​. This makes ​perfect ​sense​,​ as there are many ​entities in the world that are more than one ​”​thing​”​ and can belong to multiple categories. Now, as a user [...]

Changes in our Global Places Data – Q2 2014

Back in April we talked about the challenges of keeping our Global Places data as fresh and clean as possible. Because Factual’s data represents the real world- which is changing every day- we have to update it often to make sure we’re providing the most accurate records possible. Here’s a quick look at what’s gone [...]