Princeton Startup Weekend Recap

Last weekend, I had the privilege of attending Princeton Startup Weekend. I went as a representative of Factual, one of the weekend’s sponsors, and as a resource for teams that wanted advice, feedback, or technical help. If you haven’t heard of Startup Weekend, the gist is that developers, designers, marketers, product [...]

Dedupe, Merge, and Purge: the Art of Normalization Video from Strata NYC 2011

Watch Tyler Bell and Leo Polovets’s talk from Strata NYC 2011 (Thursday, 09/22/2011). Big Noise always accompanies Big Data, especially when extracting entities from the tangle of duplicate, partial, fragmented and heterogeneous information we call the Internet. The ~17m physical businesses in the US, for example, are found on over 1 billion webpages and endpoints [...]

Princeton Startup Weekend 2011 Contest

Factual is sponsoring this weekend’s Princeton Startup Weekend. If you’re not familiar with Startup Weekend, learn about it on their site. The details for this weekend’s event: November 11-13, 2011, at the Friend Center for Engineering Education, Princeton University, Princeton, NJ. Factual is hosting a contest and awarding $500 to the team that demonstrates the best use [...]

Clojure on Hadoop: A New Hope

Factual’s U.S. Places dataset is built from tens of billions of signals. Our raw data is stored in HDFS and processed using Hadoop. We’re big fans of the core Hadoop stack; however, there is a dark side to using Hadoop. The traditional approach to building and running Hadoop jobs can be cumbersome. As our Director [...]