Blog
The latest news, interviews, technologies, and resources.

News
The Columnar Index Is Now the URL Index
We have renamed the Columnar Index to the URL Index, to be clearer about its purpose and to pave the way for more datasets in a columnar format.

Analysis
SlideShare: Building a Scalable Web Crawler with Hadoop
Common Crawl on building an open web-scale crawl using Hadoop.