Search results

Common Crawl - Blog - Common Crawl's Advisory Board

Common Crawl's Advisory Board. As part of our ongoing effort to grow Common Crawl into a truly useful and innovative tool, we recently formed an Advisory Board to guide us in our efforts.

Common Crawl - Blog - Professor Jim Hendler Joins the Common Crawl Advisory Board!

Professor Jim Hendler Joins the Common Crawl Advisory Board! We are extremely happy to announce that Professor Jim Hendler has joined the Common Crawl Advisory Board.

Common Crawl - Our Team

Advisory Board. Board of Directors. Emeritus Members. The Data. Overview. Web Graphs. Latest Crawl. Crawl Stats. Graph Stats. Errata. Resources. Get Started. AI Agent. Blog. Examples. Use Cases. CCBot. Infra Status. FAQ. Community. Research Papers.

Common Crawl - Blog - Mat Kelcey Joins The Common Crawl Advisory Board

Mat Kelcey Joins The Common Crawl Advisory Board. We are excited to announce that Mat Kelcey has joined the Common Crawl Board of Advisors!

Common Crawl - Team - Chris Tolles

Executive Advisor. Chris Tolles is an experienced Silicon Valley executive, entrepreneur & 3X co-founder; building products & companies that have championed individual agency and freedom on the Internet.

Common Crawl - Blog - White House Briefing on Open Data’s Role in Technology

Rich Skrenta, Executive Director of the Common Crawl Foundation led the briefing, accompanied by Hugh Marbury and Chris Tolles from our advisory board. Other attendees both in person and online included representatives from the OSTP, the U.S.

Common Crawl - Blog - Twelve steps to running your Ruby code across five billion web pages

The following is a guest blog post by Pete Warden, a member of the Common Crawl Advisory Board. Pete is a British-born programmer living in San Francisco.

Common Crawl - Team - Eva Ho

Board Member. Eva is a General Partner at Fika Ventures. Prior to Fika, Eva was a founding GP at Susa Ventures. She is a serial entrepreneur and founder, including companies like Applied Semantics, Google, Factual and Navigating Cancer.

Common Crawl - Team - Mike Markson

Advisor. Michael Markson is an accomplished professional with a solid track record in both the legal and technology sectors. He began his career at the Brown and Wood law firm, gaining valuable experience in legal advisory and corporate affairs.

Common Crawl - Team - Jennifer Pahlka

Advisor. Jennifer Pahlka is the founder, executive director and board chair of Code for America. Previously, she ran the Web 2.0 and Gov 2.0 events for TechWeb, in conjunction with O’Reilly Media, and co-chaired the successful Web 2.0 Expo.

Common Crawl - Blog - March/April 2024 Newsletter

New Board Member. Discord Server. Updated Legal Information. Crawl & Graph Errata. Improved Cadence. Acknowledgements. Web Graphs. Our.

Common Crawl - Team - Kurt Bollacker

Advisor. Kurt is a computer scientist with a research background in the areas of machine learning, digital libraries, semantic networks, and electro-cardiographic modeling.

Common Crawl - Team - Peter Norvig

Advisor. Peter Norvig is Director of Research at Google and a Fellow of the American Association for Artificial Intelligence and the Association for Computing Machinery.

Common Crawl - Team - Hugh Marbury

Advisor. Hugh focuses his practice on business and intellectual property litigation. His business litigation practice focuses on complex financial transactions and commercial disputes across multiple sectors.

Common Crawl - Team - Danny Sullivan

Advisor. Widely considered a leading “search engine guru,” Danny Sullivan has been helping webmasters, marketers and everyday web users understand how search engines work for 15 years.

Common Crawl - Team - Pete Skomoroch

Advisor. Pete Skomoroch is a Principal Data Scientist at LinkedIn in Mountain View, CA, focused on reputation systems, collaborative filtering, and building data driven products.

Common Crawl - Team - Praveen Paritosh

Advisor. Praveen has spent his career studying the intersection of crowdsourcing, natural language understanding, knowledge representation, and artificial intelligence (AI).

Common Crawl - Team - Lesley Gold

Advisor. Lesley uses strategic communications to put companies on the map, build brands, and create platforms for leaders driving massive, disruptive success.

Common Crawl - Team - Pete Warden

Advisor. Pete Warden is CEO at Useful Sensors, was previously technical lead of the TensorFlow Micro team at Google, and founder of Jetpac, a deep learning technology startup acquired by Google in 2014.

Common Crawl - Team - Lilith Bat-Leah

Advisor. Lilith specializes in the strategic application of data science, AI/machine learning, and analytics.

Common Crawl - Team - Sam Reddy

Staff Advisor. Over a 30-year tech career, Sam has a broad range of experiences as an engineer, founder, early employee, advisor, and strategic angel investor. Her roots are in public safety systems, open source, and social entrepreneurship.

Common Crawl - Team - Carl Malamud

Board Member. Carl Malamud is an American technologist, author, and public domain advocate, known for his foundation Public.Resource.Org. He founded the Internet Multicasting Service.

Common Crawl - Blog - IAB Workshop on AI-CONTROL

Earlier this month, the Common Crawl Foundation had the privilege of participating in a groundbreaking workshop hosted by the Internet Architecture Board (IAB) in Washington DC. Common Crawl Foundation.

Common Crawl - Team - Gil Elbaz

In 2020, Factual merged with Foursquare and today Gil is Co-Chairman of the board of a combined entity which generated $150m in combined revenue at the time of the merger.

Common Crawl - Blog - Video: This Week in Startups - Gil Elbaz and Nova Spivack

Founder Gil Elbaz and Board Member Nova Spivack appeared on. This Week in Startups. on January 10, 2012.

Common Crawl - Blog - Learn Hadoop and get a paper published

Then once you've talked with your advisor, follow up to your comment, and we'll be available to help point you in the right direction technically. Step 1: Learn Hadoop. MapReduce for the Masses: Zero to Hadoop in 5 Minutes with Common Crawl.

Common Crawl - Blog - Welcome, Sebastian!

With Sebastian on board, we have both the competence and momentum to take Common Crawl to the next level. The Data. Overview. Web Graphs. Latest Crawl. Crawl Stats. Graph Stats. Errata. Resources. Get Started. AI Agent. Blog. Examples. Use Cases. CCBot.

Common Crawl - Blog - Providing Authenticity & Data Provenance for Common Crawl Using Blockchain: Our Work with Constellation Network

Watch this panel from Constellation’s event, Protecting America and Restoring Trust Using AI & Blockchain, featuring our Executive Advisor Chris Tolles, who speaks on the role of open data in rebuilding public trust. The Data. Overview. Web Graphs.

Common Crawl - Blog - Gil Elbaz and Nova Spivack on This Week in Startups

As a sign of many more good things to come in 2012, Founder Gil Elbaz and Board Member Nova Spivack appeared on this week's episode of. This Week in Startups.

Common Crawl - Blog - Dialog and Discovery at AI_dev 2024

(Senior Advisor at Open Voice TrustMark Initiative). Pedro Ortiz Suarez. (Senior Research Scientist at Common Crawl). The panel moderator and presenter was. Anni Lai. (Head of Open Source Operations at LF AI & Data Foundation).

Common Crawl - Blog - 5 Good Reads in Big Open Data: Feb 20 2015

From the Chairman of Common Crawl’s Board of Directors (and Factual CEO) Gil Elbaz on the future of search. On opening up libraries with linked data. – via.

Common Crawl - Blog - Common Crawl Enters A New Phase

In 2008, Carl Malamud and Nova Spivack joined Gil to form the Common Crawl board of directors. Talented engineer Ahad Rana began developing the technology for our crawler and processing pipeline.

Common Crawl - Blog - October/November 2024 Newsletter

In late September, we had the privilege of participating in a groundbreaking workshop on AI-CONTROL hosted by the Internet Architecture Board (IAB) in Washington DC.

Common Crawl - Blog - August/September 2024 Newsletter

We're also taking part in. a workshop hosted by the Internet Architecture Board. in Washington DC in September.