Advisory Board

Kurt Bollacker

Kurt Bollacker – Advisor

Kurt is a computer scientist with a research background in the areas of machine learning, digital libraries, semantic networks, and electro-cardiographic modeling. He received a Ph.D. in Computer Engineering from The University Of Texas At Austin. He was co-creator of the CiteSeer research tool as a visiting researcher at the NEC Research Institute, the technical director of the Internet Archive, and a biomedical research engineer at the Duke University Medical Center. His was Chief Scientist at Metaweb Technologies until Feb, 2009. He is currently pursuing research on long term digital archiving as the Digital Research Director at the Long Now Foundation as well as serving as a consulting Data Scientist at InfoChimps. Kurt is a philanthropist active with many organizations that promote openness, transparency and preservation. As an Advisor at Common Crawl, he provides the organization with valuable advice and insight into the crawl technology, big data processing, open innovation, products and collaborations.


Glenn Otis Brown

Glenn Otis Brown – Advisor

Glenn Otis Brown is director of business development at Twitter in New York. He has been head of music partnerships at YouTube, product counsel at Google, and executive director of Creative Commons, where he still serves on the board of directors. Glenn has also been a fellow at the Berkman Center for Internet and Society at Harvard and a lecturer at Stanford Law School.


Kevin DeBré

Kevin DeBré – Legal Counsel

Kevin is a highly respected Intellectual Property (IP) attorney who has continually worked at the forefront of the evolving IP landscape. He is a Partner and the Chairman of the Intellectual Property and Technology Transactions Department at the firm Stubbs, Alderton and Markiles. He is an author and frequent speaker on technology commercialization and intellectual property licensing. Kevin was selected in 2009, 2008 and 2006 as a Southern California Super Lawyer and is the founder and Chair of the Licensing Interest Group of the California State Bar Intellectual Property Section. Prior to joining Stubbs Alderton & Markiles, LLP, Kevin was a partner in leading international law firms, including Brobeck Phleger & Harrison, LLP, where he headed the firm’s technology transactions practice in Southern California. After law school, he served as a judicial law clerk for Hon. John G. Davies, United States District Court for the Central District of California. Kevin received his J.D. from Hastings College of the Law and his B.S. degree from the University of California, Davis.

Eva Ho

Eva Ho – Advisor

Eva Ho currently is the VP of Marketing & Operations at Factual, an open data platform that leverages large-scale aggregation and community exchange. Eva provides Common Crawl with valuable insight and advice on all aspects of the organization. Prior to Factual, she was a Sr. Product Marketing Manager at Google and Youtube for 5 years, and was the head of marketing for Applied Semantics, a company sold to Google in 2003. She also serves on the Board of Directors of Iridescent, a science education non-profit, First Descents a non-profit adventure outfit for cancer fighters and survivors, and Whole Child LA, a non-profit pediatric pain clinic. Eva holds an MBA from Cornell and a BA in Biology from Harvard University.


Joi Ito

Joi Ito – Advisor

Joi Ito is currently Director of the MIT Media Lab and a leading thinker and writer on innovation, global technology policy, and the role of the Internet in transforming society in substantial and positive ways. A vocal advocate of emergent democracy, privacy, and Internet freedom, Ito is board chair (and former CEO) of Creative Commons, and sits on the boards of the Mozilla Foundation, WITNESS, Global Voices, and the John D. and Catherine T. MacArthur Foundation. In Japan, he was a founder of Digital Garage, and helped establish and later became CEO of the country’s first commercial Internet service provider. He was an early investor in more than 40 companies, including Flickr, Six Apart,, Kongregate, Kickstarter, and Twitter. Ito’s honors include TIME magazine’s “Cyber-Elite” listing in 1997 (at age 31) and selection as one of the “Global Leaders for Tomorrow” by the World Economic Forum (2001). In 2008, BusinessWeek named him one of the “25 Most Influential People on the Web.” In 2011, he received the Lifetime Achievement Award from the Oxford Internet Institute.

May Kelcey

Mat Kelcey – Advisor

Mat is a software engineer at Wavii working on a variety of machine learning and natural language processing systems. Previous to Wavii he has worked with trading engines, massively multiplayer online games, high speed payment systems and has held numerous roles relating to information retrieval. Most recently while as an engineer on Amazon Elastic MapReduce he made several passes over the Common Crawl dataset doing statistical natural language processing. Mat considers Common Crawl to be one of the most important corpora available for building large scale probabilistic language models.

Bill Michels

Bill Michels – Advisor

Bill is currently the VP of Product Management and Partnerships at Factual. He joined in 2009 and prior to that worked at Yahoo! for about six years, the last two of which he was the GM and Senior Director of the Yahoo! Search Platform. Here he managed the product management, design, marketing and business development functions for Yahoo! BOSS, which grew from inception to over +1B queries/month. Before this he was the Director of Int’l Search Business Operations & Product Strategy, a +$1B P&L with 500+ employees. Other experience includes Equity Research Associate at UBS on the #1 ranked Institutional Investor telecom research team. At one point he wore a white coat and worked in a solid-state physics research lab. He has an MBA from Columbia University Business School and a BA in Geology from Colby College.

Peter Norvig

Peter Norvig – Advisor

Peter Norvig is Director of Research at Google and a Fellow of the American Association for Artificial Intelligence and the Association for Computing Machinery. From 2002-2005 he was Director of Search Quality, responsible for the core web search algorithms. Previously he was the head of the Computational Sciences Division at NASA Ames Research Center, making him NASA’s senior computer scientist. He has served as an assistant professor at the University of Southern California and a research faculty member at the University of California at Berkeley Computer Science Department, from which he received a Ph.D. in 1986 and the distinguished alumni award in 2006. He has over fifty publications in Computer Science, concentrating on Artificial Intelligence, Natural Language Processing and Software Engineering, including the books Artificial Intelligence: A Modern Approach (the leading textbook in the field), Paradigms of AI Programming: Case Studies in Common Lisp, Verbmobil: A Translation System for Face-to-Face Dialog, and Intelligent Help Systems for UNIX. He is also the author of the Gettysburg Powerpoint Presentation and the world’s longest palindromic sentence.

Jennifer Pahlka

Jennifer Pahlka – Advisor

Jennifer Pahlka is the founder, executive director and board chair of Code for America. Previously, she ran the Web 2.0 and Gov 2.0 events for TechWeb, in conjunction with O’Reilly Media, and co-chaired the successful Web 2.0 Expo. Before that, she spent eight years at CMP Media where she ran the Game Developers Conference, Game Developer magazine, and; there she also launched the Independent Games Festival and served as Executive Director of the International Game Developers Association. Jennifer’s early career was spent in the non-profit sector. She is a graduate of Yale University and lives in Oakland, California with her daughter and six chickens.

Boris Shimanovsky

Boris Shimanovsky – Advisor

Boris took his first programming course in the seventh grade and wrote a video game as his first class project. In support of open data and the open web, he generously donates his time to advise Common Crawl on technical matters and to share his visionary insight on the digital ecosystem. Boris is currently employed as the Director of Engineering at Factual where he’s working to bring greater accessibility and transparency with and to data. Prior to joining Factual, he was CTO at Xap, which he helped build from eight employees to a thriving business with millions of users, and over a hundred employees. His junior year in college, he built a site that quickly became the web’s 27th most trafficked and then optimized it enough to run from two machines in his den. He’s not retired now because he really wanted to graduate. Boris holds a BS in Physiological Science and an MS in Computer Science, both from UCLA. He put himself through school working night shifts as an ambulance driver. Boris lives in Los Angeles with his badass wife and amazing boys. He is a lousy philosopher, a mediocre poet, and will someday become a great chef.

Pete Skomoroch

Pete Skomoroch – Advisor 

Pete Skomoroch is a Principal Data Scientist at LinkedIn in Mountain View, CA, focused on reputation systems, collaborative filtering, and building data driven products. He leads a team of Data Scientists focused on Identity at LinkedIn and was the inventor of LinkedIn Skills. Prior to LinkedIn, he was based in Washington, DC where he focused on mining insights from search query data as the Director of Advanced Analytics at Juice Analytics and as a Sr. Research Engineer at AOL Search. While in DC, he also founded which provided custom data mining solutions to clients in bioinformatics, finance, and cloud computing. He spent the previous 6 years in Boston implementing Biodefense pattern detection algorithms for streaming sensor data at MIT Lincoln Laboratory and constructing predictive models for large retail datasets at Profitlogic (now Oracle Retail). Pete has a B.S. in Mathematics and Physics from Brandeis University and did graduate coursework in machine learning at MIT.

Danny Sullivan

Danny Sullivan – Advisor

Widely considered a leading “search engine guru,” Danny Sullivan has been helping webmasters, marketers and everyday web users understand how search engines work for 15 years. Danny’s expertise about search engines is often sought by the media, and he has been quoted in places like The Wall St. Journal, USA Today, The Los Angeles Times, Forbes, The New Yorker and Newsweek and ABC’s Nightline. Danny began covering search engines in late 1995, when he undertook a study of how they indexed web pages. The results were published online as “A Webmaster’s Guide To Search Engines,” a pioneering effort to answer the many questions site designers and Internet publicists had about search engines. Danny currently heads up Search Engine Land, which covers search marketing and search engine news. He produces the SMX: Search Marketing Expo conference series, writes a personal blog called Daggle.

Pete Warden

Pete Warden – Advisor

Pete Warden is a British-born programmer living in San Francisco. After spending over a decade as a software engineer, including 5 years at Apple, he’s now focused on a career as a mad scientist. He is currently gathering, analyzing and visualizing the flood of web data that’s recently emerged, trying to turn it into useful information without trampling on people’s privacy. Pete is the current CTO of Jetpac, a site for sharing travel photos, tips, and guides among friends. Passionate about large-scale data processing and visualization, he writes regularly on the topic on his blog and as a regular contributor to O’Reilly Radar.