AIC Seminar Series

Though search on the World-Wide Web has focused mostly on unstructured
text, there is an increasing amount of structured data on the Web and
growing interest in harnessing such data. I will describe several
current projects at Google whose overall goal is to leverage
structured data and better expose it to our users. These projects
include crawling the deep web, collecting and mining the HTML tables
on the Web, and computing aspects for search queries to better
organize answers. In each case, I will focus the lessons learned from
the project and the opportunities that lie ahead. I will also describe how
we can apply the principles of dataspaces, i.e., pay-as-you-go data management to these problems.

Bio for Alon Halevy

Alon Halevy heads the Structured Data Management Research group at
Google. Prior to that, he was a professor of Computer Science at the
University of Washington in Seattle. In 1999, Dr. Halevy co-founded
Nimble Technology, one of the first companies in the Enterprise
Information Integration space, and in 2004, Dr. Halevy founded
Transformic Inc., a company that created search engines for the deep
web, and was acquired by Google. Dr Halevy is a Fellow of the
Association for Computing Machinery, received the the Presidential
Early Career Award for Scientists and Engineers (PECASE) in 2000, and
was a Sloan Fellow (1999-2000). He received his Ph.D in Computer
Science from Stanford University in 1993.

Note for Visitors to SRI

Please arrive at least 10 minutes early as you will need to sign in by
following instructions by the lobby phone at Building E. SRI is located
at 333 Ravenswood Avenue in Menlo Park. Visitors may park in the parking
lots off Fourth Street. Detailed directions to SRI, as well as maps, are
available from the Visiting AIC web page.
There are two entrances to SRI International located on Ravenswood Ave.
Please check the Builing E entrance signage.