Yahoo has also developed a set of open source
code called Cocktails, which gives advertisers the tools to extract
information in Yahoo's Hadoop systems. Cocktails are written in
JavaScript so they can run in browser windows; Manhattan is a
server-side cocktail and Mojito is client-side. Essentially, Burke
explained, each cocktail could run the same reporting program on a
server or an end-user laptop, respectively, to get information that an
advertiser is seeking. "We believe over the years, Cocktails will
replace PHP in the enterprise. JavaScript is a better, more interactive
language," Burke said.

Cocktailswork with a Yahoo service, Advertiser Insights, to find
information. The reporting system works on a Hadoop-Oracle-Pentaho
Mondrian (open source code for building multi-view cubes) stack. "Yahoo
has a lot of valuable data. Advertisers can see all the information that
we've learned about their customers," noted Burke.

Finally, Yahoo uses Hadoop for its own internal analysis of information
captured from user interactions. It stores 140 petabytes in Hadoop.
Since Hadoop keeps all data sets in triplicate, over 400 petabytes of
storage are needed to sustain its systems.