Google Wants Email Scan Information Blocked

San Francisco — Google is seeking to black out portions of a transcript from a public court hearing that includes information on how it mines data from personal emails.

The world’s largest Internet-search provider, fighting a lawsuit claiming its interception of emails amounts to illegal wiretapping, asked U.S. District Judge Lucy H. Koh in a filing last week to redact “confidential” information from the transcript, without being more specific. The main revelation at the Feb. 27 hearing was the existence of “Content Onebox,” used by Google to intercept emails for targeted advertising and to build user profiles, Sean Rommel, a lawyer for plaintiffs, told the judge at the time.

Google’s latest move to keep records in the case out of public view comes as Koh is weighing a request by companies including National Public Radio, New York Times Co. and Washington Post Co. to unseal other key documents filed earlier that the company contended were too sensitive to be revealed.

The Feb. 27 hearing in federal court in San Jose, Calif., was to determine whether the lawsuit will proceed as a group suit, or class action. Koh’s ruling will have implications for other email privacy cases assigned to her that were filed last year against Yahoo Inc. and LinkedIn Corp.

According to Rommel, Google in September or October 2010 moved Content Onebox, which previously scanned stored emails, to “the delivery pipeline,” to extract data before users received the messages. Live or unopened communications are more protected under the federal Wiretap Act than stored ones.

Rommel told Koh that Google made the change after determining that Content Onebox couldn’t extract information from emails that hadn’t been opened or deleted, or that were opened on an iPhone or sent through an email provider other than Google.

After the switch to scanning unopened emails, Google began creating profiles of people “from which they could extrapolate additional advertisements,” Rommel said, without giving details of how Content Onebox works. “Google itself has admitted and declared that the location and the timing of Content Onebox’s existence is proprietary, it’s secret, it’s unknown,” Rommel said at the Feb. 27 hearing. “There’s not a single disclosure in the record which identifies that there’s a content extraction feature occurring in the delivery process, which would be the interception.”

Google’s attempt to black out portions of the transcript that haven’t been publicly reported in news accounts is at odds with statements from Michael Rhodes, the Mountain View, Calif.-based company’s lawyer.

“We came here today and we unburdened the court of any sealing effort,” Rhodes told Koh at the Feb. 27 hearing. “We agreed that all of the material that had been previously designated confidential could be aired in the public courtroom so that those folks back there in the media could see that Google has nothing to hide here.”

At the hearing, Rommel discussed a predecessor to “Content Onebox” called “Cat II mixer,” which he said was used for Google’s advertising business. Rommel also claimed Google’s interception of emails violates opt-out agreements that educational institutions or small businesses signed to shield themselves from the company’s scans.

Google has said in court documents that its practice of scanning emails and use of scanned information isn’t a secret.

“However, the mechanics of how Google performs those processes are sensitive, both for user security reasons and for competitive reasons,” Google argued in a court filing. “The specific techniques that Google uses to implement Gmail’s processes, the systems and infrastructure it uses to apply those processes, and the sequences in which it applies those processes are the result of over nine years of development by Google’s engineers.”

The media companies contend Google improperly sought to seal, for example, an email exchange between employees analyzing different Gmail systems, and documents describing an experiment conducted to better understand how users interact with Gmail, and why they click on ads, according to a court filing.

Google’s sealing order relies on a “nebulous economic harm” that the company claims could result from “the public disclosure of any information about Google’s business model or technology,” the media companies said in a court filing.

The consolidated case before Koh stems from lawsuits brought by users of Gmail and other email services from states including Texas, Pennsylvania, Maryland and Florida.

Koh in September denied Google’s bid to dismiss the case. The judge rejected Google’s argument — which it continues to make — that Gmail users agreed to let their messages be scanned when they accepted subscription service terms and privacy policies.