The original RECAP extension for Firefox was launched eight years ago. Today we launch an all new version. Since the original launch in 2009, we’ve kept the system running smoothly, added a Chrome extension, and — with your help — collected and shared information about tens of millions of PACER documents.

Today we’re announcing the future of RECAP. If you’re an existing Firefox or Chrome user, you should automatically get this update over the next 24 hours. If you’re a new user, just learning about RECAP, you can find links for Firefox or Chrome on the right, and you can learn more on the RECAP homepage.

As you’re using PACER, the extensions will stop providing links to the Internet Archive, and will instead provide links to CourtListener and the RECAP Archive, where dockets and documents are fully text searchable.

Links to CourtListener will be available very soon after an upload from PACER is complete — possibly within seconds or minutes. This has been the most-requested enhancement we’ve heard over the years, and we’re really happy to be bringing this …

Today we’re proud to announce that Ansel Halliburton has joined the Free Law Project Board of Directors. Ansel has already been contributing to Free Law Project for several years, and his experience as both a practicing lawyer and legal technologist will help steer us into the future.

Ansel has been involved in legal technology since 2006, when he joined a team at Stanford Law School to help build a comprehensive database of all federal intellectual property litigation. Now a lawyer, Ansel practices technology law in San Francisco. Ansel holds degrees from UC Berkeley and UC Davis School of Law, writes about technology and security for Lawyerist, and builds small robots.

Board members are selected for terms lasting two years.

“We couldn’t be more delighted to welcome Ansel to the board,” stated Free Law Project Executive Director Michael Lissner. “Ansel brings the kinds of knowledge, experience, and smart thinking that will be invaluable to Free Law Project over the next few years.”

At Free Law Project, we have gathered millions of court documents over the years, but it’s with distinct pride that we announce that we have now completed our biggest crawl ever. After nearly a year of work, and with support from the U.S. Department of Labor and Georgia State University, we have collected every free written order and opinion that is available in PACER. To accomplish this we used PACER’s “Written Opinion Report,” which provides many opinions for free.

This collection contains approximately 3.4 million orders and opinions from approximately 1.5 million federal district and bankruptcy court cases dating back to 1960. More than four hundred thousand of these documents were scanned and required OCR, amounting to nearly two million pages of text extraction that we completed for this project.

PACER/ECF is a system of 204 websites that is run by the Administrative Office of the Courts (AO) for the management of federal court documents. The main function of PACER/ECF is for lawyers and the public to upload and download court documents such as briefs, memos, orders, and opinions.

In February we reported that we disclosed a major vulnerability in PACER/ECF to the AO. The proof of concept and disclosure/resolution timeline are available here.

We are pleased to share that this issue is now properly addressed, and that we are now able to report more details about it. Throughout the process of researching, disclosing, and resolving this vulnerability, the AO has been prompt and professional, something that we greatly appreciate given the considerable constraints and complexities they are facing. However, despite their skill in dealing with this issue, after discovering it we have lingering concerns about the security of PACER/ECF on the whole.

In this post, we discuss three topics. First, we outline what the vulnerability was and how to identify if you were a victim of it. Second, we discuss why the vulnerability is troubling for a system of PACER/ECF’s size and …

Today, we’re celebrating another milestone. We now have more than a million minutes of oral argument audio in CourtListener.com’s oral argument archive. All of this audio is available in our search engine, APIs, podcasts, and our website.

We have had a lot of success with our oral argument archive, and we’re thrilled to be able to provide it at no cost, despite its size growing every day.

In the coming months we will continue expanding this collection in two key ways. First, we plan to begin automatically generating transcripts for audio files so that you can get an email alert any time certain words are said in court. Second, we plan to continue expanding our coverage so that it includes more state courts, and so that it includes all of the federal courts that have recordings available.

Today, the PACER system contains millions of court filings for the federal district, circuit, and bankruptcy courts, most of which are sold at a dime per page with a three dollar cap per document. But content in PACER was not always priced this way, and indeed the PACER system goes back all the way to the early 1990’s, before computers were generally connected to the Internet.

During the 27 year history outlined below, technology has changed significantly, and the Administrative Office of the Courts has done its best to keep up. Over the years, PACER has offered a variety of ways to get court information. These include a 1-900 number, a search service available via a regular phone call, the ability to connect your own computer directly to the courts’, and the websites that we know today.

We have a small update to share today, as we’ve wrapped up adding thousands of historical Supreme Court citations to our collection. These are the original citations for the Supreme Court from 1754 to 1874, from before when the United States Reports had begun. Previously we had many of these citations, but as of today we can say we have historical citations for our entire SCOTUS collection.

Today we are launching a new project to download all of the free opinions and orders that are available on PACER. Since we do not want to unduly impact PACER, we are doing this process slowly, giving it several weeks or months to complete, and slowing down if any PACER administrators get in touch with issues.

In this project, we expect to download millions of PDFs, all of which we will add to both the RECAP Archive that we host, and to the Internet Archive, which will serve as a publicly available backup.1 In the RECAP Archive, we will be immediately parsing the contents of all the PDFs as we download them. Once that is complete we will extract the content of scanned documents, as we have done for the rest of the collection.

This project will create an ongoing expense for Free Law Project—hosting this many files costs real money—and so we want to explain two major reasons why we believe this is an important project. The first reason is because there is a monumental value to these documents, and until now they have not been easily available to the public. These documents are a critical …

To use this new feature, type the name of the party or attorney into the fields on the RECAP Archive homepage or in the sidebar to the left of any search results. These boxes also accept advanced query syntax, and there are several new fields that can be queried from the main search box including party, attorney, and firm.

For example, in the main box you can search for attorney:”eric holder”~2 firm:covington. This query shows the cases where the attorney has the word “Eric” within two words of “Holder” (thus allowing his middle name) which were handled at the firm “Covington & Burling”.

Recently, as part of our routine business practices, we discovered what we believe is a major vulnerability in the PACER system of websites that we believe affects both the electronic case filing and public access portals.

At this time, as part of a responsible disclosure process, we have notified the appropriate parties at The Administrative Office of the Courts, the agency that runs PACER. According to industry norms, we have given them a broad 90 day window to resolve the vulnerability.

After the 90 days are up or the issue is resolved, we plan to publish the details of what we discovered, the ramifications of the discovery, and the solution that they have put in place, if any.

Free Law Project’s role in this grant will be to acquire court opinions and orders from PACER, and to provide them to Alexander and Feizollahi for their research. Because PACER is not optimized for automated access, a key outcome of the grant will be to develop tools and infrastructure to enable other researchers to utilize PACER data through future grants.

“PACER data is too difficult for researchers to access, and it’s high time that a centralized service be created by a non-profit to gather this kind of data for researchers,” says Michael Lissner, Founder and Executive Director of …

The House Judiciary Committee held a hearing today on the topic of the “the effectiveness of the PACER service and use of audio and video recordings of courtroom procedures.” Three witnesses were invited by the committee to speak at the hearing, including our board member, Thomas Bruce, who spoke at length on the topic of reforming the PACER system. His written testimony can be found here.

Bruce framed his testimony by providing an overview of the things that PACER is and is not. In his words, these are the characteristics that define PACER:

First, PACER charges fees for access to public records.

Second, PACER became outmoded two years after it was built, and in some ways has never caught up.

Third, PACER suffers from a split personality. On one hand, it is an electronic filing and case management system that supports the Federal courts […]On the other […] it is a data publishing system that offers the work of the Federal courts, both documents and metadata, to a very wide range of people[…]

Free Law Project is proud to share that in our effort to create the most complete online collection of opinions, today we added 24 Attorney General advisory opinions relating to the Emoluments Clause and dating from 1952 to 2009. Attorney General advisory opinions are written by the Office of Legal Counsel in the Justice Department, and provide legal advice to the President and all the Executive Branch agencies.

The opinions we added today were made available to Politico in 2012 in response to a Freedom of Information Act request, and were recently published on their website. Included in the opinions is a range of advisory guidance on topics ranging from whether Nixon could accept anonymous donations to pay for his taxes to whether President Obama could accept the Nobel Peace Prize.

Ginsburg has participated in hundreds of oral arguments that we have in our system.

Clicking on the button at the bottom takes you back to our database of oral argument recordings where you can further refine your search. If the judge is active, there is an icon in the upper right that lets you subscribe to a podcast of the cases heard by that judge. At this time, these features are only available for the Supreme Court and for jurisdictions where the judges for specific cases are provided by the court website. We hope to expand this in the future.

To our knowledge, a linkage like this has never previously existed on any system, and we hope that it will make research and exploration faster and easier for our users …

If you are a user of iTunes, you can easily subscribe to our podcasts by opening iTunes and searching for “Free Law Project” or “oral arguments.” Once you subscribe, the podcasts will download to iTunes wherever you use it.

These podcasts contain all of the oral argument audio for a given court or for a search that you create. This means that as of this moment, you can pipe the audio from the Supreme Court and Federal Circuit Courts directly to your pocket with almost no effort.

To learn more about creating custom podcasts or about the podcasts that we already have, we’ve created a page on our site with all the details. It also has information about how to subscribe using Google Music, Stitcher Radio, and other apps.

We hope you’ll enjoy these podcasts. Who doesn’t want the Supreme Court in their pocket?

After months of development, we are thrilled to share a from-scratch re-launch of the RECAP Archive. Our new archive, available immediately at https://www.courtlistener.com/recap/, contains all of the content currently in RECAP and makes it all fully searchable for the first time. At launch, the collection contains information about more than ten million PACER documents, including the extracted text from more than seven million pages of scanned documents.

The search capabilities of this new system empower researchers in new ways. For example:

It is now straightforward to search for certain types of documents within our archive of PACER documents. This makes it easy to find examples of motions to dismiss, summary judgements, or any other type of document.

By now most readers of this blog know that PACER brings in a lot of money by selling public domain documents at a dime per page. What people might not realize is how these costs can add up for individual researchers or journalists. Looking through our database, we realized that we have quite a few really big cases.

All of the cases below have more than ten thousand entries that we know about.1 There are some names you might recognize:

PACER is the system that the public and various organizations use to access electronic records in the federal district and appeals courts. When PACER is used, it charges for certain activities, like downloading a PDF or making a search query. Raising funds this way was authorized by congress in the E-Government Act to the extent that the revenue paid for running PACER.

In the beginning the revenue from these charges was fairly modest, but the revenue has risen for many years, culminating in revenue of $145M in 2015 (the last year that’s available).

This chart shows the trends in PACER revenue since 1995:

In total, that’s $1.2B that PACER has brought in over 21 years, with an average revenue of $60.7M per year. The average for the last five years is more than twice that —- $135.2M/year.1 These are remarkable numbers and they point to one of two conclusions. Either PACER is creating a surplus —- which is illegal according to the E-Government Act —- or PACER is costing $135M/year to run.

Whichever the case, it’s clear that something has gone terribly wrong. If the justice system is turning a profit selling public domain …

As most readers of this blog know, PACER is a system run by the Administrative Office of the Courts (AO) that hosts over a billion documents from the Federal District and Circuit courts. The system was created in the nineties and was set up with a paywall so that you pay for every “page” of data that you receive. The idea of the fees, as established by the E-Government Act, is that the AO could use them to recoup the cost of running the PACER, but the pricing of the content has always been a bit odd. In my last post I talked about how these fees result in an outrageous cost for PACER data. In this post, I do a deep dive into the core unit of PACER’s pricing and attempt to answer the question, what is a “page” of PACER data?

The size of PACER’s fees has varied over the years, but they’ve always gone up, and they’ve always been assessed roughly as follows:

If you download a PDF from PACER, you pay by the page.

If you do a search, you pay by the number of search results returned. Because you don’t …