The 154 GB NARA Blue Book Archive

Build v0.0.1.5 was broken. So I updated the code to make it compatible with the new Fold3 URLs. You should only have to grab the updated software if
you have found a new case that isn't already in the 154 GB archive.

The software is still alpha and is probably still a bit confusing. So if you haven't located a new case and just want to download the Blue Book
archive. You don't have to do anything different. =) Just continue to use BT Sync.

At least seven other people have fully synced and downloaded all the updates. There are two other machines online at the moment, besides my own, that
have a full copy. A big thank you to John Greenewald, Jr. and everyone else that is seeding for helping out.

Thank You Xtraeme for sharing this ! in the past the blue book archives rolls cover only 1947 to 1954 partim and 1968 69, but the fold3 archive was
more complete but very slow to consult, I have scan and extract and analyse 1947 to 1957 cases one by one (yes a lot of job ! ) , but with this new
archive finish the job would be more easy for me so thank you !

My opinion is that blue books have many non closed cases , many many more than what usaf said at the time, blue book is an important archive .

for the scan problem in my opinion the better first step is to scan the "synthesis file", first image of each container , because some texts
doesn't pass the ocr test

originally posted by: tanka418
1. How many of y'all actually downloaded this file?

At the peak there were 110 people simultaneously connected. There are 38 connected now. So over the last couple weeks probably a couple hundred people
have downloaded bits and pieces.

2. How long did it take?

The pipe is a 50 Mbps FIOS connection. So on average it took people with a 10-15 Mbps connection about two to three days to download all the material.

If you are trying to figure out how long it will take to download for yourself. It is a pretty simple calculation: Total size / megabits per second
(Mbps).

So in other words: 154 GB (or 154,000 MB) / (your connection speed). To convert megabits to megabytes simply divide by 8. Then just divide the result
by 60 seconds / min and 60 mins / hour to calculate total time in hours. Here is a quicker way to do it:

Yep, there were a couple of pages still outstanding. I got a form letter style response from Fold3 about some of the missing pages on their end, but I
don't expect they'll add the pages any time soon. I also wrote a little script to help sort out the cases in
9667997 (8198 pages), 9669100 (1415 pages), and
9669191 (3710 pages). If you'd like, I would be happy to give you write access to SVN so you can add
your scripts to the project. We can divide up the task of running the OCR leg. I have a couple of machines that are just idling. So it would probably
speed things along.

Glad to know you are still pursuing it. FULL credit to you for compiling the data. I emailed you directly, asking what credit you'd like on the
site... but as I mentioned above... figured out a few things, and built a searchable database.

First, they are entirely converted to PDFs, with the exception of a few that contained THOUSANDS of jpegs... it was just too large and not feasible
(for now)... re-aligned and adjusted the images to then run it through the best OCR process I could. I then uploaded the entire collection, and then
built a server site search engine to catalog them all (ie: not using Google which may only search 4% of them!) and it is now available at:

This is amazing! They have the Kenneth Arnold sighting concluding the event to be a mirage. If you look through he archives, you will find some good
ones! It would be nice if the unexplained ones were archived separately since those account for about 5%. S&F

This content community relies on user-generated content from our member contributors. The opinions of our members are not those of site ownership who maintains strict editorial agnosticism and simply provides a collaborative venue for free expression.