Checking Folders

The Collection Level folder contains subfolders whose content must adhere to certain specifications before the collection is considered ready to "ship" for online access and long-term storage.

Skipped items list - .skipped.txt extension. For batched collections: this should be present ONLY during the last upload, as it will contain information about skipped items across the entire collection.

u0003_0000193.skipped.txt

NOTE: The archiving script does not yet know what to do with this optional file. Move it by hand to the archive during that process.

u0001_2007010.match.txt (This file provides a match between photo IDs and assigned IDs so content can be linked in the right place in the EAD, and found by users)

Other relevant documents saved as plain .txt (ANSI or UTF-8 without BOM preferred). If possible, please incorporate any additional data into the log.txt file. For example, audio collections often have significant item-level notes that we want to retain. Additional notes can be saved as plain text files with a ".notes.txt" extension:

u0008_0000001_0000001.notes.txt
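
The suffix conventions above lend themselves to a quick automated check. The following is only a minimal sketch, assuming the three suffixes named in this section are the ones of interest; the function name classify_text_file and the label strings are hypothetical and are not part of any existing script.

```python
# Suffixes taken from the examples in this section; the labels are
# illustrative descriptions, not official terminology.
KNOWN_SUFFIXES = {
    ".skipped.txt": "skipped items list",
    ".match.txt": "photo ID to assigned ID match",
    ".notes.txt": "additional notes",
}


def classify_text_file(filename):
    """Return a label for a recognized text file, or None if the suffix is unknown."""
    lower = filename.lower()
    for suffix, label in KNOWN_SUFFIXES.items():
        if lower.endswith(suffix):
            return label
    return None
```

For example, classify_text_file("u0003_0000193.skipped.txt") returns "skipped items list", while a file such as log.txt returns None and would need to be reviewed by hand.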

Metadata

MUST contain:

Excel metadata spreadsheet

u0003_0000001.m01.xlsx or u0002_0000001.m03.xlsx or u0008_0000001.m02.xlsx

Note that the type of spreadsheet is echoed in the segment before the ".xlsx" extension. If this is a batch file, the batch number precedes the m0x value, for example: u0002_0000001.1.m01.xlsx.
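
The naming rule for the metadata spreadsheet can be expressed as a pattern. This is a rough sketch only: the pattern is inferred from the example names in this section (a "u" plus four digits, an underscore, seven digits, an optional batch number, and an m0x type), and the name parse_metadata_name is hypothetical. Actual collection IDs may follow rules not captured here.

```python
import re

# Pattern inferred from the example filenames only; an assumption,
# not an official specification of the naming scheme.
METADATA_NAME = re.compile(
    r"^(?P<collection>u\d{4}_\d{7})"  # e.g. u0002_0000001
    r"(?:\.(?P<batch>\d+))?"          # optional batch number, e.g. .1
    r"\.(?P<sheet_type>m0\d)"         # spreadsheet type, e.g. m01
    r"\.xlsx$"
)


def parse_metadata_name(filename):
    """Return the parts of a metadata spreadsheet name, or None if it does not match."""
    match = METADATA_NAME.match(filename)
    if match is None:
        return None
    return match.groupdict()
```

With this sketch, "u0002_0000001.1.m01.xlsx" yields a batch value of "1" and a sheet type of "m01", and a non-batch name such as "u0003_0000001.m01.xlsx" yields a batch value of None.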

With FITS files created by script (NOTE: Audio fits2aes puts the FITS and AES files on the server)

Scans

MUST contain ONLY:

Scans (TIFFs/WAVs) of non-compound objects and compound objects (inside their respective subfolders). All other file types will not be retained. Temporary files and thumbs.db files do not have to be deleted, since they will be removed upon transfer to Storage.
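
Before shipping, it may help to list anything in the Scans folder that will not be retained. The sketch below is one possible check, not an existing tool. It assumes that scans use the extensions .tif, .tiff, or .wav and that "temporary files" means names ending in .tmp; both are assumptions, and the function name unexpected_files is hypothetical.

```python
from pathlib import Path

# Assumed extensions for retained scans (TIFF images and WAV audio).
ALLOWED_EXTENSIONS = {".tif", ".tiff", ".wav"}


def is_ignorable(name):
    """True for files that are removed on transfer anyway (assumed: thumbs.db, *.tmp)."""
    lower = name.lower()
    return lower == "thumbs.db" or lower.endswith(".tmp")


def unexpected_files(scans_dir):
    """Return relative paths of files that are neither scans nor ignorable."""
    root = Path(scans_dir)
    found = []
    # rglob walks into the compound-object subfolders as well.
    for path in sorted(root.rglob("*")):
        if not path.is_file() or is_ignorable(path.name):
            continue
        if path.suffix.lower() not in ALLOWED_EXTENSIONS:
            found.append(path.relative_to(root).as_posix())
    return found
```

An empty list means that every remaining file is a scan. Anything returned should be moved elsewhere if it needs to be kept, since it will not be retained.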