Erratum
Missing WARC File
One WARC file and its corresponding WET file are missing from the June 2017 crawl (CC-MAIN-2017-26). The corresponding WAT file is present, as are the URL index entries for the records contained in the missing WARC file. For more details, see the release announcement in the Common Crawl Google Group.
The following two files are missing:
crawl-data/CC-MAIN-2017-26/segments/1498128320063.74/warc/CC-MAIN-20170623133357-20170623153357-00225.warc.gz
crawl-data/CC-MAIN-2017-26/segments/1498128320063.74/wet/CC-MAIN-20170623133357-20170623153357-00225.warc.wet.gz
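Because the URL index still lists records from the missing WARC file, a job that walks a list of file paths may try to fetch one of the two files above and fail. The following is a minimal sketch of one way to guard against that, assuming your pipeline works from an iterable of relative paths in the same form as shown above. The names `MISSING_PATHS` and `drop_missing` are illustrative and not part of any Common Crawl tooling.

```python
# Relative paths listed as missing in this erratum for CC-MAIN-2017-26.
MISSING_PATHS = frozenset({
    "crawl-data/CC-MAIN-2017-26/segments/1498128320063.74/warc/"
    "CC-MAIN-20170623133357-20170623153357-00225.warc.gz",
    "crawl-data/CC-MAIN-2017-26/segments/1498128320063.74/wet/"
    "CC-MAIN-20170623133357-20170623153357-00225.warc.wet.gz",
})


def drop_missing(paths):
    """Return the non-empty paths that are not known to be missing.

    Hypothetical helper: strips surrounding whitespace from each entry,
    skips blank lines, and preserves the input order.
    """
    kept = []
    for line in paths:
        path = line.strip()
        if path and path not in MISSING_PATHS:
            kept.append(path)
    return kept
```

The filter can be applied to any path listing before download, for example to the lines of a text file of paths read with `open(...)`. The matching WAT path is deliberately not filtered, since that file is present.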
Affected Crawls
Affected Web Graphs
No items found.