![A person standing and observing a waterfall of data, with computer screens in the periphery](https://cdn.prod.website-files.com/6479b8d98bf5dcb4a69c4f31/64997d997eebc399bd8cbae2_sortingdata.webp)
Latest Crawl - Archive Location & Download
The latest crawl is:
CC-MAIN-2024-26
To assist with exploring and using the dataset, we provide gzipped files which list all segments, WARC, WAT and WET files.
By simply adding either s3://commoncrawl/ or https://data.commoncrawl.org/ to each line, you end up with the S3 and HTTP paths respectively.
Learn how to Get Started.
By simply adding either s3://commoncrawl/ or https://data.commoncrawl.org/ to each line, you end up with the S3 and HTTP paths respectively.
Learn how to Get Started.