Pre-crawl Scoping

Make sure your seeds are set up correctly before you start your crawls. In this video you'll learn tips for selecting, formatting, and administering your seed URLs before you run a crawl to help capture the data you're looking for.

Test Crawls

Adding new seeds or scoping rules? You'll want to make sure you run a test crawl! This video will give you tips on how and why to run test crawls in your collections.

PDF Only Crawls

Learn how to run a PDF Only crawl and how to access the archived PDFs

Post-crawl Analysis

Getting the most from your post crawl reports

So you've run a crawl, now what? This video walks through each report to provide detail on why each post-crawl report is necessary, and the information you can glean from them.

Understanding your Hosts Report

Don't be overwhelmed by the information in your Hosts report! Find out all of the different ways you can use it to identify crawler traps, block hosts, add data limits, run patch crawls, and more!

Quality Assurance

What can you do if your archived websites don't look quite right in Wayback? Following these steps may help you improve the capture and replay of your Wayback pages.

Advanced Training Webinars

These recordings of our advanced training webinars are recommended for users who are familiar and comfortable with the content outlined in our “Getting Started” and “Post Crawl Analysis” video curricula.

Advanced Scoping

This live session on advanced crawl scoping tools and techniques will empower you with a toolbox of tips and tricks for you to use as you crawl. Recorded August 28, 2018.

Archiving Video Content

This webinar explains how general archiving workflows apply to video content. A look at both capture and replay, as related to YouTube, Vimeo, and streaming video platforms, followed by a live review of results from a YouTube crawl. Recorded February 7, 2017.

Archiving Social Media

In this webinar, the focus is on Facebook, Twitter, Instagram and YouTube. We also cover how to scope embedded social content and quality assurance strategies relevant to social media sites. Recorded November 14, 2017.

Web Archiving Quality Assurance

This recording takes an in-depth look at quality assurance strategies that will strengthen your ability to assess and improve your crawl results. Recorded August 8, 2017.

Access to Archive-It Collections

This webinar reviews different strategies used by partners for providing and boosting access to their Archive-It collections. Recorded May 2, 2017.

Under the Hood: Tips & Tools

This webinar takes a look at the tips and tools the Archive-It team uses most in their own web archiving and quality assurance workflows. Recorded February 13, 2018.

Under the Hood Section

Time

Collections and Seeds

02:00

CDX

07:05

Browser Tips

19:10

Wayback QA

23:35

Search

30:20

Describing Web Archives

This webinar takes a look at some ideas and methods for descriptive metadata practice and features Archive-It partners and peers. Recorded May 22, 2018. Presentation materials and further discussion about this topic may be found here in the Archive-It Community Forum.

Intro to Brozzler

This webinar describes and demonstrates the new browser-based web capture technology available to Archive-It partners. Recorded July 10, 2018.

WARC Tools for Management and Preservation

This webinar takes a look at the tools some Archive-It partners use in their own web archiving workflows for WARC management and preservation. Recorded November 20, 2018.