Crawl of outlinks from wikipedia.org started March, 2016. These files are currently not publicly accessible.
Properties of this collection.
It has been several years since the last time we did this.
For this collection, several things were done:
1. Turned off duplicate detection. This collection will be complete, as there is a
good chance we will share the data, and sharing data with pointers to random
other collections, is a complex problem.
2. For the first time, did all the different wikis. The original runs were just against the
enwiki. This one, the seed list was built from all 865 collections.

Monday 19 May: Building Modern Research Corpora: the Evolution of Web Archiving and Analytics (open to public)

Tuesday 20 May: Working and Small Group meetings (members)

Wednesday 21 May: Working and Small Group meetings (members)

Thursday 22 May: Recap of members meetings in the morning and workshops in the afternoon (open to public in the afternoon)

Friday 23 May: Workshops (open to public)

Registration has now closed.If you have any questions about registration, please contact Peter Stirling (BNF).

A sponsorship scheme is being run this year to allow each IIPC member institution to invite a guest researcher to join them in attending the internal IIPC events on the Tuesday, Wednesday and Thursday morning. This sponsorship scheme is intended to introduce guest researchers to the work of the IIPC. It will also bridge the gap between the Open Day on the Monday and the Open Workshops on the Thursday afternoon and Friday. For planning purposes all sponsorships will need to be recorded at registration. Please note that this sponsorship does not carry with it any financial support.

To keep in touch, before and during the event, check the Twitter hashtag: #iipcGA14