2. Ontology

3. Datasets

The following table lists all datasets produced by the extraction framework for every Wikipedia language edition with more than 10,000 articles.
Select the languages you are interested in at the top of the table, then filter the list of datasets with the search function.
Click on the dataset names to obtain additional information. Click on the question mark next to a download link to preview file contents.

Starting with this release we provide all datasets in two serializations:

6. Dataset Metadata as DataIDs

Starting with release 2016-04, we provide extensive dataset metadata by adding DataIDs for all extracted languages to the respective language directories. Use these files to gather additional information about the datasets and the files that represent them.
A dcat:Catalog file (ttl, json-ld) pointing to all DataIDs (via dcat:record) can be found in the root folder of this release.
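
As an illustration of how the catalog points to the individual DataIDs, the following minimal sketch collects the targets of dcat:record from a catalog given as JSON-LD. It assumes the document has already been parsed into expanded JSON-LD form (a list of node objects with full IRIs as keys); the actual catalog file may be serialized differently and might need a JSON-LD processor first. The function name record_iris, the SAMPLE data, and all example.org IRIs are hypothetical placeholders, not real release paths.

```python
import json

# Namespace of the W3C Data Catalog Vocabulary (DCAT).
DCAT = "http://www.w3.org/ns/dcat#"


def record_iris(expanded_jsonld):
    """Return the IRIs linked via dcat:record from every dcat:Catalog node.

    Assumes expanded JSON-LD: a list of node objects in which "@type" is a
    list of IRIs and each property value is a list of objects.
    """
    iris = []
    for node in expanded_jsonld:
        if DCAT + "Catalog" in node.get("@type", []):
            for target in node.get(DCAT + "record", []):
                if "@id" in target:
                    iris.append(target["@id"])
    return iris


# Hypothetical catalog content; the example.org IRIs are placeholders only.
SAMPLE = json.loads("""
[
  {
    "@id": "http://example.org/release/catalog",
    "@type": ["http://www.w3.org/ns/dcat#Catalog"],
    "http://www.w3.org/ns/dcat#record": [
      {"@id": "http://example.org/release/en/dataid_en.ttl"},
      {"@id": "http://example.org/release/de/dataid_de.ttl"}
    ]
  }
]
""")

if __name__ == "__main__":
    for iri in record_iris(SAMPLE):
        print(iri)
```

Each IRI returned this way would then be fetched and inspected as the DataID of one language directory. Only the standard library is used here to keep the sketch self-contained; a full RDF library would be the more robust choice for the ttl serialization.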