Erratum

Nodes in Domain-Level Webgraphs Not Sorted and May Include Duplicates

Originally reported by covuworie.

The nodes in domain-level Web Graphs may not be properly sorted lexicographically by node label (reversed domain name). It is also possible that a few nodes are duplicated, that is, two nodes share the same label. For more details, see the Issue Report in the cc-webgraph repository.
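A minimal sketch of how one might check a vertex listing for both problems. It assumes each line is tab-separated with the node id in the first column and the reversed domain name in the second; that layout, the function name `check_vertices`, and the sample labels are assumptions made for illustration and are not taken from this erratum. Python compares strings by code point, which is assumed here to match the intended lexicographic order.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def check_vertices(lines: Iterable[str]) -> Tuple[List[Tuple[str, str]], List[str]]:
    """Return (out_of_order_pairs, duplicated_labels) for a vertex listing.

    Assumed line layout (hypothetical): "<id>\t<reversed domain>[\t...]".
    """
    out_of_order: List[Tuple[str, str]] = []
    counts: Counter = Counter()
    previous = None
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        label = line.split("\t")[1]
        counts[label] += 1
        # A label that sorts before its predecessor breaks the ordering.
        if previous is not None and label < previous:
            out_of_order.append((previous, label))
        previous = label
    # Count all labels, because duplicates need not be adjacent
    # when the listing is not sorted.
    duplicates = sorted(label for label, n in counts.items() if n > 1)
    return out_of_order, duplicates


# Hypothetical sample data, not real graph content.
sample = [
    "0\tcom.example",
    "1\torg.example",
    "2\tcom.example",
    "3\tnet.example",
]
unsorted_pairs, duplicated = check_vertices(sample)
print(unsorted_pairs)  # [('org.example', 'com.example')]
print(duplicated)      # ['com.example']
```

Counting every label keeps the sketch simple but holds all labels in memory; for a full-size graph, sorting the file externally first and then comparing adjacent lines would be a more economical way to find duplicates.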

The issue affects all domain-level Web Graphs released before the May, June/July, August 2022 Web Graph (cc-main-2022-may-jun-aug-domain). It has been fixed in that release and in the Web Graph releases that follow it.

Affected Crawls
No items found.
Affected Web Graphs