Validation

Projects and communities

In general, data validation is the process of ensuring that data are clean, correct and useful. Eurostat performs data validation by checking whether the data meet certain basic criteria that serve to assess their plausibility.
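
As an illustration of checking data against basic plausibility criteria, the following minimal sketch applies a small set of rules to a single record and reports which rules fail. The field names (age, country, employed), the rule set and the thresholds are hypothetical and chosen only for the example; they do not describe the rules Eurostat actually uses.

```python
from dataclasses import dataclass
from typing import Callable

Record = dict  # one observation, e.g. {"age": 34, "country": "NL", "employed": True}


@dataclass(frozen=True)
class Rule:
    """A named plausibility check that returns True when the record passes."""
    name: str
    check: Callable[[Record], bool]


# Hypothetical rules: illustrative only, not an official rule set.
KNOWN_COUNTRIES = {"AT", "BE", "DE", "FR", "NL"}

RULES = [
    # Range check on a single variable.
    Rule("age_in_range",
         lambda r: isinstance(r.get("age"), int) and 0 <= r["age"] <= 120),
    # Code-list check on a single variable.
    Rule("country_code_known",
         lambda r: r.get("country") in KNOWN_COUNTRIES),
    # Consistency check across two variables.
    Rule("minors_not_employed",
         lambda r: not (isinstance(r.get("age"), int)
                        and r["age"] < 15
                        and r.get("employed") is True)),
]


def validate(record: Record) -> list:
    """Return the names of all rules the record violates, in rule order."""
    return [rule.name for rule in RULES if not rule.check(record)]


if __name__ == "__main__":
    print(validate({"age": 34, "country": "NL", "employed": True}))
    print(validate({"age": 12, "country": "XX", "employed": True}))
```

Keeping each rule as a named, independent check makes it possible to report exactly which criteria a record fails, rather than only whether it passed overall.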

The ESSnet ValiDat Integration examines different ways to implement a common infrastructure for data validation in the European Statistical System (ESS). Our work includes theoretical groundwork, data structures and languages, and their integration into the data validation subprocess of statistical production. We also look at the architecture and interoperability of distributed data validation in the ESS.

When data sets are linked at the individual level, for instance survey data with administrative data, unique linkage keys are often not available. In that case, probabilistic linkage may be used. With probabilistic linkage, linkage errors will occur, and these errors may have an impact on subsequent statistical analysis.
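
The idea behind probabilistic linkage can be sketched with one common formulation, the Fellegi-Sunter model, which the text above does not name and which is used here only as an assumption. Each compared field contributes a positive weight when two records agree and a negative weight when they disagree, and the summed weight is compared against two thresholds. The field names, the m- and u-probabilities and the thresholds below are hypothetical values chosen for illustration.

```python
import math

# Hypothetical parameters per field: (m, u), where
#   m = probability that the field agrees for a true match,
#   u = probability that the field agrees for a true non-match.
FIELD_PARAMS = {
    "surname": (0.95, 0.01),
    "birth_year": (0.98, 0.05),
    "postcode": (0.90, 0.02),
}


def match_weight(a: dict, b: dict) -> float:
    """Sum the log2 agreement or disagreement weights over all fields."""
    total = 0.0
    for field, (m, u) in FIELD_PARAMS.items():
        if a.get(field) == b.get(field):
            total += math.log2(m / u)              # agreement raises the score
        else:
            total += math.log2((1 - m) / (1 - u))  # disagreement lowers it
    return total


def classify(weight: float, upper: float = 8.0, lower: float = 0.0) -> str:
    """Classify a record pair using two hypothetical thresholds."""
    if weight >= upper:
        return "link"
    if weight <= lower:
        return "non-link"
    return "possible link"


if __name__ == "__main__":
    survey = {"surname": "Jansen", "birth_year": 1980, "postcode": "1012"}
    admin = {"surname": "Jansen", "birth_year": 1981, "postcode": "1012"}
    w = match_weight(survey, admin)
    print(round(w, 2), classify(w))
```

Because the decision rests on thresholds over an uncertain score, some true matches will be missed and some non-matches will be linked. These are the linkage errors that can affect subsequent analysis.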

What is the CROS Portal?

The CROS Portal is a content management system based on Drupal. Its name stands for "Portal on Collaboration in Research and Methodology for Official Statistics". The CROS Portal is dedicated to collaboration between researchers and Official Statisticians in Europe and beyond. It provides a working space and tools for dissemination and information exchange for statistical projects and methodological topics. Services provided include hosting of statistical communities, repositories of useful documents, research results and project deliverables, and discussion fora on topics such as future research needs in Official Statistics.