This website uses cookies to ensure you get the best experience. By clicking or navigating the site you agree to allow our collection of information through cookies. Check our Privacy policy.

Data Quality Committee

Quality is a key priority for our whole community! The Data Quality Committee works to address key data quality issues over time.

Posted on Tuesday August 1, 2017

Updated on Monday February 17, 2025

Formally defined as a Europeana Network and EuropeanaTech Working Group, the Data Quality Committee is a standing committee that works on the various facets of the data quality challenge over time with a particular focus on reuse and discovery of cultural heritage objects. We believe it is crucial to tackle data quality issues at every level of the data exchange chain from its creation to its publication. We have gathered together experts from various backgrounds (metadata experts, software developers, search and retrieval experts) to help us capture all the issues.

Data Quality Committee Mission Statement

Work areas as well as specific tasks will be defined and prioritized as the Committee sees fits and regularly reported and submitted for ratification to the community, notably the Europeana Aggregators' Forum. Items such as mandatory elements for ingestion of EDM data, data checking and normalisation, data completeness have been already added on the menu. The Committee's mission statement is available here.

Discovery - User scenarios and their metadata requirements

We defined our main requirements in terms of discovery and information-retrieval requirements. A series of usage scenarios have been created reflecting information-access user needs (based on the Europeana user personas), listing current metadata issues for a given scenario and then proposing future actions. These scenarios focus specifically on metadata and are not tackling any challenges regarding the user interface or the user experience in Europeana Collections.

Europeana Publishing Framework - Metadata Quality Tiers

The Data Quality Committee has developed, together with aggregators, the metadata component of the Europeana Publishing Framework, based on three quality "pillars". The first one is based on the languages in which metadata records are provided, following Europeana's multilingual strategy. The second follows the recommendations of the Discovery - User Scenarios and relies on "enabling metadata elements". These enabling elements are desirable but optional - including them supports functionalities for a specific set of usage scenarios, beyond the fundamental minimum for all metadata descriptions defined by mandatory elements. The third pillar is based on the use of "contextual classes" that provide richer semantics to metadata. More information can be found here.

Representing events in EDM

Metadata on events can bring important information about the history of cultural heritage objects, especially when multiple events (e.g. creation, modification, destruction) can help to distinguish between different contexts. We have written a report that looks at existing events metadata from the community and provides recommendations about supporting events in EDM. This report can be of interest to all Europeana data partners that hold events data, but it may also benefit other non-Europeana aggregation initiatives. You can download the report and the associated documentation through the folder below. 

Folder

Representing events in EDM - documentation

Problem patterns

Metadata problems can occur in many records. We're actively identifying patterns that reflect metadata quality issues such as redundant or incomplete information, lack of normalization and misuse of metadata fields. A work in progress list of these metadata problem patterns can be found here.

Multilingual saturation score

The multilingual saturation score is a score for multilinguality which can be applied on statement, property or record level. We defined a simplified schema which is the basis for the measurement assuming that each statement in a property can have one of the following values: a literal, a literal with a language tag, a URI (ideally to a controlled vocabulary). Learn more in this presentation. Note that the implementation work described in this presentation is still ongoing and is subject to changes

Presentations and publications

Folder

Documents

Chairs

Filter by
NameCompanyCountryDomain
Antoine IsaacCompany: Europeana FoundationCountry: Netherlands, NLSector: Cultural heritageCommunity:
Eleftheria TsoupraCompany: Europeana FoundationCountry: Netherlands, NLSector: Cultural heritageCommunity: Research, Education

Members

  • Fiona Fieldsend, DigitalNZ
  • Dulce Fontes, Biblioteca Nacional de Portugal

and

  • Fiona Fieldsend, DigitalNZ

This page was updated in March 2021 to add the sections 'Europeana Publishing Framework - Metadata Quality Tiers', 'Representing events in EDM' and 'Problem patterns'  in 2023 and 2025 to reflect updates in membership and work carried out.

top