2.1.1 - Attributes of a self-description for a dataset
Description
Simpl shall provide a data catalogue in which the self-description of each dataset includes at least the following information:
• Unique identifier
• Title
• Description
• Location of the dataset (e.g. URL, handle)
• Version
• Creation date
• Last update date
• License
• Price (free or at a cost)
• Data Provider
• Type of Data in the dataset
• Keywords
• Contact point (who to contact in case of questions/issues)
• Provenance
• Language (of the metadata, such as the title and description)
• Access policy (to define who can access the dataset)
• Usage policy (to define how a dataset can be used)
• Format under which the data is distributed (e.g. csv, xml, …)
• Schema of the dataset, which depends on the type of data; for JSON, it would be a JSON Schema description stating which fields the data has and their types.
• Related datasets
• Target users
• Data Quality (to include metrics such as completeness, accuracy, timeliness and others)
• Encryption: Describes the encryption algorithms and keys used to secure the data.
• Anonymization/pseudonymization: Indicates whether sensitive information has been anonymized or pseudonymized to protect privacy.
• Compliance: Indicates compliance with relevant data protection regulations and standards.
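As an illustration only, the attribute list above could be checked against a draft self-description as sketched below. The requirement does not prescribe field names, a serialization format or a validation mechanism, so every identifier here (REQUIRED_ATTRIBUTES, missing_attributes, the keys of the record) and every value in the sample record is a hypothetical placeholder, not part of Simpl.

```python
# Hypothetical field names: one per attribute in the requirement's list.
# The requirement itself does not define these identifiers.
REQUIRED_ATTRIBUTES = [
    "identifier", "title", "description", "location", "version",
    "created", "modified", "license", "price", "data_provider",
    "data_type", "keywords", "contact_point", "provenance", "language",
    "access_policy", "usage_policy", "format", "schema",
    "related_datasets", "target_users", "data_quality", "encryption",
    "anonymization", "compliance",
]


def missing_attributes(record: dict) -> list:
    """Return the required attribute names absent from a self-description."""
    return [name for name in REQUIRED_ATTRIBUTES if name not in record]


# A deliberately incomplete draft record with placeholder values.
draft = {
    "identifier": "urn:example:dataset:001",
    "title": "Example dataset",
    "description": "Placeholder description of an example dataset.",
    "location": "https://example.org/datasets/001",
    "version": "1.0.0",
    "format": "json",
    # For JSON data, the schema attribute could carry a JSON Schema
    # description of the fields and their types.
    "schema": {
        "type": "object",
        "properties": {
            "station_id": {"type": "string"},
            "temperature": {"type": "number"},
        },
        "required": ["station_id"],
    },
}

if __name__ == "__main__":
    # Lists the attributes the draft still has to provide.
    for name in missing_attributes(draft):
        print("missing:", name)
```

A check of this kind only confirms that each attribute is present; it says nothing about whether the values themselves are valid, which would need per-attribute rules that the requirement does not yet define.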
L2 - Detailed Requirement | Issue ID: SIMPL-1728 | Status: Proposed |