Is the aim of the Vocabulary Hub or "vocabulary management component" to actually enable the creation, administration and distribution of "vocabularies" (or ontologies) that would serve as a basis for the description of the actual data in a data product?
In the context of data spaces it seems as if "data descriptions" are only concerned with DCAT-AP types of metadata about the data product, whereas the actual data description - like an actual JSON or JSON-LD schema about the data product itself - is always missing. "Data" is also often referred to as only some massive dumps of data that somehow magically just explains itself?
If you look at how the UN/CEFACT is developing the description of "business documents" that in my mind are "data products" just like any other data to be exchanged or shared, they are focusing on creating JSON-LD vocabularies (https://vocabulary.uncefact.org/) that describe in an pretty understandable form the actual content of the information in a specific business document that is building it's semantics on the common vocabulary.
In the context of Gaia-X, IDSA or DSSC I've not encountered anything similar. Regarding Gaia-X, when asked, Pierre Grosselier stated a couple of years that I'm talking about "domain specific matters" that Gaia-X is not occupied with. Then again in the DSSC Blueprint, there's lots of stuff now in the Technical building blocks part about data modeling with the W3C Semantic Web (Linked Data) stack - but is the outcome of all the use of ontologies and vocabularies only the production of semantically interoperable METAdata descriptions about a data product, not the actual data itself?
I'm starting to get really lost :-) at least in the context of data sharing in the form of W3C VC 2.0 conformant verifiable credentials the use of "semantics" is pretty clear: you create an ontology to describe the concepts that describe your data, then use the ontology to create actual data models of the data to be shared and then turn these into physical data sharing artifacts in the Verifiable Credential format. We've done that already in the eIDAS 2.0 digital wallet large scale pilot EWC, focusing on data stemming from national business registries (company certificate, signatory rights, beneficial owners, power of attorney, tax related information etc.)
Would assume that this approach would fit at least a certain type of data spaces, including for instance the Health Data in the present SIMPL context?
Please log in or sign up to comment.