Main changes are support for DQV as additional output format for both, metrics as well as FAIR assessment results, a more verbose standard JSON output which now includes information about the origin of harvested metadata (format, source, method) and some newly implemented methods to verify the presence of temporal and spatial coverage metadata which supports the assessment of community specific metrics from the earth and environmental sciences. In detail the following changes may affect future F-UJI test results:
- Support for metric and FAIR assessment result standardised output as DQV, F-UJI API now support output as RDF (ttl, jsonld etc) which return DQV RDF. Default output still is the F-UJI custom JSON.
- Added some ontology and metadata standards namespaces such as FHIR and geoDCAT etc.
- Improved DDI mapping and parsing (e.g. file type and size detection improved for distributions = may improve FsF-R1-01MD)
- Improved ISO GCMG mapping and parsing of file size and type (may improve FsF-R1-01MD)
- Data objects which are offered via services (streaming) now supported for DCA and schema.org and ISO 19xxx see: #513 this included verification sub test (FsF-R1-01MD-2-c) which checks if service endpoint is given and protocol information are specified in metadata
- Added a browser-like user agent to mimic browsers in case web scraping detection methods hinder access (HTTP 405)
- Replaced JMESPath based simple JSON-LD parsing moved to RDF parsing
- Improved schema.org handling e.g. license mapping/parsing now supports CreativeWork licenses in schema.org, may improve FsF-R1.1-01
- Swagger output format JSON is changed so it now also includes the harvested metadata as well as metadata sources and formats (similar to the harvest method)
- Improved RDF handling for complicated graphs, now F-UJI tries to detect the main entity instead of picking Dataset classes from a graph which actually describes something else.
- Added a warning in case the resource type is not indicated or differs from ‘Dataset’ so users may decide if F-UJI is appropriate for the test.
- Improved schema.org and RO-Crate handling: FsF-R1-01MD and FsF-F3-01M now also consider MediaObjects which are indicated as hasPart of a Dataset
- New metadata properties are parsed to support community specific tests (geo, env) : spatial coverage, temporal coverage in DCAT, schema.org, DC, DDI EML ISO etc..
- New tests implemented for env/earth science metrics which verify the presence of spatial or temporal coverage info
- New YAML prototypic file of a first potential env/earth community metric
- Some pseudo namespaces which are included in some lov collections are excluded from lov list since they are identifiers: "orcid.org", "doi.org", "ror.org", "zenodo.org", "isni.org", "github.com", "arxiv.org" which may result in lower scores in FsF-I2-01M
- Due to a parsing bug, sometimes empty property values or null or None values have been stringified to “None” or “null” and scored as valid values. This is no longer regarded as valid value thus, some scores might be lower in 3.4.0.