status: Recognised & Endorsed
Chair (s): Mingfang Wu, Sarala Wimalaratne, Adam Shepherd, Leyla Jael Castro
Group Email: [group_email]
Secretariat Liaison: [field_secretariat_liaison]
Introduction:
The wide use of schema.org to add structured metadata in web pages for use by commercial search engines has attracted the attention of the data management community as a possible mechanism to leverage the robust commercial search engines like Google, Yahoo, Bing etc. to facilitate discovery and access to scientific data. Various projects have been exploring this approach, including the US NSF EarthCube p418 projectGoogle's Dataset Recommendations, BioSchemas, Force11 DCIP, Research Data Australia, DataCite, Harvard Dataverse, NASA’s Distributed Active Archive Center (DAAC) Infrastructure, EOSCpilot, etc. Since schema.org has largely been driven by commercial business use cases, and a loosely governed process for adding and defining resource type, property and vocabulary for research domain, there are gaps and deficiencies that make its application for research data problematic.
Since P11, the RDA Data Discovery Paradigms IG started the task force "Using schema.org for research data discovery". The group has organised sessions at RDA plenaries and online calls to discuss how we research community come together to embrace the advantages of discovering data via web search engines, meanwhile to address gaps and deficiencies. There is a proposal to form a RDA Working Group with a focused scope and set of well-defined priorities/objectives.
The objectives of this work group are twofold:
- to identify and bridge gaps in existing schemas commonly used for research data, by bringing together communities who are working with such vocabularies to document research data and related resources;
- to provide guidelines for those communities whose needs are not addressed by existing metadata schema such as schema.org, and provide guidelines on proposing extensions.
The planned outputs will include:
-
A generic ‘conceptual data model’ with essential types and properties for research data discovery over the web. The model will be built on bioschemas.org, science-on-schema.org, schema.org, DCAT, DDI-DISCO and SSN schemas from some representative research domains, and data discovery use cases. A research domain can map their schema to the conceptual model when they publish data to the web or exchange metadata between data portals/repositories.
-
A guideline, illustrated with common patterns, of common patterns for publishing metadata landing pages with structured data markups; and a guideline of how to customise the research schemas for target domains with examples.
- Toolings for making the implementation easier if resources are available. This could include collecting and cataloguing tools that generate, validate and parse schema.org & DCAT markup, etc.
Group Output:
- A collection of crosswalks from fifteen research data schemas to Schema.org (RDA supporting output)
- Guidelines for publishing structured metadata on the Web (RDA recommended output)
- Tooling (collecting tools for supporting the structured data publishing process, e.g. tools that generate, validate, crawling search structured data)
- In 2022, some group members co-authored and published this paper: "An analysis of crosswalks from research data schemas to Schema.org"
Group Status:
The WG was in maintenance mode in 2022.
Agendas and notes from previous group meetings are accessiable Here.
Posts
Call for Papers - DaMaLOS 2024 co-located with ESWC
Dear all IG and WG at RDA Apologies for cross-posting. Please find below the Call for Papers for the 4th Workshop on Metadata and Research (objects) Management for Linked Open Science - DaMaLOS 2024 , this year co-located with the Extended Semantic Web Conference ESCW 2024. Drop me a line if you have any questions. Looking forward to some submissions from RDA groups. Kind regards, on behalf of DaMaLOS Organizing Committee ---------------- DaMaLOS 2024 - 4th Workshop on Metadata and Research (objects) Management0 | Add new comment
Call for Contributions - SWAT4HCLS 2024 in Leiden
Dear Semantic Web community, ---apologies for cross-posting--- We would like to warmly invite you to submit your recent work to the 2024 edition of the Semantic Web Applications and Tools for Health Care and Life Science conference (SWAT4HCLS 2024) that will be held in Leiden from the 26th to the 29th of February 2024. SWAT4HCLS will publish work in the form of papers, posters, and demonstrations on original research, data, semantic models, and application experiences. As a guide, please consider the topics of interest.0 | Add new comment
[NFDI4Ing] Industrial data models in theory and practice: The CC41 Community Meeting
Dear RDA members, Modelling complex information and its interrelations is vital for sustainable data management, but achieving a common understanding and language for these models is challenging. Connecting models from different yet similar use cases, like injection molding and hot rolling, presents interesting opportunities. What parameters impact both processes similarly? Are there advantages to considering both models simultaneously? These questions and more can only be answered by examining multiple models built on a shared foundation.0 | Add new comment
A Decade of Data: Learning resources minimal metadata application profile; Wednesday 17 May 2023
Dear all, ‘*Learning resources minimal metadata application profile*’: online, Wednesday 17 May 2023 at 15:00-16:00 UTC. Registration is now open!0 | Add new comment
IASSIST'23 = Diversity in Research: Social Justice from Data
by Paula Lackie
FYI: IASSIST is a wonderful community of helpful data professionals from around the world.0 | Add new comment
IASSIST'23 = Diversity in Research: Social Justice from Data
by Paula Lackie
FYI: IASSIST is a wonderful community of helpful data professionals from around the world.0 | Add new comment
IASSIST'23 = Diversity in Research: Social Justice from Data
by Paula Lackie
FYI: IASSIST is a wonderful community of helpful data professionals from around the world.0 | Add new comment
Reminder: Final group call this Wednesday (19th, 8pm UTC) to wrap up the Research Metadata Schemas WG
by Mingfang Wu
Dear All, A reminder that we have a scheduled meeting on 19th April 8PM UTC (you can check your local time here). We would like to take this opportunity to thank you and to wrap up this WG. The WG started in 2019, and completed its work in 2021. Our outputs include:0 | Add new comment
Second Call for papers - DaMaLOS 2023 co-located with ESWC
Dear all IG and WG at RDA Apologies for cross-posting. Please find below the Call for Papers for the 3rd Workshop on Metadata and Research (objects) Management for Linked Open Science - DaMaLOS 2023 , this year co-located with the Extended Semantic Web Conference ESCW 2023. Drop me a line if you have any questions. Kind regards, On behalf of the organizing committee ---- *DaMaLOS 2023 CfPESWC 2023. Hersonissos, Greece, May 29, 2023Conference0 | Add new comment
Meet during P20 to discuss data interoperability?
by Gavin Chait
Dear Research Metadata Schemas group, I'm Gavin Chait, and I'm involved in a data interoperability project funded via EOSC Futures (https://eoscfuture-grants.eu/meet-the-grantees/implementation-no-code-metho d-schema-schema-data-transformations-interoperability, and https://whyqd.com). I will be participating in the RDA plenary event in Gothenburg and would like to discuss issues around data interoperability,0 | Add new comment