-
To update the group progress
-
To wrap up Ten rules to improve data discoverability
-
To discuss new trends surrounding chatbots in data discovery
-
Introduction of the group (10 minutes)
-
Introduction: Ten rules to improve data discoverability (15 minutes)
-
Discussion and feedback on the ten rules (20 minutes)
-
Presentations on AI/ML to improve data quality and data discoverability
-
Lizhou Fan (UMich) and Sara Lafia (NORC at the University of Chicago): DataChat: Prototyping a Conversational Agent for Dataset Search and Visualization (15 min)
-
Tianying Chen (GESIS) on a ChatGPT discovery study (15 min)
-
Discussion of this new topic (10 min)
-
-
Wrap up and steps forward (5 minutes)
-
Researchers who conduct user studies for understanding more about an individual’s data discovery process
-
Data managers/providers who are responsible in describing data and making data findable
-
Data managers to investigate whether any user studies or evaluation method could be applied to their data repository/catalogue
-
Attendees with some prior preparation or insights for their institutional / personal data discovery approaches would benefit more during the group discussion parts of the session.
The objective of this IG is to provide a forum where representatives from across the spectrum of stakeholders and roles pertaining to data discovery can work together to identify, study and make recommendations concerning issues related to improving data discovery. The goal is to produce concrete deliverables that will be recognised and valued by the research and data communities.
This group was officially endorsed at RDA P9. The group has worked on the following task forces, namely:
-
User studies in data discovery (ongoing)
-
Data/Metadata granularity (started the Data Granularity Working Group)
-
Using schema.org for research dataset discovery (This task force has spun off to the Research Metadata Schemas Working Group, which was endorsed in Sept. 2019. The group is now in maintainance mode).
-
Task forces from the group:
-
Relevancy ranking (completed)
-
Use cases, prototyping tools and test collections (completed)
-
Best practice for making data findable (completed)
-
Metadata enrichment (closed)
-
Data granularity (became a WG, in progress)
-
Publish structured metadata (became the Research Data Schemas WG, completed)
-
User study of data discovery context and search behaviour (in progress)
-
The DDPIG has been established and endorsed as an IG during P9. The group started with four task forces around target data discovery topics soon after P9. All task forces actively explored their topics, and reported progress and outputs at consequent plenaries. At P11, the first three task forces were officially closed, and a discussion on new Task Forces took place, focusing during P12 primarily on Schema.org and Data Granularity. After P13, a case statement for a Research Schemas WG was submitted, the case statement was endorsed in Sept. 2019, just before P14.
The group has delivered the following three supporting outputs:
Slides from previous plenary sessions:
-
October 2023 - RDA hybrid plenary 21:
-
March 2023 - RDA Virtual Plenary 20:
-
June 2022 - RDA Virtual Plenary 19:
-
November 2021 - RDA Virtual Plenary 18:
-
Mapping the road ahead for the data discovery paradigms IG (Group session, slides)
-
-
January 2021 - RDA Virtual Plenary 17:
-
Investigating data discovery across domains (Group session, slides)
-
-
November 2020 - RDA Virtual Plenary 16:
-
What information about data do users desire for discovery? (Group session, slides)
-
-
April 2020 - RDA Virtual Plenary 15:
-
Oct. 2019 (P14) - Data Discovery Paradigms IG: Reports from Task Forces and Way Ahead (slides)
-
April 2019 (P13): Data Discovery Paradigms IG: Reports from Task Forces and Way Ahead (slides)
-
Oct. 2018 - RDA Plenary 12; IG meets, Task Forces report back
-
March 2018 — RDA Plenary 11; IG meets, Task Forces report back
Slides from earlier plenaries are available from the group page.
- 110 reads