status: Recognised & Endorsed
Chair(s): Jens Klump, Lesley Wyborn, Mingfang Wu, Kirsten Elger
The Data Versioning WG has transitioned to the Data Versioning IG as of July 2021. The email address and group space have remained the same.
Demand for reproducible research results and for data reuse is growing, so it will become increasingly important for researchers to be able to cite the exact version of the dataset that underpins a research publication. The capacity of computational hardware infrastructure has grown, which has encouraged the development of concatenated, seamless datasets from which users can select subsets through web services, based on spatial and temporal queries or other data attributes. Further, the growth in computing power means that higher-level data products can be generated in very short time frames. We therefore need a systematic way to refer to the exact version of a dataset or data product that was used to underpin research findings or to generate higher-level data products, including who developed and who funded it.
Versioning procedures and best practices are well established for scientific software and can be used to enable reproducibility of scientific results. The codebase of a very large software project bears some resemblance to a large dynamic dataset. Are these practices suitable for datasets, or do we need different practices for data versioning? The need for unambiguous references to specific datasets was recognised by the RDA Working Group on Data Citation, whose final report recognises the need for systematic data versioning practices.
This gap was discussed at a BoF meeting held at the RDA Plenary in September 2016 in Denver, resulting in the formation in 2017 of an RDA Interest Group on Data Versioning. A review of the recommendations by this RDA Data Versioning IG concluded that systematic data versioning practices were not available. In 2018 the Working Group was formed and first met at P12 in Gaborone. It focused on assessing current practices and compiled 39 use cases of data versioning across 33 organisations globally. In January 2020, the WG produced a white paper documenting these use cases and recommended practices (Klump et al., 2020). The WG delineated six high-level principles, which provide a framework for guiding the consistent practice of data versioning and can also serve as guidance for data centres or data providers when setting up their own data revision and version protocols and procedures (Klump et al., 2021). To further the adoption of these outcomes, the proposed new interest group plans to contribute the use cases and recommended data versioning practices to other groups in RDA, W3C, and other emerging activities in this field.
Please read the group's charter for more information.
The BoF initially emerged at Plenary 8 in Denver through the discussion available here: https://www.rd-alliance.org/data-versioning-rda-8th-plenary-bof-meeting.
Posts
RDA VP22 - Notification of Acceptance
Dear Chairs of the Data Versioning IG, Your RDA VP22 session application titled Translating the Data Versioning Principles into Machine Actionable Recommendations has been approved and accepted. Congratulations! Key dates to keep in mind:
* Breakout sessions draft programme will be published here on Friday, 15 March
* Requests for changes to the programme breakout times must be sent to ***@***.***-foundation.org by Friday, 29 March
* VP22 programme will be deemed final by Monday, 1 April
Call for speakers at the data versioning IG plenary session RDA P21
by Mingfang Wu
Dear Group Members, We would like to bring to your attention that the group is going to have a plenary session at the upcoming IDW. The session is titled: Revising the versioning principles: The road to actionable recommendations. We invite you to join the group session and present your related work. Here is a bit of context: The group has delivered this supporting output: *Versioning data is about more than revisions: A Conceptual framework and proposed principles*
Re-established Reproducibility Interest Group -- endorsed by RDA Council
by Limor Peer
***Apologies for cross-posting*** We are pleased to announce that the Reproducibility Interest Group was endorsed by the RDA Council in July 2023. The Reproducibility IG seeks to advance and enable reproducibility in research based on or producing datasets that require code. This IG follows the broad definition of reproducibility stated by Turing Way in order to provide an inclusive framework for discussions.
RDA P21 @ IDW 2023 - Notification of Conditional Acceptance
Dear Chairs of the Data Versioning IG, Your RDA P21 session application titled Revising the Versioning Principles: The Road to Actionable Recommendations has been conditionally approved, subject to you incorporating the following changes provided in the feedback below: The agenda should include more detail about the presenters and information on the new use cases. The Technical Advisory Board (TAB) recommends reaching out to RDA working groups and interest groups which were identified at the 20th Plenary to be of interest to the group.
Invitation to participate in 'A Decade of Data: 10 Years of the RDA' events and activities
by Connie Clare
Good day, The RDA Secretariat would like to invite the Data Versioning IG to participate in ‘A Decade of Data: Celebrating 10 Years of the Research Data Alliance’. 10 months to celebrate 10 years of the RDA
Materials for RDA P19 Session Roadmap to develop actionable guidelines from the data versioning principles
by Jens Klump
Dear Members of the RDA Data Versioning IG, Our session at IDW/RDA P19 is coming up soon (22 June 2022, 00.00 UTC). https://www.rd-alliance.org/plenaries/rda-19th-plenary-meeting-part-inte...
RDA Plenary 19 Draft Programme Now Available
Dear Group Chairs, The RDA 19th Plenary draft programme is now available: https://www.rd-alliance.org/rdas-19th-plenary-programme-0. Please note that the Plenary programme is a part of the International Data Week programme that can be accessed at https://www.rd-alliance.org/international-data-week-2022-programme.
Subject: RDA Plenary 19 as a part of IDW 2022 - Notification of Acceptance
Dear Chairs of the Data Versioning IG, The review of your session application for the RDA Plenary 19 (P19) is complete and your application titled Roadmap to develop actionable guidelines from the data versioning principles has been accepted. Please consider this your official notification of acceptance. Congratulations!
P19 session proposals submissions
by Irina Hope
Dear group members, Thank you for submitting your session proposal for Plenary 19 titled ‘Roadmap to develop actionable guidelines from the data versioning principles’. A review of all submitted proposals is now underway. Notifications of session acceptance will be sent out by Thursday, 24th March. Please contact the RDA Secretariat at enquiries@rd-alliance.org with any questions or concerns you may have regarding your submission.
Webinar designed for RDA Group participants
Hello - My apologies for cross-posting. We wanted to be sure RDA group Chairs and group members received this information about an upcoming webinar specifically designed to provide tips and strategies for promoting project outputs. This webinar is designed for RDA Working and Interest Group Members, but anyone interested in this topic is also invited and welcome to attend. Please register for and attend the "How To Get Attention for Research Project Outputs" webinar on 21 Oct 2021 - 15:00 UTC, presented by Jennifer Gibson. Register here.