Recommendation on PID Kernel Information

13 Dec 2018

By Stefanie Kethers


 PID Kernel Information WG
Group co-chairs: Tobias Weigel, Beth Plale

Recommendation Title: RDA Recommendation on PID Kernel Information

Authors: Tobias Weigel, Beth Plale, Mark Parsons, Gabriel Zhou, Yu Luo, Ulrich Schwardmann, Robert Quick, Margareta Hellström, Kei Kurakawa

Impact: A set of guiding principles, architectural considerations, use cases and a fundamental metadata schema to manage information in Persistent Identifier records for scalable middleware infrastructure and automated processes.

Recommendation package DOI: 10.15497/rda00031

Citation: Weigel, T., Plale, B., Parsons, M., Zhou, G., Luo, Y., Schwardmann, U., Quick, R., Hellström, M., Kurakawa, K. (2018). RDA Recommendation on PID Kernel Information (Version 1). DOI: 10.15497/RDA00031.

 

Abstract

Global middleware infrastructure is insufficient for robust data identification, discovery, and use. While infrastructure is emerging within sub-ecosystems, such as the DOI ecosystem of services for data and literature objects (e.g., DataCite, CHORUS, CrossRef), the layers of abstraction that have made the Internet so easy to build on are generally lacking for data, especially for automated (machine) services. The goal of the PID Kernel Information recommendation is to advance a small change to middleware infrastructure by injecting a tiny amount of carefully selected metadata into a Persistent ID (PID) record. This carefully chosen and placed information has the potential to stimulate development of an entire ecosystem of third-party services that can process the billions of expected PIDs, and do so with more information about an object at hand than just a unique ID, without the need for costly link following.

The key challenge for the PID Kernel Information working group was to determine which of the thousands of relevant metadata elements are suitable for embedding in the PID record. This recommendation lays out principles to guide the identification of information suitable for inclusion in the PID record.

The information contained in a PID record is represented by a PID Kernel Information profile, which must be publicly and globally available. For PID Kernel Information to be effective in stimulating an ecosystem of data services, the number of different profiles of PID Kernel Information must be small and their content stable. The recommendation includes a draft profile with illustrative examples and cases for adoption in practice.
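To make the relationship between a PID record and its profile more concrete, the following is a minimal Python sketch of a record being checked against a profile. It is not the draft profile from the Recommendation: the class `KernelProfile`, the attribute names, and the identifiers in the example are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KernelProfile:
    """A hypothetical, simplified stand-in for a PID Kernel Information profile."""

    profile_pid: str                 # identifier under which the profile is published
    required: frozenset              # attribute names every record must carry
    optional: frozenset = frozenset()  # attribute names a record may carry

    def problems(self, record: dict) -> list:
        """Return human-readable problems; an empty list means the record conforms."""
        keys = set(record)
        missing = sorted(self.required - keys)
        unknown = sorted(keys - self.required - self.optional)
        return ([f"missing: {name}" for name in missing]
                + [f"unknown: {name}" for name in unknown])


# Illustrative profile: a small, stable set of attributes (names are assumptions).
EXAMPLE_PROFILE = KernelProfile(
    profile_pid="example-prefix/profile-1",
    required=frozenset({"kernelInformationProfile", "digitalObjectLocation", "dateCreated"}),
    optional=frozenset({"version", "checksum"}),
)

# Illustrative PID record: a handful of key-value pairs stored with the identifier,
# so a client can decide what to do without following the link to the object.
example_record = {
    "kernelInformationProfile": "example-prefix/profile-1",
    "digitalObjectLocation": "https://repository.example.org/objects/42",
    "dateCreated": "2018-12-13",
    "version": "1",
}

if __name__ == "__main__":
    print(EXAMPLE_PROFILE.problems(example_record))
```

Keeping the set of required and optional attributes small is what lets a generic client run the same check on very large numbers of records, which is the reason the text asks for few profiles with stable content.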

 


Please note that the final version of the Recommendation was revised based on comments received during the review process. The previous version is still available.


 

Output Status: RDA Endorsed Recommendations
Review period: Thursday, 13 December, 2018 to Sunday, 20 January, 2019
Primary WG Focus / Output focus: Domain Agnostic
Attachment: Card RDA_PID_Kernel_single 30.pdf (998.45 KB)
  • Author: Joakim Philipson

    Date: 11 Feb, 2020

    Hello,

    in the final Recommendation on PID Kernel Information, I do not find any attributes or other information about properties that are important for PID validation and identification, such as restrictions on string length, character set, or patterns (e.g. recognizable through regexps). I believe these would also be valuable pieces of information to have, along with the resolvers or proxies that resolve the PIDs (as you know, there can be several possible resolvers for the same PID, e.g. a DOI can be resolved equally by hdl.handle.net, identifiers.org and doi.org) and proper validation mechanisms (e.g. a regexp). For a more elaborate argument, please see my paper: 10.3233/DS-190024. Best regards,
     

    Joakim Philipson

    Ph.D. , MLIS
    Research Data Analyst
    https://orcid.org/0000-0001-5699-994X

    Stockholm University Library. Stockholm University 
    [ https://ror.org/05f0yaq80 ]
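The validation and resolver points raised in the comment above can be illustrated with a short Python sketch. It is not part of the Recommendation. The regular expression is a commonly used heuristic for modern DOIs rather than an authoritative definition of DOI syntax, and the function names and URL templates are assumptions made for this example.

```python
import re

# Heuristic pattern for modern DOIs (assumption: prefix "10." plus 4-9 digits,
# then a suffix drawn from a restricted character set). Some valid legacy DOIs
# will not match, so treat a failure as "implausible", not "invalid".
DOI_PATTERN = re.compile(r"10\.\d{4,9}/[-._;()/:A-Z0-9]+", re.IGNORECASE)

# The three resolvers named in the comment, with assumed URL templates.
RESOLVER_TEMPLATES = {
    "doi.org": "https://doi.org/{pid}",
    "hdl.handle.net": "https://hdl.handle.net/{pid}",
    "identifiers.org": "https://identifiers.org/doi:{pid}",
}


def is_plausible_doi(pid: str) -> bool:
    """Check the whole string against the heuristic DOI pattern."""
    return DOI_PATTERN.fullmatch(pid) is not None


def resolver_urls(pid: str) -> dict:
    """Build one candidate resolution URL per resolver for a plausible DOI."""
    if not is_plausible_doi(pid):
        raise ValueError(f"not a plausible DOI: {pid!r}")
    # Suffix characters allowed by the pattern are inserted as-is here;
    # a production client would need to consider URL encoding.
    return {name: template.format(pid=pid)
            for name, template in RESOLVER_TEMPLATES.items()}


if __name__ == "__main__":
    for name, url in resolver_urls("10.15497/rda00031").items():
        print(name, url)
```

If constraints such as this pattern and the list of resolvers were published alongside a PID, a client could validate an identifier and pick a resolver without hard-coding either.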
