Variant definitions of research data

18 Aug 2015

Hello RDA Colleagues,
There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
Does this look useful to you? Does it align with the RDA research data terminology work?
Data
Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
http://orcid.org/0000-0001-5494-4806 | library.caltech.edu

  • Andy Turner's picture

    Author: Andy Turner

    Date: 18 Aug, 2015

    Just to point out that the CASRAI glossary also has a definition for research data that might be useful:
    http://dictionary.casrai.org/Research_data
    Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    - Show quoted text -From: Gclement=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of Repositorian
    Sent: 18 August 2015 18:23
    To: 'RDA/CODATA Legal Interoperability IG'
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Hello RDA Colleagues,
    There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
    CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
    Does this look useful to you? Does it align with the RDA research data terminology work?
    Data
    Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu

  • Jean Bernard Minster's picture

    Author: Jean Bernard Minster

    Date: 18 Aug, 2015

    Thanks, Gail.
    I tried to read it aloud, and had to stop halfway to take a breath.
    Some people in the RDA community would argue that physical samples, considered to be worthy of the same level of curation as data (e.g. critters from the deep sea, moon samples, paleolithic venuses, ….the list is long) would not be covered. I think we should make it clear that they indeed are not covered in our discussion, but that a laser scan thereof would be. For instance a laser scan of an outdoors statues in Firenze, which looses 1 micron off its surface every year due to pollution, must have immense value, and must be protected under IPR concepts, because it can never ever be re-generated ab initio.
    Same thing about the Giant Buddhas destroyed by the Taliban, or the Statues damaged by ISIS, same thing about one-of-a kind illuminated ancient manuscripts, same thing about the Dead Sea Scrolls….the list is interminable, and there is something of value to be protected by internationally agreed upon laws, and we should be clear about what our discussions cover.
    Sorry to be annoying on this, but we had better be clear now, or face interminable discussions later.
    B.

  • Jean Bernard Minster's picture

    Author: Jean Bernard Minster

    Date: 18 Aug, 2015

    Without much thought…. I like this better. Bernard

  • Lisa Neidert's picture

    Author: Lisa Neidert

    Date: 18 Aug, 2015

    That is a pretty good definition but it leaves out a form of data
    researchers are using more and more in the social sciences and that would
    be all the variants of big data:
    Tweets
    Facebook posts
    Credit card transactions
    metadata from cell phones
    box office receipts
    etc.
    None of these data are gathered by "scientists" but are often of interest
    to researchers because of the granularity in terms of both space and time.
    -Lisa
    On Tue, Aug 18, 2015 at 1:23 PM, Repositorian <***@***.***>
    wrote:

  • Jean Bernard Minster's picture

    Author: Jean Bernard Minster

    Date: 18 Aug, 2015

    Now this is getting into vey interesting but murky territory. Should we ask ATT, Verizon and the NSA to guarantee proper data curation of these? (Just kidding, of course).
    Bernard

  • Lisa Neidert's picture

    Author: Lisa Neidert

    Date: 18 Aug, 2015

    The research data definition is better from my standpoint and incorporates
    all the examples I included. And, it covers the other more intentional
    data collection efforts.
    -Lisa

  • Donat Agosti's picture

    Author: Donat Agosti

    Date: 18 Aug, 2015

    Does RDA have a definition of research data? What about US NSF or Horizon 2020/EU research? I am sure, Paul through his work at the National Academy has a source(s)? How does research data relate to research results that ought to be open in the US? Though Research data is not mentioned in the memo data is included in research results https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_publ...
    cheers
    donat
    From: lisan=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of MsDrData
    Sent: Tuesday, August 18, 2015 10:32 PM
    To: agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Repositorian <***@***.***>
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    The research data definition is better from my standpoint and incorporates all the examples I included. And, it covers the other more intentional data collection efforts.
    -Lisa
    On Tue, Aug 18, 2015 at 1:56 PM, agdturner <***@***.***> wrote:
    Just to point out that the CASRAI glossary also has a definition for research data that might be useful:
    http://dictionary.casrai.org/Research_data
    Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    - Show quoted text -From: Gclement=***@***.***-groups.org [mailto:Gclement=***@***.***-groups.org] On Behalf Of Repositorian
    Sent: 18 August 2015 18:23
    To: 'RDA/CODATA Legal Interoperability IG'
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Hello RDA Colleagues,
    There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
    CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
    Does this look useful to you? Does it align with the RDA research data terminology work?
    Data
    Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    _______________________________________________
    Lisa Neidert Population Studies Center
    Data Scientist Institute for Social Research
    734-763-2203(P) 426 Thompson, P.O. Box 1248
    734-763-1428(F) Ann Arbor, MI 48106-1248
    ***@***.*** � �http://www.psc.isr.umich.edu
    Twitter: @msdrdata Skype: MsDrData

  • Gail Clement's picture

    Author: Gail Clement

    Date: 18 Aug, 2015

    On the question of whether RDA has its own RDM glossary containing a definition of research data, the answer is “Sort of yes”. Here is what I’ve surfaced so far:
    · The Data Foundations and Terminology (DFT) Working Group of RDA has in their remit the job of devising definitions for use across RDA
    · Their released deliverables to date comprise a set of documents available online at https://rd-alliance.org/group/data-foundation-and-terminology-wg/outcome...
    · Of particular relevance to our discussion is RDA Data Foundation and Terminology DFT 3:Snapshot of DFT Core Terms, online at https://rd-alliance.org/system/files/DFT3%20-%20Snapshot%20of%20core%20t...
    o They identify core terms and core concepts, using ‘snapshots’ to fix and represent a term/concept that may be differently understood across RDA or may be evolving and still fluid
    o They have defined only those 10 core terms which have shown to find rough consensus: neither “data” nor “research data” are not among those 10 core terms
    o “Data object” is not a core term in the eyes of this WG but it does have a ‘placeholder’ in the Appendix of Additional Terms that have been discussed. The term “Data object” is associated with the following “indication of meaning”: a type of Digital Object containing processible data/information/knowledge. “Digital Object” is a core term and is defined as:
    § 2.2.1 Digital Object (DO)
    § A. Definition
    § A digital object (DO) is represented by a bitstream, is referenced and identified by a persistent identifier and has properties being characterized by metadata.
    Thus it appears for our purposes that analog (non-digital) objects fall outside the scope of our Principles and Guidelines.
    It also appears from Data Foundation and Terminology (DFT) WG webpage that they are aware of the newly introduced CASRAI glossary. A posting to that page on 8-17-2015 reports:
    Announcing a new transdisciplinary Glossary for research data management
    Research Data Canada (RDC) in partnership with the international Consortia Advancing Standards in Research Administration Information (CASRAI)
    is pleased to announce the launch of a PILOT for a new interactive Glossary containing 500+ draft terms and definitions to support work in the field of research data management.
    The glossary is publicly available under a Creative Commons Attribution Only license (CC-BY) at
    http://dictionary.casrai.org/Category:Research_Data_Domain
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    From: Donat Agosti [mailto:***@***.***]
    Sent: Tuesday, August 18, 2015 1:46 PM
    To: MsDrData
    <***@***.***>; agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Gail Clement <***@***.***>
    Subject: RE: [rda-legalinterop-ig] Variant definitions of research data
    Does RDA have a definition of research data? What about US NSF or Horizon 2020/EU research? I am sure, Paul through his work at the National Academy has a source(s)? How does research data relate to research results that ought to be open in the US? Though Research data is not mentioned in the memo data is included in research results https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_publ...
    cheers
    donat
    From: lisan=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of MsDrData
    Sent: Tuesday, August 18, 2015 10:32 PM
    To: agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Repositorian <***@***.***>
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    The research data definition is better from my standpoint and incorporates all the examples I included. And, it covers the other more intentional data collection efforts.
    -Lisa
    On Tue, Aug 18, 2015 at 1:56 PM, agdturner <***@***.***> wrote:
    Just to point out that the CASRAI glossary also has a definition for research data that might be useful:
    http://dictionary.casrai.org/Research_data
    Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    - Show quoted text -From: Gclement=***@***.***-groups.org [mailto:Gclement=***@***.***-groups.org] On Behalf Of Repositorian
    Sent: 18 August 2015 18:23
    To: 'RDA/CODATA Legal Interoperability IG'
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Hello RDA Colleagues,
    There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
    CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
    Does this look useful to you? Does it align with the RDA research data terminology work?
    Data
    Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    _______________________________________________
    Lisa Neidert Population Studies Center
    Data Scientist Institute for Social Research
    734-763-2203(P) 426 Thompson, P.O. Box 1248
    734-763-1428(F) Ann Arbor, MI 48106-1248
    ***@***.*** � �http://www.psc.isr.umich.edu
    Twitter: @msdrdata Skype: MsDrData

  • Paul Uhlir's picture

    Author: Paul Uhlir

    Date: 18 Aug, 2015

    Sorry to come to this conversation late and thanks for getting the ball
    rolling (or the definition gelling). I agree that we should try to use any
    definition that the RDA DFT WG develops for "Research data" and I am
    surprised that this wasn't the first term they addressed. If they do not in
    the near term, we can suggest using the CASRAI formulation, although it is
    quite long, or some other long-term definition that is from a reputable
    source. I think we should resist making one up, however.
    Cheers,
    Paul
    On Tue, Aug 18, 2015 at 5:25 PM, Repositorian <***@***.***>
    wrote:

  • Herman Stehouwer's picture

    Author: Herman Stehouwer

    Date: 19 Aug, 2015

    Dear all,
    two quick remarks (I do try to follow your discussions, but usually I
    have nothing to add!).
    1) The DFT group explicitly limits itself to the domain of registered,
    digital, data. So it is a bit odd to use their definitions as an
    argument to limit yourself, as I thought the discussion here was broader.
    2) Informally RDA defines research data as "data of interest to
    researchers". Which is a bit of a cop-out, but there you go.
    Cheers,
    Herman

  • Andy Turner's picture

    Author: Andy Turner

    Date: 19 Aug, 2015

    FWIW I like the CASRAI Research Data definition. "Digital" kind of meaning of the fingers a useful term, but is used out of context of computers while others regard it in the realm of codified bits/bytes. The CASRAI definition for "Digital data" is currently not very helpful in this context, but the definition of "Digital object" is more pertinent to this discussion as it has been mentioned and does refer to bits/bytes:
    http://dictionary.casrai.org/Digital_object
    A digital object is editable, interactive, accessible and modifiable by means of digital objects other than the one governing its behaviour, and is distributed over information infrastructures. It is a machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems in the Internet." A digital object is composed of structured sequence of bits/bytes. As an object it is named. The bit sequence realizing the object can be identified and accessed by a unique and persistent identifier or by use of referencing attributes describing its properties. SYNONYM. Digital entity
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    ----- Reply message -----
    From: "Herman Stehouwer" <***@***.***>
    To: "puhlir"
    <***@***.***>, "Repositorian" <***@***.***>, "RDA/CODATA Legal Interoperability IG" <***@***.***-groups.org>
    Cc: "Donat Agosti" <***@***.***>, "MsDrData"
    <***@***.***>, "Andy Turner" <***@***.***>
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Date: Wed, Aug 19, 2015 07:44
    Dear all,
    two quick remarks (I do try to follow your discussions, but usually I have nothing to add!).
    1) The DFT group explicitly limits itself to the domain of registered, digital, data. So it is a bit odd to use their definitions as an argument to limit yourself, as I thought the discussion here was broader.
    2) Informally RDA defines research data as "data of interest to researchers". Which is a bit of a cop-out, but there you go.
    Cheers,
    Herman
    On 18/08/15 23:54, puhlir wrote:
    Sorry to come to this conversation late and thanks for getting the ball rolling (or the definition gelling). I agree that we should try to use any definition that the RDA DFT WG develops for "Research data" and I am surprised that this wasn't the first term they addressed. If they do not in the near term, we can suggest using the CASRAI formulation, although it is quite long, or some other long-term definition that is from a reputable source. I think we should resist making one up, however.
    Cheers,
    Paul
    On Tue, Aug 18, 2015 at 5:25 PM, Repositorian <***@***.***> wrote:
    On the question of whether RDA has its own RDM glossary containing a definition of research data, the answer is “Sort of yes”. Here is what I’ve surfaced so far:
    • The Data Foundations and Terminology (DFT) Working Group of RDA has in their remit the job of devising definitions for use across RDA
    • Their released deliverables to date comprise a set of documents available online at https://rd-alliance.org/group/data-foundation-and-terminology-wg/outcome...
    • Of particular relevance to our discussion is RDA Data Foundation and Terminology DFT 3:Snapshot of DFT Core Terms, online at https://rd-alliance.org/system/files/DFT3%20-%20Snapshot%20of%20core%20t...
    o They identify core terms and core concepts, using ‘snapshots’ to fix and represent a term/concept that may be differently understood across RDA or may be evolving and still fluid
    o They have defined only those 10 core terms which have shown to find rough consensus: neither “data” nor “research data” are not among those 10 core terms
    o “Data object” is not a core term in the eyes of this WG but it does have a ‘placeholder’ in the Appendix of Additional Terms that have been discussed. The term “Data object” is associated with the following “indication of meaning”: a type of Digital Object containing processible data/information/knowledge. “Digital Object” is a core term and is defined as:
    • 2.2.1 Digital Object (DO)
    • A. Definition
    • A digital object (DO) is represented by a bitstream, is referenced and identified by a persistent identifier and has properties being characterized by metadata.
    Thus it appears for our purposes that analog (non-digital) objects fall outside the scope of our Principles and Guidelines.
    It also appears from Data Foundation and Terminology (DFT) WG webpage that they are aware of the newly introduced CASRAI glossary. A posting to that page on 8-17-2015 reports:
    Announcing a new transdisciplinary Glossary for research data management
    Research Data Canada (RDC) in partnership with the international Consortia Advancing Standards in Research Administration Information (CASRAI)
    is pleased to announce the launch of a PILOT for a new interactive Glossary containing 500+ draft terms and definitions to support work in the field of research data management.
    The glossary is publicly available under a Creative Commons Attribution Only license (CC-BY) at
    http://dictionary.casrai.org/Category:Research_Data_Domain
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    From: Donat Agosti [mailto:***@***.***]
    Sent: Tuesday, August 18, 2015 1:46 PM
    To: MsDrData <***@***.***>; agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Gail Clement <***@***.***>
    Subject: RE: [rda-legalinterop-ig] Variant definitions of research data
    Does RDA have a definition of research data? What about US NSF or Horizon 2020/EU research? I am sure, Paul through his work at the National Academy has a source(s)? How does research data relate to research results that ought to be open in the US? Though Research data is not mentioned in the memo data is included in research results https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_publ...
    cheers
    donat
    From: lisan=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of MsDrData
    Sent: Tuesday, August 18, 2015 10:32 PM
    To: agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Repositorian <***@***.***>
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    The research data definition is better from my standpoint and incorporates all the examples I included. And, it covers the other more intentional data collection efforts.
    -Lisa
    On Tue, Aug 18, 2015 at 1:56 PM, agdturner <***@***.***> wrote:
    Just to point out that the CASRAI glossary also has a definition for research data that might be useful:
    http://dictionary.casrai.org/Research_data
    Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    - Show quoted text -From: Gclement=***@***.***-groups.org [mailto:Gclement=***@***.***-groups.org] On Behalf Of Repositorian
    Sent: 18 August 2015 18:23
    To: 'RDA/CODATA Legal Interoperability IG'
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Hello RDA Colleagues,
    There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
    CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
    Does this look useful to you? Does it align with the RDA research data terminology work?
    Data
    Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    _______________________________________________
    Lisa Neidert Population Studies Center
    Data Scientist Institute for Social Research
    734-763-2203(P) 426 Thompson, P.O. Box 1248
    734-763-1428(F) Ann Arbor, MI 48106-1248
    ***@***.*** ? ?http://www.psc.isr.umich.edu
    Twitter: @msdrdata Skype: MsDrData
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Dr. ir. Herman Stehouwer
    Max Planck Computing and Data Facility (MPCDF)
    RDA Secretariat
    ***@***.*** 0031-619258815
    Skype: herman.stehouwer.mpi

  • Andy Turner's picture

    Author: Andy Turner

    Date: 19 Aug, 2015

    I have just noticed that the source for the CASRAI Research object definition is RDA: http://smw-rda.esc.rzg.mpg.de/index.php/Main_Page
    I appreciate that in many cases the RDA will concern itself with such digital objects, but the importance of physical data in some academic studies should not be underestimated. I think the RDA should explicitly concern itself with samples (and indeed entire artefacts and all the dirt they have accumulated) and specimens used in research. BTW - a practical way of categorising such physical research data is by the conditions in which they are stored (sometimes they are not, they are just located in situ). Additionally though, some research data objects may be physical bit/byte storage entities - and that edge case is also important, and may complicate definitions. Notwithstanding that most physical research data will have digital object type profiles and that over time, the digital object profiles may persist for longer.
    HTH
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    ----- Reply message -----
    From: "agdturner" <***@***.***>
    To: "Herman Stehouwer" <***@***.***>, "puhlir"
    <***@***.***>, "Repositorian" <***@***.***>, "***@***.***-groups.org" <***@***.***-groups.org>
    Cc: "Donat Agosti" <***@***.***>, "MsDrData"
    <***@***.***>
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Date: Wed, Aug 19, 2015 08:08
    FWIW I like the CASRAI Research Data definition. "Digital" kind of meaning of the fingers a useful term, but is used out of context of computers while others regard it in the realm of codified bits/bytes. The CASRAI definition for "Digital data" is currently not very helpful in this context, but the definition of "Digital object" is more pertinent to this discussion as it has been mentioned and does refer to bits/bytes:
    http://dictionary.casrai.org/Digital_object
    A digital object is editable, interactive, accessible and modifiable by means of digital objects other than the one governing its behaviour, and is distributed over information infrastructures. It is a machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems in the Internet." A digital object is composed of structured sequence of bits/bytes. As an object it is named. The bit sequence realizing the object can be identified and accessed by a unique and persistent identifier or by use of referencing attributes describing its properties. SYNONYM. Digital entity
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    ----- Reply message -----
    From: "Herman Stehouwer" <***@***.***>
    To: "puhlir"
    <***@***.***>, "Repositorian" <***@***.***>, "RDA/CODATA Legal Interoperability IG" <***@***.***-groups.org>
    Cc: "Donat Agosti" <***@***.***>, "MsDrData"
    <***@***.***>, "Andy Turner" <***@***.***>
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Date: Wed, Aug 19, 2015 07:44
    Dear all,
    two quick remarks (I do try to follow your discussions, but usually I have nothing to add!).
    1) The DFT group explicitly limits itself to the domain of registered, digital, data. So it is a bit odd to use their definitions as an argument to limit yourself, as I thought the discussion here was broader.
    2) Informally RDA defines research data as "data of interest to researchers". Which is a bit of a cop-out, but there you go.
    Cheers,
    Herman
    On 18/08/15 23:54, puhlir wrote:
    Sorry to come to this conversation late and thanks for getting the ball rolling (or the definition gelling). I agree that we should try to use any definition that the RDA DFT WG develops for "Research data" and I am surprised that this wasn't the first term they addressed. If they do not in the near term, we can suggest using the CASRAI formulation, although it is quite long, or some other long-term definition that is from a reputable source. I think we should resist making one up, however.
    Cheers,
    Paul
    On Tue, Aug 18, 2015 at 5:25 PM, Repositorian <***@***.***> wrote:
    On the question of whether RDA has its own RDM glossary containing a definition of research data, the answer is “Sort of yes”. Here is what I’ve surfaced so far:
    • The Data Foundations and Terminology (DFT) Working Group of RDA has in their remit the job of devising definitions for use across RDA
    • Their released deliverables to date comprise a set of documents available online at https://rd-alliance.org/group/data-foundation-and-terminology-wg/outcome...
    • Of particular relevance to our discussion is RDA Data Foundation and Terminology DFT 3:Snapshot of DFT Core Terms, online at https://rd-alliance.org/system/files/DFT3%20-%20Snapshot%20of%20core%20t...
    o They identify core terms and core concepts, using ‘snapshots’ to fix and represent a term/concept that may be differently understood across RDA or may be evolving and still fluid
    o They have defined only those 10 core terms which have shown to find rough consensus: neither “data” nor “research data” are not among those 10 core terms
    o “Data object” is not a core term in the eyes of this WG but it does have a ‘placeholder’ in the Appendix of Additional Terms that have been discussed. The term “Data object” is associated with the following “indication of meaning”: a type of Digital Object containing processible data/information/knowledge. “Digital Object” is a core term and is defined as:
    • 2.2.1 Digital Object (DO)
    • A. Definition
    • A digital object (DO) is represented by a bitstream, is referenced and identified by a persistent identifier and has properties being characterized by metadata.
    Thus it appears for our purposes that analog (non-digital) objects fall outside the scope of our Principles and Guidelines.
    It also appears from Data Foundation and Terminology (DFT) WG webpage that they are aware of the newly introduced CASRAI glossary. A posting to that page on 8-17-2015 reports:
    Announcing a new transdisciplinary Glossary for research data management
    Research Data Canada (RDC) in partnership with the international Consortia Advancing Standards in Research Administration Information (CASRAI)
    is pleased to announce the launch of a PILOT for a new interactive Glossary containing 500+ draft terms and definitions to support work in the field of research data management.
    The glossary is publicly available under a Creative Commons Attribution Only license (CC-BY) at
    http://dictionary.casrai.org/Category:Research_Data_Domain
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    From: Donat Agosti [mailto:***@***.***]
    Sent: Tuesday, August 18, 2015 1:46 PM
    To: MsDrData <***@***.***>; agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Gail Clement <***@***.***>
    Subject: RE: [rda-legalinterop-ig] Variant definitions of research data
    Does RDA have a definition of research data? What about US NSF or Horizon 2020/EU research? I am sure, Paul through his work at the National Academy has a source(s)? How does research data relate to research results that ought to be open in the US? Though Research data is not mentioned in the memo data is included in research results https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_publ...
    cheers
    donat
    From: lisan=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of MsDrData
    Sent: Tuesday, August 18, 2015 10:32 PM
    To: agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Repositorian <***@***.***>
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    The research data definition is better from my standpoint and incorporates all the examples I included. And, it covers the other more intentional data collection efforts.
    -Lisa
    On Tue, Aug 18, 2015 at 1:56 PM, agdturner <***@***.***> wrote:
    Just to point out that the CASRAI glossary also has a definition for research data that might be useful:
    http://dictionary.casrai.org/Research_data
    Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    - Show quoted text -From: Gclement=***@***.***-groups.org [mailto:Gclement=***@***.***-groups.org] On Behalf Of Repositorian
    Sent: 18 August 2015 18:23
    To: 'RDA/CODATA Legal Interoperability IG'
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Hello RDA Colleagues,
    There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
    CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
    Does this look useful to you? Does it align with the RDA research data terminology work?
    Data
    Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    _______________________________________________
    Lisa Neidert Population Studies Center
    Data Scientist Institute for Social Research
    734-763-2203(P) 426 Thompson, P.O. Box 1248
    734-763-1428(F) Ann Arbor, MI 48106-1248
    ***@***.*** ? ?http://www.psc.isr.umich.edu
    Twitter: @msdrdata Skype: MsDrData
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Dr. ir. Herman Stehouwer
    Max Planck Computing and Data Facility (MPCDF)
    RDA Secretariat
    ***@***.*** 0031-619258815
    Skype: herman.stehouwer.mpi

  • Chris Morris's picture

    Author: Chris Morris

    Date: 19 Aug, 2015

    HI,
    It is useful to discuss policies for preserving and sharing physical object of research importance. In the life sciences we usually call them samples.
    But this isn’t the same discussion as about data. Data can be copied without loss, and copying data is usually cheap. This is a key argument for open data.
    For samples, there are many other considerations. Some study techniques are destructive, and even techniques that are planned to be non-destructive involve risk. For this reason alone some access restrictions are appropriate, e.g. to precious hominid fossils. Some samples are hazardous, e.g. blood samples from Ebola patients. Some samples have unknown and changing privacy implications – how much can you find out about me from 0.5ml3 of my cerebrospinal fluid? So the range of law involved is much more than IP law – even within Europe the legal definition of biological hazard is far from uniform.
    Finishing a task involves bounding it. Surely it is useful to produce legal interoperability guidelines about digital data. Once that is done, if anyone wants to convene a workgroup about physical samples of research importance, then good luck to you.
    Regards,
    Chris
    From: A.G.D.Turner=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of agdturner
    Sent: 19 August 2015 08:46
    To: Herman Stehouwer; puhlir; Repositorian; ***@***.***-groups.org
    Cc: Donat Agosti; MsDrData
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    I have just noticed that the source for the CASRAI Research object definition is RDA: http://smw-rda.esc.rzg.mpg.de/index.php/Main_Page
    I appreciate that in many cases the RDA will concern itself with such digital objects, but the importance of physical data in some academic studies should not be underestimated. I think the RDA should explicitly concern itself with samples (and indeed entire artefacts and all the dirt they have accumulated) and specimens used in research. BTW - a practical way of categorising such physical research data is by the conditions in which they are stored (sometimes they are not, they are just located in situ). Additionally though, some research data objects may be physical bit/byte storage entities - and that edge case is also important, and may complicate definitions. Notwithstanding that most physical research data will have digital object type profiles and that over time, the digital object profiles may persist for longer.
    HTH
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    ----- Reply message -----
    From: "agdturner" <***@***.***>
    To: "Herman Stehouwer" <***@***.***>, "puhlir"
    <***@***.***>, "Repositorian" <***@***.***>, "***@***.***-groups.org" <***@***.***-groups.org>
    Cc: "Donat Agosti" <***@***.***>, "MsDrData"
    <***@***.***>
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Date: Wed, Aug 19, 2015 08:08
    FWIW I like the CASRAI Research Data definition. "Digital" kind of meaning of the fingers a useful term, but is used out of context of computers while others regard it in the realm of codified bits/bytes. The CASRAI definition for "Digital data" is currently not very helpful in this context, but the definition of "Digital object" is more pertinent to this discussion as it has been mentioned and does refer to bits/bytes:
    http://dictionary.casrai.org/Digital_object
    A digital object is editable, interactive, accessible and modifiable by means of digital objects other than the one governing its behaviour, and is distributed over information infrastructures. It is a machine-independent data structure consisting of one or more elements in digital form that can be parsed by different information systems; the structure helps to enable interoperability among diverse information systems in the Internet." A digital object is composed of structured sequence of bits/bytes. As an object it is named. The bit sequence realizing the object can be identified and accessed by a unique and persistent identifier or by use of referencing attributes describing its properties. SYNONYM. Digital entity
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    ----- Reply message -----
    From: "Herman Stehouwer" <***@***.***>
    To: "puhlir"
    <***@***.***>, "Repositorian" <***@***.***>, "RDA/CODATA Legal Interoperability IG" <***@***.***-groups.org>
    Cc: "Donat Agosti" <***@***.***>, "MsDrData"
    <***@***.***>, "Andy Turner" <***@***.***>
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Date: Wed, Aug 19, 2015 07:44
    Dear all,
    two quick remarks (I do try to follow your discussions, but usually I have nothing to add!).
    1) The DFT group explicitly limits itself to the domain of registered, digital, data. So it is a bit odd to use their definitions as an argument to limit yourself, as I thought the discussion here was broader.
    2) Informally RDA defines research data as "data of interest to researchers". Which is a bit of a cop-out, but there you go.
    Cheers,
    Herman
    On 18/08/15 23:54, puhlir wrote:
    Sorry to come to this conversation late and thanks for getting the ball rolling (or the definition gelling). I agree that we should try to use any definition that the RDA DFT WG develops for "Research data" and I am surprised that this wasn't the first term they addressed. If they do not in the near term, we can suggest using the CASRAI formulation, although it is quite long, or some other long-term definition that is from a reputable source. I think we should resist making one up, however.
    Cheers,
    Paul
    On Tue, Aug 18, 2015 at 5:25 PM, Repositorian <***@***.***> wrote:
    On the question of whether RDA has its own RDM glossary containing a definition of research data, the answer is “Sort of yes”. Here is what I’ve surfaced so far:
    • The Data Foundations and Terminology (DFT) Working Group of RDA has in their remit the job of devising definitions for use across RDA
    • Their released deliverables to date comprise a set of documents available online at https://rd-alliance.org/group/data-foundation-and-terminology-wg/outcome...
    • Of particular relevance to our discussion is RDA Data Foundation and Terminology DFT 3:Snapshot of DFT Core Terms, online at https://rd-alliance.org/system/files/DFT3%20-%20Snapshot%20of%20core%20t...
    o They identify core terms and core concepts, using ‘snapshots’ to fix and represent a term/concept that may be differently understood across RDA or may be evolving and still fluid
    o They have defined only those 10 core terms which have shown to find rough consensus: neither “data” nor “research data” are not among those 10 core terms
    o “Data object” is not a core term in the eyes of this WG but it does have a ‘placeholder’ in the Appendix of Additional Terms that have been discussed. The term “Data object” is associated with the following “indication of meaning”: a type of Digital Object containing processible data/information/knowledge. “Digital Object” is a core term and is defined as:
    • 2.2.1 Digital Object (DO)
    • A. Definition
    • A digital object (DO) is represented by a bitstream, is referenced and identified by a persistent identifier and has properties being characterized by metadata.
    Thus it appears for our purposes that analog (non-digital) objects fall outside the scope of our Principles and Guidelines.
    It also appears from Data Foundation and Terminology (DFT) WG webpage that they are aware of the newly introduced CASRAI glossary. A posting to that page on 8-17-2015 reports:
    Announcing a new transdisciplinary Glossary for research data management
    Research Data Canada (RDC) in partnership with the international Consortia Advancing Standards in Research Administration Information (CASRAI)
    is pleased to announce the launch of a PILOT for a new interactive Glossary containing 500+ draft terms and definitions to support work in the field of research data management.
    The glossary is publicly available under a Creative Commons Attribution Only license (CC-BY) at
    http://dictionary.casrai.org/Category:Research_Data_Domain
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    From: Donat Agosti [mailto:***@***.***]
    Sent: Tuesday, August 18, 2015 1:46 PM
    To: MsDrData
    <***@***.***>; agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Gail Clement <***@***.***>
    Subject: RE: [rda-legalinterop-ig] Variant definitions of research data
    Does RDA have a definition of research data? What about US NSF or Horizon 2020/EU research? I am sure, Paul through his work at the National Academy has a source(s)? How does research data relate to research results that ought to be open in the US? Though Research data is not mentioned in the memo data is included in research results https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_publ...
    cheers
    donat
    From: lisan=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of MsDrData
    Sent: Tuesday, August 18, 2015 10:32 PM
    To: agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Repositorian <***@***.***>
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    The research data definition is better from my standpoint and incorporates all the examples I included. And, it covers the other more intentional data collection efforts.
    -Lisa
    On Tue, Aug 18, 2015 at 1:56 PM, agdturner <***@***.***> wrote:
    Just to point out that the CASRAI glossary also has a definition for research data that might be useful:
    http://dictionary.casrai.org/Research_data
    Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    - Show quoted text -From: Gclement=***@***.***-groups.org [mailto:Gclement=***@***.***-groups.org] On Behalf Of Repositorian
    Sent: 18 August 2015 18:23
    To: 'RDA/CODATA Legal Interoperability IG'
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Hello RDA Colleagues,
    There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
    CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
    Does this look useful to you? Does it align with the RDA research data terminology work?
    Data
    Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    _______________________________________________
    Lisa Neidert Population Studies Center
    Data Scientist Institute for Social Research
    734-763-2203(P) 426 Thompson, P.O. Box 1248
    734-763-1428(F) Ann Arbor, MI 48106-1248
    ***@***.*** � �http://www.psc.isr.umich.edu
    Twitter: @msdrdata Skype: MsDrData
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Dr. ir. Herman Stehouwer
    Max Planck Computing and Data Facility (MPCDF)
    RDA Secretariat
    ***@***.*** 0031-619258815
    Skype: herman.stehouwer.mpi

  • Paul Uhlir's picture

    Author: Paul Uhlir

    Date: 19 Aug, 2015

    Thanks, Herman. This is good to know. Assuming we will use another existing
    definition, we will review it with the DFT WG anyway.
    Cheers,
    Paul
    On Wed, Aug 19, 2015 at 2:43 AM, Herman Stehouwer <
    ***@***.***> wrote:

  • Jean Bernard Minster's picture

    Author: Jean Bernard Minster

    Date: 19 Aug, 2015

    It seems like we need a research project to collect research data on the research problem of what research data might possibly be of interest to researchers.=

  • Andy Turner's picture

    Author: Andy Turner

    Date: 19 Aug, 2015

    In one view, anything can be of research interest and so anything can become research data. After what threshold/point/stage it becomes research data is interesting. I would argue that it is usually prior to any publication being made about it and that data becomes research data in an earlier stage in the research lifecycle.
    In more restricted views we can consider only: data that have been or are currently used in research; digital objects as per the "2.2.1 Digital Object (DO) Definition" (A digital object (DO) is represented by a bitstream, is referenced and identified by a persistent identifier and has properties being characterized by metadata).
    For some research, other research outputs are data. These outputs may be structured sets of numbers with metadata available electonically in a standard file format. They may also be paper based conference proceedings, monographs or journal articles and parts thereof.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
     
    -----Original Message-----
    From: jbminster=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of jbminster
    Sent: 19 August 2015 13:51
    To: puhlir; RDA/CODATA Legal Interoperability IG
    Cc: Jean-Bernard Minster; Herman Stehouwer; Gail Clement; Donat Agosti; MsDrData; Andy Turner
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    It seems like we need a research project to collect research data on the research problem of what research data might possibly be of interest to researchers.=

  • Jean Bernard Minster's picture

    Author: Jean Bernard Minster

    Date: 19 Aug, 2015

    Thanks, Andy. I was not being serious! (I tend to drift off at times into not-always-successful sarcasm!) :-)
    Your categories are useful, we might be able to convey all this in the introduction.
    Bernard

  • ENRIQUE  ALONSO GARCIA's picture

    Author: ENRIQUE ALONSO...

    Date: 20 Aug, 2015

    Paul and/or Simon,
    As promised, I include the clean text of principle 5 for discussion. Can you please place it in the wiki?
    Take care
    Enrique
    ________________________________
    De: pfuhlir=***@***.***-groups.org [pfuhlir=***@***.***-groups.org] En nombre de puhlir [***@***.***]
    Enviado el: miércoles, 19 de agosto de 2015 14:32
    Para: Herman Stehouwer; RDA/CODATA Legal Interoperability IG
    CC: Repositorian; Donat Agosti; MsDrData; agdturner
    Asunto: Re: [rda-legalinterop-ig] Variant definitions of research data
    Thanks, Herman. This is good to know. Assuming we will use another existing definition, we will review it with the DFT WG anyway.
    Cheers,
    Paul
    On Wed, Aug 19, 2015 at 2:43 AM, Herman Stehouwer <***@***.***> wrote:
    Dear all,
    two quick remarks (I do try to follow your discussions, but usually I have nothing to add!).
    1) The DFT group explicitly limits itself to the domain of registered, digital, data. So it is a bit odd to use their definitions as an argument to limit yourself, as I thought the discussion here was broader.
    2) Informally RDA defines research data as "data of interest to researchers". Which is a bit of a cop-out, but there you go.
    Cheers,
    Herman
    On 18/08/15 23:54, puhlir wrote:
    Sorry to come to this conversation late and thanks for getting the ball rolling (or the definition gelling). I agree that we should try to use any definition that the RDA DFT WG develops for "Research data" and I am surprised that this wasn't the first term they addressed. If they do not in the near term, we can suggest using the CASRAI formulation, although it is quite long, or some other long-term definition that is from a reputable source. I think we should resist making one up, however.
    Cheers,
    Paul
    On Tue, Aug 18, 2015 at 5:25 PM, Repositorian <***@***.***> wrote:
    On the question of whether RDA has its own RDM glossary containing a definition of research data, the answer is “Sort of yes”. Here is what I’ve surfaced so far:
    • The Data Foundations and Terminology (DFT) Working Group of RDA has in their remit the job of devising definitions for use across RDA
    • Their released deliverables to date comprise a set of documents available online at https://rd-alliance.org/group/data-foundation-and-terminology-wg/outcome...
    • Of particular relevance to our discussion is RDA Data Foundation and Terminology DFT 3:Snapshot of DFT Core Terms, online at https://rd-alliance.org/system/files/DFT3%20-%20Snapshot%20of%20core%20t...
    o They identify core terms and core concepts, using ‘snapshots’ to fix and represent a term/concept that may be differently understood across RDA or may be evolving and still fluid
    o They have defined only those 10 core terms which have shown to find rough consensus: neither “data” nor “research data” are not among those 10 core terms
    o “Data object” is not a core term in the eyes of this WG but it does have a ‘placeholder’ in the Appendix of Additional Terms that have been discussed. The term “Data object” is associated with the following “indication of meaning”: a type of Digital Object containing processible data/information/knowledge. “Digital Object” is a core term and is defined as:
    • 2.2.1 Digital Object (DO)
    • A. Definition
    • A digital object (DO) is represented by a bitstream, is referenced and identified by a persistent identifier and has properties being characterized by metadata.
    Thus it appears for our purposes that analog (non-digital) objects fall outside the scope of our Principles and Guidelines.
    It also appears from Data Foundation and Terminology (DFT) WG webpage that they are aware of the newly introduced CASRAI glossary. A posting to that page on 8-17-2015 reports:
    Announcing a new transdisciplinary Glossary for research data management
    Research Data Canada (RDC) in partnership with the international Consortia Advancing Standards in Research Administration Information (CASRAI)
    is pleased to announce the launch of a PILOT for a new interactive Glossary containing 500+ draft terms and definitions to support work in the field of research data management.
    The glossary is publicly available under a Creative Commons Attribution Only license (CC-BY) at
    http://dictionary.casrai.org/Category:Research_Data_Domain
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    From: Donat Agosti [mailto:***@***.***]
    Sent: Tuesday, August 18, 2015 1:46 PM
    To: MsDrData <***@***.***>; agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Gail Clement <***@***.***>
    Subject: RE: [rda-legalinterop-ig] Variant definitions of research data
    Does RDA have a definition of research data? What about US NSF or Horizon 2020/EU research? I am sure, Paul through his work at the National Academy has a source(s)? How does research data relate to research results that ought to be open in the US? Though Research data is not mentioned in the memo data is included in research results https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_publ...
    cheers
    donat
    From: lisan=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of MsDrData
    Sent: Tuesday, August 18, 2015 10:32 PM
    To: agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Repositorian <***@***.***>
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    The research data definition is better from my standpoint and incorporates all the examples I included. And, it covers the other more intentional data collection efforts.
    -Lisa
    On Tue, Aug 18, 2015 at 1:56 PM, agdturner <***@***.***> wrote:
    Just to point out that the CASRAI glossary also has a definition for research data that might be useful:
    http://dictionary.casrai.org/Research_data
    Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    - Show quoted text -From: Gclement=***@***.***-groups.org [mailto:Gclement=***@***.***-groups.org] On Behalf Of Repositorian
    Sent: 18 August 2015 18:23
    To: 'RDA/CODATA Legal Interoperability IG'
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Hello RDA Colleagues,
    There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
    CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
    Does this look useful to you? Does it align with the RDA research data terminology work?
    Data
    Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    _______________________________________________
    Lisa Neidert Population Studies Center
    Data Scientist Institute for Social Research
    734-763-2203(P) 426 Thompson, P.O. Box 1248
    734-763-1428(F) Ann Arbor, MI 48106-1248
    ***@***.*** � �http://www.psc.isr.umich.edu
    Twitter: @msdrdata Skype: MsDrData
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Dr. ir. Herman Stehouwer
    Max Planck Computing and Data Facility (MPCDF)
    RDA Secretariat
    ***@***.*** 0031-619258815
    Skype: herman.stehouwer.mpi
    ________________________________
    El texto de este correo es confidencial y exclusivamente está dirigido a su destinatario. Si se ha enviado a una dirección errónea rogamos elimine el mismo y, en su caso, los documentos adjuntos, y nos lo comunique urgentemente. This message is intented only for the use of the addresse and contain confidential information. If you are not the intented recipient, dissemination of this documentation is prohibited. If you have received this communication in error, please, erase all copies of the message and its attachments and notify us immediately.
    Antes de imprimir este correo electrónico, piense bien si es necesario hacerlo: El medioambiente es cosa de todos.

  • ENRIQUE  ALONSO GARCIA's picture

    Author: ENRIQUE ALONSO...

    Date: 20 Aug, 2015

    Sorry I sent this message to a wrong group address. Nevertheless I also forgot to attach the doc. It is in the next email
    ________________________________
    De: Enrique Alonso García
    Enviado el: jueves, 20 de agosto de 2015 4:33
    Para: puhlir; Herman Stehouwer; RDA/CODATA Legal Interoperability IG
    CC: Repositorian; Donat Agosti; MsDrData; agdturner
    Asunto: RE: [rda-legalinterop-ig] Variant definitions of research data
    Paul and/or Simon,
    As promised, I include the clean text of principle 5 for discussion. Can you please place it in the wiki?
    Take care
    Enrique
    ________________________________
    De: pfuhlir=***@***.***-groups.org [pfuhlir=***@***.***-groups.org] En nombre de puhlir [***@***.***]
    Enviado el: miércoles, 19 de agosto de 2015 14:32
    Para: Herman Stehouwer; RDA/CODATA Legal Interoperability IG
    CC: Repositorian; Donat Agosti; MsDrData; agdturner
    Asunto: Re: [rda-legalinterop-ig] Variant definitions of research data
    Thanks, Herman. This is good to know. Assuming we will use another existing definition, we will review it with the DFT WG anyway.
    Cheers,
    Paul
    On Wed, Aug 19, 2015 at 2:43 AM, Herman Stehouwer <***@***.***> wrote:
    Dear all,
    two quick remarks (I do try to follow your discussions, but usually I have nothing to add!).
    1) The DFT group explicitly limits itself to the domain of registered, digital, data. So it is a bit odd to use their definitions as an argument to limit yourself, as I thought the discussion here was broader.
    2) Informally RDA defines research data as "data of interest to researchers". Which is a bit of a cop-out, but there you go.
    Cheers,
    Herman
    On 18/08/15 23:54, puhlir wrote:
    Sorry to come to this conversation late and thanks for getting the ball rolling (or the definition gelling). I agree that we should try to use any definition that the RDA DFT WG develops for "Research data" and I am surprised that this wasn't the first term they addressed. If they do not in the near term, we can suggest using the CASRAI formulation, although it is quite long, or some other long-term definition that is from a reputable source. I think we should resist making one up, however.
    Cheers,
    Paul
    On Tue, Aug 18, 2015 at 5:25 PM, Repositorian <***@***.***> wrote:
    On the question of whether RDA has its own RDM glossary containing a definition of research data, the answer is “Sort of yes”. Here is what I’ve surfaced so far:
    • The Data Foundations and Terminology (DFT) Working Group of RDA has in their remit the job of devising definitions for use across RDA
    • Their released deliverables to date comprise a set of documents available online at https://rd-alliance.org/group/data-foundation-and-terminology-wg/outcome...
    • Of particular relevance to our discussion is RDA Data Foundation and Terminology DFT 3:Snapshot of DFT Core Terms, online at https://rd-alliance.org/system/files/DFT3%20-%20Snapshot%20of%20core%20t...
    o They identify core terms and core concepts, using ‘snapshots’ to fix and represent a term/concept that may be differently understood across RDA or may be evolving and still fluid
    o They have defined only those 10 core terms which have shown to find rough consensus: neither “data” nor “research data” are not among those 10 core terms
    o “Data object” is not a core term in the eyes of this WG but it does have a ‘placeholder’ in the Appendix of Additional Terms that have been discussed. The term “Data object” is associated with the following “indication of meaning”: a type of Digital Object containing processible data/information/knowledge. “Digital Object” is a core term and is defined as:
    • 2.2.1 Digital Object (DO)
    • A. Definition
    • A digital object (DO) is represented by a bitstream, is referenced and identified by a persistent identifier and has properties being characterized by metadata.
    Thus it appears for our purposes that analog (non-digital) objects fall outside the scope of our Principles and Guidelines.
    It also appears from Data Foundation and Terminology (DFT) WG webpage that they are aware of the newly introduced CASRAI glossary. A posting to that page on 8-17-2015 reports:
    Announcing a new transdisciplinary Glossary for research data management
    Research Data Canada (RDC) in partnership with the international Consortia Advancing Standards in Research Administration Information (CASRAI)
    is pleased to announce the launch of a PILOT for a new interactive Glossary containing 500+ draft terms and definitions to support work in the field of research data management.
    The glossary is publicly available under a Creative Commons Attribution Only license (CC-BY) at
    http://dictionary.casrai.org/Category:Research_Data_Domain
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    From: Donat Agosti [mailto:***@***.***]
    Sent: Tuesday, August 18, 2015 1:46 PM
    To: MsDrData <***@***.***>; agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Gail Clement <***@***.***>
    Subject: RE: [rda-legalinterop-ig] Variant definitions of research data
    Does RDA have a definition of research data? What about US NSF or Horizon 2020/EU research? I am sure, Paul through his work at the National Academy has a source(s)? How does research data relate to research results that ought to be open in the US? Though Research data is not mentioned in the memo data is included in research results https://www.whitehouse.gov/sites/default/files/microsites/ostp/ostp_publ...
    cheers
    donat
    From: lisan=***@***.***-groups.org [mailto:***@***.***-groups.org] On Behalf Of MsDrData
    Sent: Tuesday, August 18, 2015 10:32 PM
    To: agdturner <***@***.***>; RDA/CODATA Legal Interoperability IG <***@***.***-groups.org>
    Cc: Repositorian <***@***.***>
    Subject: Re: [rda-legalinterop-ig] Variant definitions of research data
    The research data definition is better from my standpoint and incorporates all the examples I included. And, it covers the other more intentional data collection efforts.
    -Lisa
    On Tue, Aug 18, 2015 at 1:56 PM, agdturner <***@***.***> wrote:
    Just to point out that the CASRAI glossary also has a definition for research data that might be useful:
    http://dictionary.casrai.org/Research_data
    Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
    Andy
    http://www.geog.leeds.ac.uk/people/a.turner/index.html
    - Show quoted text -From: Gclement=***@***.***-groups.org [mailto:Gclement=***@***.***-groups.org] On Behalf Of Repositorian
    Sent: 18 August 2015 18:23
    To: 'RDA/CODATA Legal Interoperability IG'
    Subject: [rda-legalinterop-ig] Variant definitions of research data
    Hello RDA Colleagues,
    There seems to be a proliferation of definitions across our domain, including this one from CASRAI as part of their research data glossary initiative. http://dictionary.casrai.org/Data
    CASRAI just put out a call for reviews of their glossary and the one for data looks pretty good.
    Does this look useful to you? Does it align with the RDA research data terminology work?
    Data
    Facts, measurements, recordings, records, or observations about the world collected by scientists and others, with a minimum of contextual interpretation. Data may be in any format or medium taking the form of writings, notes, numbers, symbols, text, images, films, video, sound recordings, pictorial reproductions, drawings, designs or other graphical representations, procedural manuals, forms, diagrams, work flow charts, equipment descriptions, data files, data processing algorithms, or statistical records
    Gail P. Clement | Head of Research Services | Caltech Library | Mail Code 1-43 | Pasadena CA 91125-4300 | 626-395-1203
    http://orcid.org/0000-0001-5494-4806 | library.caltech.edu
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    _______________________________________________
    Lisa Neidert Population Studies Center
    Data Scientist Institute for Social Research
    734-763-2203(P) 426 Thompson, P.O. Box 1248
    734-763-1428(F) Ann Arbor, MI 48106-1248
    ***@***.*** � �http://www.psc.isr.umich.edu
    Twitter: @msdrdata Skype: MsDrData
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Full post: https://www.rd-alliance.org/group/rdacodata-legal-interoperability-ig/po...
    Manage my subscriptions: https://www.rd-alliance.org/mailinglist
    Stop emails for this post: https://www.rd-alliance.org/mailinglist/unsubscribe/49569
    --
    Dr. ir. Herman Stehouwer
    Max Planck Computing and Data Facility (MPCDF)
    RDA Secretariat
    ***@***.*** 0031-619258815
    Skype: herman.stehouwer.mpi
    ________________________________
    El texto de este correo es confidencial y exclusivamente está dirigido a su destinatario. Si se ha enviado a una dirección errónea rogamos elimine el mismo y, en su caso, los documentos adjuntos, y nos lo comunique urgentemente. This message is intented only for the use of the addresse and contain confidential information. If you are not the intented recipient, dissemination of this documentation is prohibited. If you have received this communication in error, please, erase all copies of the message and its attachments and notify us immediately.
    Antes de imprimir este correo electrónico, piense bien si es necesario hacerlo: El medioambiente es cosa de todos.

submit a comment