Data Citation Principles
In 2013, the Amsterdam Manifesto on Data Citation Principles was published, detailing the concept of data as a citable product of research in 8 short statements.
In short,
- Data should be considered legitimate, citable products of research. Data citations should be accorded the same importance in the scholarly record as citations of other research objects, such as publications.
- Data citations should facilitate giving scholarly credit and normative and legal attribution to all contributors to the data, recognizing that a single style or mechanism of attribution may not be applicable to all data.
- In scholarly literature, whenever and wherever a claim relies upon data, the corresponding data should be cited.
- A data citation should include a persistent method for identification that is machine actionable, globally unique, and widely used by a community. [An example of a persistent identifier might be a DOI or digital object identifier]
- Data citations should facilitate access to the data themselves and to such associated metadata, documentation, code, and other materials, as are necessary for both humans and machines to make informed use of the referenced data.
- Unique identifiers, and metadata describing the data, and its disposition, should persist — even beyond the lifespan of the data they describe. [See #4]
- Data citations should facilitate identification of, access to, and verification of the specific data that support a claim. Citations or citation metadata should include information about provenance and fixity sufficient to facilitate verifying that the specific timeslice, version and/or granular portion of data retrieved subsequently is the same as was originally cited.
- Data citation methods should be sufficiently flexible to accommodate the variant practices among communities, but should not differ so much that they compromise interoperability of data citation practices across communities
Source: force11.org/datacitation
Data Set Citing Basics
Basic Items to Include:
- Author: Name of the individual, group, or organization responsible for the creation of the data set.
- Title: Name of the data set.
- Format: Notation that this is a data set as opposed to a journal article, book, or website.
- Location: City/State of the organization/institution that produced the data set.
- Date: Year the data set was released/published. Also, in some cases, the date you accessed the data set.
- Version: If multiple versions of the data set are available, include the version number of the data set you used.
- Unique Identifier: AÂ unique identifier to link back to the specific data set (examples include: DOI, PURL, repository ID number, etc.).
- Distributor: Name of the organization/site providing access to the data set.
The order and formatting of these pieces of information will vary according to different citation styles, journal publishers, and data repositories.
DATA SET CITATION EXAMPLES
APA
Format:
Author/Rightsholder. (Year). Title of data set (Version number) [Description of form]. Location: Name of producer.
or
Author/Rightsholder. (Year). Title of data set (Version number) [Description of form]. Retrieved from http://
Example:
Advanced Cooperative Arctic Data and Information Service (ACADIS). (2010). LiDAR (DEM) NIMS grid Barrow, Alaska 2010. [Data set]. Retrieved from:https://www.aoncadis.org/dataset/lidar_dem_nims_grid_barrow_alaska_2010.html
Example:
Zhang, G., Parker, P., Li, B., Li, H., & Wang, J. (2012). The genome of Darwin’s Finch (Geospiza fortis). GigaScience. [Data set]. doi: 10.5524/100040
See more information from the APA Style Blog here:Â http://blog.apastyle.org/apastyle/2013/12/how-to-cite-a-data-set-in-apa-style.html
Note: Some data sources will provide additional citation information and help.
MLA
Format:
MLA has not yet developed specific rules for dataset citations, so follow the rules for a general website.
Example:
Tweedie, Craig. E., and Steven Oberbauer. Kite Aerial Photography NIMS Grid Barrow, Alaska 2013. (Data set). Barrow, AK: Advanced Cooperative Arctic Data and Information Service, 2010. Web. 7 Apr. 2014. <https://www.aoncadis.org/dataset/kite_aerial_photography_nims_grid_barrow_alaska_2013.html>
Example:
Zhang, G., D. Lambert, and J. Wang. Genomic Data from Adelie Penguin (Pygoscelis adeliae). (Data set). Gigascience, 2011. Web. 17 Apr. 2014. <http://dx.doi.org/10.5524/100006>
Note: Some data sources will provide additional citation information and help.