The rationale behind cited data:
There is increasing demand from the scientific community for a strong linkage between papers published in the scientific literature, and the data upon which they are based, and for a mechanism to reward data collection through citation.
A list of all the formally cited datasets held by the NGDC showing the title, author(s) and the digital object identifier (DOI), which links to the landing page with metadata links and direct access to the data where appropriate.
The NGDC now has the ability to issue a DOI to any datasets it holds that meet certain rigorous management criteria. This is a result of collaboration between the NERC data centres, the British Library and DataCite.
The DOI allows scientists to cite datasets in the same manner as a scientific journal article, enabling credit to be assigned to the dataset creators, and ensuring the discoverability, permanence and stability of the dataset. This recognises the value of the data and the effort that has gone into its creation, capture and effective management. DOIs allow formal publication of the dataset in data journals.
Datasets must be fully ingested into the data centre before a DOI can be minted. In exceptional cases, a DOI can be reserved for minting later (for example when a DOI is required for a dataset that forms the basis of a journal publication). Legacy datasets that have already been ingested into a data centre may also be assigned DOIs.
For a dataset to be assigned a DOI, it must be provided to the data centre in good condition, with appropriate metadata and of a suitable level of technical quality. The dataset submitter will be responsible for ensuring the data meets the required level of quality. Details of the minimum requirements for data are provided in the Guidelines for Scientists, with further information provided by the relevant sector-specific data centre.
When the NGDC assigns a DOI to a dataset, it is providing certain assurances to the subsequent data user. These assurances include that the dataset cited is:
Therefore when a dataset is assigned a DOI, the NGDC confirms that:
The NGDC will provide a full catalogue page (landing or splash page), which will appear when any user clicks on the DOI hyperlink.
Once a dataset has been deposited with the NGDC and a DOI issued the dataset cannot be modified. If there are updates or changes to the dataset a new version of the dataset will need to be deposited and the NGDC will:
The NGDC will accept data according to NERC data policy and the NERC data value checklist or NGDC data value checklist, depending on which is most appropriate. will also ensure that the data meets the NGDC collection policy. Guidance is available at ESAA guides and documentation.
One objective of data management within NGDC is to ensure that data can be reused with confidence decades after collection and without the need for any kind of communication with the scientists who collected that data.
The following good practice, adopted across all the NERC environmental data centres, must be met for a dataset to be accepted.
The format must be well documented and conform to widely accepted standards.
The format must be readable by tools that are freely available now and are likely to remain freely available in the future.
Data files should be named in a clear and consistent manner throughout the dataset, with filenames (rather than pathnames) that reflect the contents and uniquely identify the file. Filename extensions should conform to appropriate extensions for the file type. Filenames should be constructed from lower case letters, numbers, dashes and underscores and be no longer than 64 characters.
Parameters in data files should either be labelled using an internationally recognised standard, or by local labels that are accompanied by clear, unambiguous plain text descriptions.
Units of measure must be included for all parameters and clearly labelled.
Data must be accompanied by sufficient usage metadata to enable its reliable reuse. Some of this may be embedded within the data files. If not it should be included as additional documents.
The technical experts in the NGDC are responsible for ensuring that the dataset meets the required level of technical quality before a DOI can be issued to it.