Data warehouses
Opening Up Scientific Datasets: 6 Key Points
- What is a scientific dataset?
- What is open data?
- Opening up your datasets is a strategic decision
- What are the options for sharing research data?
- Preparing data for release
- Choose a distribution license for data reuse
- Practical information sheet
- CIRAD
- CC-BY-NC-SA
Choosing the best data warehouse
- What is a research data warehouse?
- Why store data in a data warehouse
- Different types of data warehouses
- Questions to ask yourself before loading data into a data warehouse
- How to Choose a Data Warehouse
- Useful links
- Practical information sheet
- CIRAD
- CC-BY-NC-SA
Finding datasets through multidisciplinary databases and search engines: 8 steps
- The Value of Datasets
- Databases and search engines for finding datasets
- DataCitation Index, Clarivate Analytics' subscription-based database
- Dataset Search, Google's free data search engine
- Dimensions, Digital Science's free academic search engine
- Explore OpenAIRE, the European platform for accessing publications and datasets
- BASE, the Bielefeld Academic Search Engine, from the University of Bielefeld
- Mendeley Data, Elsevier’s free database for research data
- Practical information sheet
- CIRAD
- CC-BY-NC-SA
Data warehouses: opening up your own data, exploring others' data
A practical guide to data warehouses, which are online databases designed to store and preserve standardized data described by rich metadata; to make this data accessible by ensuring traceability (persistent identifiers), access conditions (rights management), and reuse (distribution licenses); and to facilitate their discovery and utilization (online display, search tools, etc.).
- Practical information sheet
- AgroParisTech
- CC-BY
Opening up scientific results with open licenses
Fact sheet on open licenses, a legal tool that allows the rights holder of a work to publicly specify the permitted forms of distribution and reuse of that content.
- Practical information sheet
- AgroParisTech
- CC-BY
How can I ensure that my research is understandable and reusable? The use of metadata
A practical guide to metadata, which is “data about data”—that is, information that describes and contextualizes the data: context of acquisition (why, how?), unit of measurement, collection date, file format, etc.
It encompasses any element that helps us understand, make sense of, and reuse the data. In a way, metadata serves as the identity card for datasets.
- Practical information sheet
- AgroParisTech
- CC-BY
How to publish research data? The example of Recherche Data Gouv
Feedback on how to approach data publication, with a focus on the French repository Recherche Data Gouv and the JUICCE project.
- Practical information sheet
- AgroParisTech
- CC-BY-NC-ND
Publication of health data in the DataSuds warehouse: feedback
Depositing and publishing health data in the DataSuds research repository
- Practical information sheet
- AgroParisTech
- CC-BY-NC
The Inserm Data Warehouse (EDI)
The Inserm data warehouse on the Recherche Data Gouv platform enables the preservation, sharing, and open access to research data in accordance with the FAIR principles. This infrastructure, made available to scientists by the Ministry of Research, is part of the advancement of open science.
- Practical information sheet
- Inserm
The National Health Data System (SNDS) health database
The National Health Data System (SNDS) is one of the largest health databases in the world. The use of this confidential personal data is subject to very strict regulations. Inserm supports teams seeking access to the data for projects that do not involve human subjects.
- Practical information sheet
- Inserm