Data warehouses

Opening Up Scientific Datasets: 6 Key Points

  • What is a scientific dataset?
  • What is open data?
  • Opening up your datasets is a strategic decision
  • What are the options for sharing research data?
  • Preparing data for dissemination
  • Choose a distribution license for data reuse
  • Fact Sheet
  • CIRAD
  • CC-BY-NC-SA

Choosing the Best Data Warehouse

  • What is a research data warehouse?
  • Why store data in a data warehouse
  • Different types of data warehouses
  • Questions to ask yourself before loading data into a data warehouse
  • How to Choose a Data Warehouse
  • Useful links
  • Fact Sheet
  • CIRAD
  • CC-BY-NC-SA

Finding datasets through multidisciplinary databases and search engines: 8 steps

  • The Value of Datasets
  • Databases and search engines for finding datasets
  • DataCitation Index, Clarivate Analytics' subscription-based database
  • Dataset Search, Google's free data search engine
  • Dimensions, Digital Science's free academic search engine
  • Explore OpenAIRE, the European platform for accessing publications and datasets
  • BASE, the Bielefeld Academic Search Engine, from the German university
  • Mendeley Data, Elsevier’s free database for research data
  • Fact Sheet
  • CIRAD
  • CC-BY-NC-SA

Data warehouses: opening up your data, exploring others' data

A practical guide to data warehouses, which are online databases designed to store and preserve standardized data described by rich metadata; to make this data accessible while ensuring traceability (persistent identifiers), access conditions (rights management), and reuse (distribution licenses); and to facilitate their discovery and utilization (online display, search tools, etc.).

  • Fact Sheet
  • AgroParisTech
  • CC-BY

Making scientific results openly available through open licenses

Fact sheet on open licenses, a legal tool that allows the copyright holder of a work to publicly specify the forms of distribution and reuse permitted for that content.

  • Fact Sheet
  • AgroParisTech
  • CC-BY

How can I ensure that my research is understandable and reusable? The use of metadata

A practical guide to metadata, which is “data about data”—that is, information that describes and contextualizes the data: context of acquisition (why, how?), unit of measurement, collection date, file format, etc.
It therefore encompasses any element that helps us understand, make sense of, and reuse the data. In a way, metadata serves as the “identity card” for datasets.

  • Fact Sheet
  • AgroParisTech
  • CC-BY

How to publish research data? The example of Recherche Data Gouv

Feedback on how to approach data publication, with a focus on the French repository Recherche Data Gouv and the JUICCE project.

  • Fact Sheet
  • AgroParisTech
  • CC-BY-NC-ND

Publishing Health Data in the DataSuds Repository: Lessons Learned

Deposit and publication of health data in the DataSuds research repository

  • Fact Sheet
  • AgroParisTech
  • CC-BY-NC

The Inserm Data Warehouse (EDI)

The Inserm data warehouse on the Recherche Data Gouv platform enables the preservation, sharing, and open access to research data in accordance with the FAIR principles. This infrastructure, made available to scientists by the Ministry of Research, is part of the broader effort to advance open science.

  • Fact Sheet
  • Inserm

The National Health Data System (NHDS) health databases

The National Health Data System (SNDS) is one of the largest health databases in the world. The use of this confidential personal data is subject to very strict regulations. Inserm supports teams seeking access to the data for projects that do not involve human subjects.

  • Fact Sheet
  • Inserm