Section: New Software and Platforms
Input Output Data Science
Keywords: Open data - Semantic Web - FAIR (Findable, Accessible, Interoperable, and Reusable)
Functional Description: io.datascience (Input Output Data Science) is the instance of the Linked Wiki platform developed specifically in Paris-Saclay University as part of its Center for Data Science.
The goal of io.datascience: to facilitate the sharing and use of scientific data. The technological concept of io.datascience: the exploitation of semantic web advances, and in particular wiki technologies.
One of the grand challenges of data-intensive science is to facilitate knowledge discovery by assisting humans and machines in their discovery of, access to, integration and analysis of, task-appropriate scientific data and their associated algorithms and workflows. The guiding principles for this challenge have been defined: Data should become FAIR (Findable, Accessible, Interoperable, and Reusable) (Wilkinson, M., and The FAIR Guiding Principles for Scientific Data Management and Stewardship, Nature Scientific Data 2016)
io.datascience is both a data sharing platform and a framework for further development. It realizes a practical implementation of FAIR principles through a user-centric approach. • Share: Software users can declare the sources of the data they use as well as their query requests. • Discover: Using a form, users can link their data sources to each other. The repository used is that of Wikidata. The user can then retrieve his data sources and example queries through a search interface or directly through Google and Wikipedia. • Reuse: data is identified and qualified, a simple interface allows the user to provide the desired level of description for the data they refer to, as well as examples of use. • Analyze: io.datascience will soon be proposing the creation of RDF databases on the cloud on the cloud of Paris Sud University.
Publications: Data acquisition for analytical platforms: Automating scientific workflows and building an open database platform for chemical anlysis metadata - A platform for scientific data sharing - TFT, Tests For Triplestores - Une autocomplétion générique de SPARQL dans un contexte multi-services - Certifying the interoperability of RDF database systems - Transforming Wikipedia into an Ontology-based Information Retrieval Search Engine for Local Experts using a Third-Party Taxonomy - The Grid Observatory 3.0 - Towards reproducible research and open collaborations using semantic technologies