Facilitating Semantic Interoperability Among Ocean Data Systems: ODIP-R2R Student Outcomes

Wednesday, 17 December 2014: 10:50 AM
Karen I Stocks1, Yu Chen2, Adam Shepherd3, Cynthia L Chandler3, Nkemdirim Dockery4, Jocelyn Lee Elya4, Shawn R Smith4, Renata Ferreira1, Linyun Fu2 and Robert A Arko5, (1)Scripps Institution of Oceanography, La Jolla, CA, United States, (2)Rensselaer Polytechnic Institute, Troy, NY, United States, (3)Woods Hole Oceangraphic Inst, Woods Hole, MA, United States, (4)Florida State University, Tallahassee, FL, United States, (5)Lamont-Doherty Earth Observatory, Palisades, NY, United States
With informatics providing an increasingly important set of tools for geoscientists, it is critical to train the next generation of scientists in information and data techniques. The NSF-supported Rolling Deck to Repository (R2R) Program works with the academic fleet community to routinely document, assess, and preserve the underway sensor data from U.S. research vessels. The Ocean Data Interoperability Platform (ODIP) is an EU-US-Australian collaboration fostering interoperability among regional e-infrastructures through workshops and joint prototype development. The need to align terminology between systems is a common challenge across all of the ODIP prototypes. Five R2R students were supported to address aspects of semantic interoperability within ODIP.
  • Developing a vocabulary matching service that links terms from different vocabularies with similar concept. The service implements Google Refine reconciliation service interface such that users can leverage Google Refine application as a friendly user interface while linking different vocabulary terms.
  • Developing Resource Description Framework (RDF) resources that map Shipboard Automated Meteorological Oceanographic System (SAMOS) vocabularies to internationally served vocabularies. Each SAMOS vocabulary term (data parameter and quality control flag) will be described as an RDF resource page. These RDF resources allow for enhanced discoverability and retrieval of SAMOS data by enabling data searches based on parameter.
  • Improving data retrieval and interoperability by exposing data and mapped vocabularies using Semantic Web technologies. We have collaborated with ODIP participating organizations in order to build a generalized data model that will be used to populate a SPARQL endpoint in order to provide expressive querying over our data files.
  • Mapping local and regional vocabularies used by R2R to those used by ODIP partners. This work is described more fully in a companion poster.
  • Making published Linked Data Web developer-friendly with a RESTful service. This goal was achieved by defining a proxy layer on top of the existing SPARQL endpoint that 1) translates HTTP requests into SPARQL queries, and 2) renders the returned results as required by the request sender using content negotiation, suffixes and parameters.