Improving the Publication and Re-Use of Data via PresQT Tools and Services

Wednesday, 9 December 2020: 04:08
Sandra Gesing, University of Notre Dame, Center for Research Computing, Notre Dame, IN, United States, Natalie K Meyers, University of Notre Dame, Notre Dame, IN, United States, Richard Johnson, University of Notre Dame, Hesburgh Libraries, Notre Dame, IN, United States and Zheng Wang, University of Notre Dame, Hesburgh LIbraries, Notre Dame, IN, United States
Researchers face the challenge to choose from a diverse set of repositories to publish, preserve and share their data. Ideally, they have features available to seamlessly integrate the preservation step in their daily research routine. The project PresQT (Preservation Quality Tool) eases the use of repositories and serves as boilerplate between existing solutions while adding beneficial metadata and FAIR tests (Findability, Accessibility, Interoperability, and Reuse). PresQT and its standards-based design with RESTful web services have been informed via user-centered design and is a collaborative open-source implementation effort. The PresQT services extend the preservation tool landscape in a way that stakeholders can keep working in their chosen computational environment and receive additional features instead of having to switch to a different software. PresQT services form the connection between tools, workflows and databases to existing repositories. Current partners or implementations for open APIs include OSF, CurateND, EaaSI, GitHub, GitLab, Zenodo, FigShare, WholeTale, Jupyter and HUBzero. The diversity of partners contributes to understanding the needs of the stakeholders of PresQT services.

PresQT services are easily integratable and target systems can be added via extending JSON files and Python functions. Data is packaged as BagITs for uploads, downloads and transfers. The current services include transfers with fixity checks supporting diverse hash algorithms, keyword enhancement via SciGraph, upload, download and connection to EaaSI services. The FAIR tests are under development building on FairShake for initial tests. The next planned steps include to create indicators how FAIR the data in the target repository is stored with additional hints for improvement. To present the capabilities to interested developers of computational solutions, users of PresQT services and funding bodies, we have developed a demo user interface that allows for demoing and testing the different features of PresQT services.