Towards a FAIR-compliant ocean and environmental genome database
Abstract:
KBase currently provides access to all genomes available through NCBI RefSeq. To augment these genome data, we are constructing Narratives that provide access to metagenome-assembled genome (MAG) and single-amplified genome (SAG) data on a per-publication basis, and we encourage the community to do the same. The generated Narratives live in a public KBase organization so that users can access them conveniently. In addition to providing access to recently published MAG and SAG data not currently in NCBI, the KBase Narratives allow users to explore and analyze the data within the system while preserving provenance back to the original Narrative, which lists the authors and links to the original publication. We envision that a user-driven, centralized, and publicly available MAG/SAG database within KBase that is accessible to both users and machines will democratize access and help improve the infrastructure supporting the reuse of ocean- and environment-derived genome sequence data.