MGnifyR: An R package for accessing MGnify microbiome data
Contents
- Introduction to MGnify
- Overview of
MGnifyR
- Bioconductor and
MGnifyR
MGnify
Data availability
- Open access
- Taxonomy and functional mappings along with genome catalogues
- Application Programming Interface (API)
Challenges in API
- Requires familiarity with API workflows, database structures, and linkages.
- Leads to challenges in reproducibility and time-consuming data wrangling.
MGnifyR
- R/Bioconductor package
- Streamlines access to MGnify database
- Bridges the gap between data resources and tools
Functions
MgnifyClient()
: Constructor for MgnifyClient
object.
doQuery()
: Search MGnify database for studies, samples, runs, analyses, biomes, assemblies, and genomes.
searchAnalysis()
: Look up analysis accession IDs for one or more study or sample accessions.
getMetadata()
: Get all study, sample and analysis metadata for the supplied analysis accessions.
getResult()
: Get microbial and/or functional profiling data for a list of accessions.
getData()
: Versatile function to retrieve raw results.
getFile()
& searchFile()
: Download any MGnify files, also including processed reads and identified protein sequences.
Bioconductor and MGnifyR
- Microbial and functional profiling data is retrieved as
TreeSE
/MAE
format.
- Common data format supported broadly by Bioconductor.
- Enables direct access to cutting-edge tools (e.g. miaverse).