Overview
The strollur package stores the data associated with your microbial DNA analysis. This includes nucleotide sequences, abundance, sample and treatment assignments, taxonomic classifications, sequence bin assignments, metadata, trees and various reports. It is designed to facilitate data analysis across multiple R packages with utility functions to import from mothur, qiime2, dada2 and phyloseq.
-
add()adds sequences, reports, metadata, and resource references -
assign()assigns abundances, classifications, bins, samples and treatments and more -
names()gets the names of sequences, bins, samples, treatments and reports -
count()gets the number of sequences, bins, samples and treatments -
abundance()gets the abundances for sequences, bins, samples, and treatments -
report()gets FASTA sequences, sequence and classification reports, bin assignments, sample assignments, metadata, sequence data reports, custom reports, resource references and scrapped data reports. -
summary()summarizes sequences, your custom reports, and scrapped data
Installation
You can install the CRAN version with:
install.packages("strollur")Development version
You can install the development version of strollur from GitHub with:
# install.packages("devtools")
devtools::install_github("mothur/strollur")Usage
The example below adds FASTA sequence data, assigns sequence abundance, samples and treatments, as well as assigning bins and taxonomic data to a strollur object.
fasta_data <- read_fasta(strollur_example("final.fasta.gz"))
abundance_table <- readRDS(strollur_example("miseq_abundance_by_sample.rds"))
bin_table <- readRDS(strollur_example("miseq_list_otu.rds"))
classification_data <- read_mothur_taxonomy(taxonomy = strollur_example("final.taxonomy.gz"))
data <- new_dataset(dataset_name = "microbial RNA example")
add(data, table = fasta_data, type = "sequences")
#> ℹ Added 2425 sequences.
#> [1] 2425
assign(data, table = abundance_table, type = "sequence_abundance")
#> ℹ Assigned 2425 sequence abundances.
#> [1] 2425
assign(data, table = bin_table, type = "bins", bin_type = "otu")
#> ℹ Assigned 531 otu bins.
#> [1] 531
assign(data, table = classification_data, type = "sequence_taxonomy")
#> ℹ Assigned 2425 sequence taxonomies.
#> [1] 2425
data
#> microbial RNA example:
#>
#> starts ends nbases ambigs polymers numns numseqs
#> Minimum: 1 375 249 0 3 0 1.00
#> 2.5%-tile: 1 375 252 0 3 0 2850.08
#> 25%-tile: 1 375 252 0 4 0 28491.75
#> Median: 1 375 252 0 4 0 56982.50
#> 75%-tile: 1 375 253 0 5 0 85473.25
#> 97.5%-tile: 1 375 253 0 6 0 111114.93
#> Maximum: 1 375 256 0 6 0 113963.00
#> Mean: 1 375 252 0 4 0 0.00
#>
#> Number of unique seqs: 2425
#> Total number of seqs: 113963
#>
#> Total number of samples: 19
#> Total number of treatments: 2
#> Total number of otus: 531
#> Total number of otu bin classifications: 531
#> Total number of sequence classifications: 2425Getting help
If you encounter an issue, please file an issue on GitHub. Please include a minimal reproducible example with your issue.
Contributing
Is there a feature you’d like to see included, please let us know! Pull requests are welcome on GitHub.
Code of Conduct
Please note that the strollur project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.
