We will be offering an R workshop December 18-20, 2019. Learn more.


From mothur
Revision as of 14:44, 22 May 2019 by Westcott (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

The make.biom command is design to allow you to convert your shared file to a biom file. http://biom-format.org/documentation/biom_format.html (Note: mothur currently supports format version 1.0) To run this tutorial please download, Example Files

Default Settings

The shared parameter is required to run the make.biom command.

mothur > make.biom(shared=final.tx.1.subsample.1.pick.shared)



The constaxonomy file is the taxonomy file outputted by classify.otu(list=yourListfile, taxonomy=yourTaxonomyFile). Be SURE that the you are the constaxonomy file distance matches the shared file distance. ie, for *.0.03.cons.taxonomy set label=0.03.

mothur > make.biom(shared=final.tx.1.subsample.1.pick.shared, constaxonomy=final.tx.1.cons.taxonomy)


You can add sample data support (e.g., experimental factor, sample collection date, etc) using the metadata parameter.

For example if your metadata file looked like:

group	dpw	description
F003D000	0	"F003D000 description"
F003D002	2	"F003D002 description"
F003D004	4 	"F003D004 description"
F003D006	6       "F003D006 description"
F003D008	8       "F003D008 description"
F003D142	142     "F003D142 description"
F003D144	144     "F003D144 description"
F003D146	146     "F003D146 description"
F003D148	148     "F003D148 description"
F003D150	150     "F003D150 description"
MOCK.GQY1XT001	12	"MOCK.GQY1XT001 description"

Your biom file columns would look like:

           {"id":"F003D000", "metadata":{"dpw":"0", "description":"F003D000 description"}},
           {"id":"F003D002", "metadata":{"dpw":"2", "description":"F003D002 description"}},
           {"id":"F003D004", "metadata":{"dpw":"4", "description":"F003D004 description"}},
           {"id":"F003D006", "metadata":{"dpw":"6", "description":"F003D006 description"}},
           {"id":"F003D008", "metadata":{"dpw":"8", "description":"F003D008 description"}},
           {"id":"F003D142", "metadata":{"dpw":"142", "description":"F003D142 description"}},
           {"id":"F003D144", "metadata":{"dpw":"144", "description":"F003D144 description"}},
           {"id":"F003D146", "metadata":{"dpw":"146", "description":"F003D146 description"}},
           {"id":"F003D148", "metadata":{"dpw":"148", "description":"F003D148 description"}},
           {"id":"F003D150", "metadata":{"dpw":"150", "description":"F003D150 description"}},
           {"id":"MOCK.GQY1XT001", "metadata":{"dpw":"12", "description":"MOCK.GQY1XT001 description"}}


The matrixtype parameter allows you to select what type you would like to make. Choices are sparse and dense, default is sparse.

mothur > make.biom(shared=final.tx.1.subsample.1.pick.shared, matrixtype=dense)


The groups parameter allows you to specify which of the groups in your groupfile you would like included. The group names are separated by dashes.

mothur > make.biom(shared=final.tx.1.subsample.1.pick.shared, groups=F003D000-F003D002-F003D008)


The label parameter allows you to select what distance levels you would like, and are also separated by dashes.


The picrust program requires green genes OTU IDs. The picrust parameter allows you to provide the OTU ID mapping table associated with your reference taxonomy. When using the picrust option you must provide a reference taxonomy and a *.cons.taxonomy file. You can download the version 13.5 OTU ID maps here: GreenGenes Mapping Tables.

mothur > make.biom(shared=abrecovery.an.shared, label=0.03, reftaxonomy=gg_13_5_99.gg.tax, constaxonomy=abrecovery.an.0.03.cons.taxonomy, picrust=97.gg.otu_map)

Mothur will create a biom file and a new shared file. The new shared file is created because picrust does not allow for duplicate OTU IDs, so any OTUs that are classified to the same taxonomy are merged.


The referencetax parameter is used with the picrust parameter. Picrust requires the greengenes OTU IDs to be in the biom file, and the referencetax parameter allows you to provide your reference taxonomy file you used when classifying your sequences.