mgcluster
The mgcluster command .…
For this tutorial you should download the Example Files file and decompress it.
Default settings
The blast parameter is required...
mgcluster(blast=bap.100.blast)
Options
name
mgcluster(blast=bap.100.blast, name=bap.100.names)
count
The count file is similar to the name file in that it is used to represent the number of duplicate sequences for a given representative sequence. mothur will use this information to form the correct OTU’s. Unlike, when you use a name file the list file generated will contain only the unique names, so be sure to include the count file in downstream analysis with the list file.
mothur > make.table(name=bap.100.names)
mothur > mgcluster(blast=bap.100.blast, count=bap.100.count_table)
method
- Nearest_neighbor (nearest): Each of the sequences within an OTU are at most X% distant from the most similar sequence in the OTU.
- Furthest_neighbor (furthest): All of the sequences within an OTU are at most X% distant from all of the other sequences within the OTU.
- Average_neighbor (average): This method is a middle ground between the other two algorithms.
- OptiClust (opti, the default): OTUs are assembled using metrics to determine the quality of clustering.
metric
The metric parameter allows to select the metric in the opticluster method. Options are Matthews correlation coefficient (mcc), sensitivity (sens), specificity (spec), true positives + true negatives (tptn), false positives + false negatives (fpfn), true positives (tp), true negative (tn), false positive (fp), false negative (fn), f1score (f1score), accuracy (accuracy), positive predictive value (ppv), negative predictive value (npv), false discovery rate (fdr). Default=mcc.
initialize
The initialize parameter allows to select the initial randomization for the opticluster method. Options are singleton, meaning each sequence is randomly assigned to its own OTU, or oneotu meaning all sequences are assigned to one otu. We have found initialize=singleton to produce better clustering in less time. Default=singleton.
delta
The delta parameter allows to set the stable value for the metric in the opticluster method Default delta=0.0001. To reach a full convergence, set delta=0.
iters
The iters parameter allow you to set the maxiters for the opticluster method. Default=100.
cutoff
default = 0.70
mgcluster(blast=bap.100.blast, cutoff=1.0)
precision
default = 100
merge
Default = true, meaning merge after clustering mgcluster(blast=bap.100.blast, merge=f)
min
default = true, meaning use minimum blast score ratio
mgcluster(blast=bap.100.blast, min=f)
length
default = 5, minimum overlap
mgcluster(blast=bap.100.blast, length=10)
penalty
default = 0.10
mgcluster(blast=bap.100.blast, penalty=0.15)
Revisions
- 1.28.0 - Added the count parameter
- 1.40.0 - Adds opti method
- 1.40.4 - Bug Fix: Cluster commands printing of list file. #454