chimera.ccode
Use Ccode approach .…
Algorithm
Default settings
The fasta and template parameters are required. You may enter multiple fasta files by separating them by dashes. Example: fasta=ex.align-abrecovery.align.
mothur > chimera.ccode(fasta=ex.align, template=core_set_aligned.imputed.fasta)
The output to the screen looks like:
mothur > chimera.ccode(fasta=ex.align, template=core_set_aligned.imputed.fasta)
Reading sequences and template file... Done.
Finding top matches for sequences... Done.
gi|11093940|MNF8|AF293012 was found have at least one chimeric window.
gi|11093937|MNF2|AF293009 was found have at least one chimeric window.
gi|11093934|MND1|AF293006 was found have at least one chimeric window.
gi|11093933|MNA5|AF293005 was found have at least one chimeric window.
gi|11093932|MNA1|AF293004 was found have at least one chimeric window.
gi|11093930|MNH4|AF293002 was found have at least one chimeric window.
gi|11093926|MNH2|AF292998 was found have at least one chimeric window.
gi|11093924|MNF4|AF292996 was found have at least one chimeric window.
Opening ex.ccode.chimeras you would see:
For full window mapping info refer to ex.mapinfo
...
gi|11093940|MNF8|AF293012
Reference sequences used and distance to query:
1456 0.124
141312 0.138
102418 0.140
110103 0.149
183441 0.234
99346 0.236
142598 0.239
136664 0.239
1443 0.251
216057 0.256
70898 0.262
167370 0.263
10709 0.264
225365 0.265
168547 0.265
27824 0.268
10686 0.269
10763 0.269
147978 0.269
123055 0.269
Mapping information:
Window StartPos EndPos
1 108 778
2 778 1448
3 1448 2118
4 2118 2788
5 2788 3458
6 3458 4128
7 4128 4798
8 4798 5468
9 5468 6138
10 6138 6808
11 6808 6812
Window AvgQ (sdQ) AvgR (sdR) Ratio Anova
1 47.550 20.109 54.095 11.853 0.879 4.709
2 21.350 8.677 21.711 5.681 0.983 0.065
3 43.250 20.749 49.658 13.050 0.871 3.829
4 49.050 16.863 50.247 11.667 0.976 0.173
5 0.001 0.001 0.001 0.001 1.000 0.004
6 16.350 8.916 20.821 7.403 0.785 6.340
7 45.150 10.811 46.626 12.730 0.968 0.250
8 29.600 5.020 25.779 7.435 1.148 5.029
9 30.650 3.345 24.074 7.534 1.273 14.877
10 54.500 6.126 44.747 13.275 1.218 10.523
11 0.450 0.510 0.521 0.501 0.864 0.363
Window ConfidenceLimit t-Student Anova
1
2
3
4
5
6
7
8 * *
9 * * *
10 * *
11
...
The * indicate that the window was found to be chimeric using the given limits.
Options
filter
By default the filter parameter is set to false, but if you set it to true a 50% soft vertical filter will be applied.
mothur > chimera.ccode(fasta=ex.align, template=core_set_aligned.imputed.fasta, filter=t)
Reading sequences and template file... Done.
Finding top matches for sequences... Done.
Filter removed 6397 columns.
gi|11093940|MNF8|AF293012 was found have at least one chimeric window.
gi|11093939|MNB2|AF293011 was found have at least one chimeric window.
gi|11093937|MNF2|AF293009 was found have at least one chimeric window.
gi|11093934|MND1|AF293006 was found have at least one chimeric window.
gi|11093932|MNA1|AF293004 was found have at least one chimeric window.
gi|11093930|MNH4|AF293002 was found have at least one chimeric window.
gi|11093926|MNH2|AF292998 was found have at least one chimeric window.
gi|11093924|MNF4|AF292996 was found have at least one chimeric window.
mask
By default there is no mask applied, but you can set it to a file containing your mask or mask=default will apply the ecoli mask.
mothur > chimera.ccode(fasta=ex.align, template=core_set_aligned.imputed.fasta, mask=default)
I am using the default 236627 EU009184.1 Shigella dysenteriae str. FBD013.
Reading sequences and template file... Done.
Finding top matches for sequences... Done.
gi|11093939|MNB2|AF293011 does not have an adaquate number of reference sequences that are within 20% and 0.5%
similarity. I will continue, but please check.
gi|11093941|MNA3|AF293013 was found have at least one chimeric window.
gi|11093940|MNF8|AF293012 was found have at least one chimeric window.
gi|11093939|MNB2|AF293011 was found have at least one chimeric window.
gi|11093938|MNC2|AF293010 was found have at least one chimeric window.
gi|11093937|MNF2|AF293009 was found have at least one chimeric window.
gi|11093934|MND1|AF293006 was found have at least one chimeric window.
gi|11093933|MNA5|AF293005 was found have at least one chimeric window.
gi|11093930|MNH4|AF293002 was found have at least one chimeric window.
gi|11093926|MNH2|AF292998 was found have at least one chimeric window.
gi|11093924|MNF4|AF292996 was found have at least one chimeric window.
With the ecoli mask and filter applied...
mothur > chimera.ccode(fasta=ex.align, template=core_set_aligned.imputed.fasta, mask=default, filter=t)
I am using the default 236627 EU009184.1 Shigella dysenteriae str. FBD013.
Reading sequences and template file... Done.
Finding top matches for sequences... Done.
Filter removed 259 columns.
gi|11093940|MNF8|AF293012 was found have at least one chimeric window.
gi|11093939|MNB2|AF293011 was found have at least one chimeric window.
gi|11093937|MNF2|AF293009 was found have at least one chimeric window.
gi|11093934|MND1|AF293006 was found have at least one chimeric window.
gi|11093932|MNA1|AF293004 was found have at least one chimeric window.
gi|11093930|MNH4|AF293002 was found have at least one chimeric window.
gi|11093926|MNH2|AF292998 was found have at least one chimeric window.
window
The window parameter is used to determine the length of sequence you want in each window analyzed. By default it is set to 10% of the sequence length. It is recommended that your window size be within 5 and 20% of your trimmed sequence length.
mothur > chimera.ccode(fasta=ex.align, template=core_set_aligned.imputed.fasta, window=400)
Reading sequences and template file... Done.
Finding top matches for sequences... Done.
gi|11093940|MNF8|AF293012 was found have at least one chimeric window.
gi|11093939|MNB2|AF293011 was found have at least one chimeric window.
gi|11093937|MNF2|AF293009 was found have at least one chimeric window.
gi|11093934|MND1|AF293006 was found have at least one chimeric window.
gi|11093932|MNA1|AF293004 was found have at least one chimeric window.
gi|11093930|MNH4|AF293002 was found have at least one chimeric window.
gi|11093929|MNC12|AF293001 was found have at least one chimeric window.
gi|11093928|MNG3|AF293000 was found have at least one chimeric window.
gi|11093926|MNH2|AF292998 was found have at least one chimeric window.
numwanted
The numwanted parameter allows you to specify how many sequences you would each query sequence compared with. By default it is set to 20.
mothur > chimera.ccode(fasta=ex.align, template=core_set_aligned.imputed.fasta, numwanted=10)
Reading sequences and template file... Done.
Finding top matches for sequences... Done.
gi|11093940|MNF8|AF293012 was found have at least one chimeric window.
gi|11093937|MNF2|AF293009 was found have at least one chimeric window.
gi|11093934|MND1|AF293006 was found have at least one chimeric window.
gi|11093933|MNA5|AF293005 was found have at least one chimeric window.
gi|11093932|MNA1|AF293004 was found have at least one chimeric window.
gi|11093930|MNH4|AF293002 was found have at least one chimeric window.
gi|11093926|MNH2|AF292998 was found have at least one chimeric window.
removechimeras
The removechimeras parameter allow you to remove the chimeras from your files instead of just flagging them. Default=t.
processors
To speed up your processing the chimera.ccode command can be run with multiple processors by using the processors parameter. By default the processors parameter is 1. If you are using the mpi-enabled version processors is set to the number of processes running.
mothur > chimera.ccode(fasta=ex.align, template=core_set_aligned.imputed.fasta, processors=2)
This method was written using the algorithms described in the “Evaluating putative chimeric sequences from PCR-amplified products” paper by Juan M. Gonzalez, Johannes Zimmerman and Cesareo Saiz-Jimenez.
Revisions
- 1.38.0 - Removes save option.
- 1.47.0 Adds removechimeras parameter to chimera commands to auto remove chimeras from files. #795