cogent.core.alphabet.AlphabetError: F


El C

Mar 28, 2017, 5:42:34 AM
to Qiime 1 Forum

After filtering to 99% similarity, I am building a new phylogenetic tree that includes known sequences alongside my samples. I am using MAFFT to align sequences of different lengths.

An error comes up when I run the command below.

Does anyone know why this is? What kind of error is AlphabetError: F?

Thanks

(qiime1) user-15609: user15$ align_seqs.py -i align_otus_w_known_seqs/all_seqs.fasta -o align_otus_w_known_seqs/aligned_seqs -m mafft

Traceback (most recent call last):

  File "/Users/user15/miniconda3/envs/qiime1/bin/align_seqs.py", line 4, in <module>

    __import__('pkg_resources').run_script('qiime==1.9.1', 'align_seqs.py')

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/setuptools-27.2.0-py2.7.egg/pkg_resources/__init__.py", line 744, in run_script

    

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/setuptools-27.2.0-py2.7.egg/pkg_resources/__init__.py", line 1499, in run_script

    

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/qiime-1.9.1-py2.7.egg-info/scripts/align_seqs.py", line 211, in <module>

    main()

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/qiime-1.9.1-py2.7.egg-info/scripts/align_seqs.py", line 208, in main

    log_path=log_path)

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/qiime/align_seqs.py", line 123, in __call__

    log_path=log_path, *args, **kwargs)

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/qiime/util.py", line 259, in __call__

    result = self.getResult(*args, **kwargs)

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/qiime/align_seqs.py", line 117, in getResult

    result = module.align_unaligned_seqs(seqs, params=params)

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/burrito_fillings-0.1.1-py2.7.egg/bfillings/mafft.py", line 261, in align_unaligned_seqs

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/cogent/core/alignment.py", line 298, in __init__

    Alphabet=self.Alphabet)

  File "/Users/ecg1g15/miniconda3/envs/qiime1/lib/python2.7/site-packages/cogent/core/alignment.py", line 425, in _names_seqs_order

    curr_seqs = self._coerce_seqs(seqs, is_array)

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/cogent/core/alignment.py", line 471, in _coerce_seqs

    return map(self.MolType.Sequence, seqs)

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/cogent/core/sequence.py", line 519, in __init__

    wildcards_allowed)

  File "/Users/user15/miniconda3/envs/qiime1/lib/python2.7/site-packages/cogent/core/moltype.py", line 575, in verifySequence

    raise AlphabetError(motif)

cogent.core.alphabet.AlphabetError: F

abir...@gmail.com

Mar 29, 2017, 1:33:26 AM
to Qiime 1 Forum
Hi,
The error here is actually more straightforward than it may seem: it is telling you that one of the input sequences you provided contains an "F" somewhere :) This is a problem because F is not a recognized IUPAC nucleotide code, so the software can't align a sequence that contains it and has to give up. I suggest searching your inputs for that "F" to investigate further.
Best,
Amanda
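
A minimal sketch of the search suggested above, in plain Python and independent of QIIME. The function name `find_invalid_characters` and the choice to tolerate the gap characters `-` and `.` are assumptions made for illustration; the allowed set is the standard IUPAC nucleotide codes. It skips header lines (those starting with ">") and reports every other line that contains a character outside the allowed set.

```python
# Standard IUPAC nucleotide codes (DNA and RNA).
IUPAC_NUCLEOTIDES = set("ACGTURYSWKMBDHVN")
# Assumption: gap characters are tolerated; remove them to be stricter.
GAP_CHARS = set("-.")


def find_invalid_characters(lines, allowed=IUPAC_NUCLEOTIDES | GAP_CHARS):
    """Yield (line_number, line, bad_chars) for each non-header line
    that contains characters outside `allowed`."""
    for line_number, raw in enumerate(lines, start=1):
        line = raw.strip()
        # Blank lines and FASTA headers are not sequence data.
        if not line or line.startswith(">"):
            continue
        bad = sorted(set(line.upper()) - allowed)
        if bad:
            yield line_number, line, bad


# Small illustrative input, not a real file.
example = [
    ">seq1 some description",
    "ACGTNACGT",
    "AF091321",
    "GATGAACGCTGGCG",
]
for line_number, line, bad in find_invalid_characters(example):
    print(line_number, line, bad)
```

To check a real file, pass an open file handle instead of the list, for example `find_invalid_characters(open("all_seqs.fasta"))`. Any line it reports is being read as sequence data, whatever it was meant to be.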

El C

Apr 25, 2017, 12:57:57 PM
to Qiime 1 Forum
Hi, 
I have grepped the file, and the only F's that appear are in the sequence names:

AF013027

AF013030

AF091322

AF013028

AF013029

AF091321

AF518770

JF262061


How can I make align_seqs.py not treat those F's as part of the sequence?


The file is laid out like this (I didn't copy the whole file; this just shows the structure):


>denovo1078 gDNA15_112947

ACTACGGGTGGGCAGCAGTGGGGAATTTTC...

>denovo11849 gDNA20_1051349

CCTACGGGTGGCAGCAGTCGGGAATATTGG...

.

.

.

--

AF091321

GATGAACGCTGGCG...

--

AF518770

AAGTCGAACGGACTCTTCGGAG...

.

.

.


Thanks
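
One possible reading of the layout above, offered as an assumption rather than a confirmed diagnosis: in FASTA, only lines that begin with ">" are headers, so a bare name such as AF091321 would be read as sequence data, and the "--" lines are not part of the format either. The sketch below shows how such lines could be normalized. The function name `repair_fasta_lines` and the accession pattern (two capital letters followed by six digits, matching the names listed in this post) are hypothetical choices for illustration.

```python
import re

# Assumption: bare names look like the ones in this thread,
# i.e. two capital letters followed by six digits (e.g. AF091321).
ACCESSION = re.compile(r"^[A-Z]{2}\d{6}$")


def repair_fasta_lines(lines):
    """Return a cleaned list of lines: blank lines and '--' separators
    are dropped, and bare accession lines get a leading '>'."""
    repaired = []
    for raw in lines:
        line = raw.strip()
        if not line or line == "--":
            continue
        if ACCESSION.match(line):
            # Turn the bare name into a proper FASTA header.
            line = ">" + line
        repaired.append(line)
    return repaired


# Small illustrative input mirroring the structure shown above.
example = [
    ">denovo1078 gDNA15_112947",
    "",
    "ACTACGGGTGGGCAGCAG",
    "--",
    "AF091321",
    "",
    "GATGAACGCTGGCG",
]
for line in repair_fasta_lines(example):
    print(line)
```

Nucleotide lines contain no digits, so the pattern cannot match real sequence data. It is still worth writing the result to a new file and inspecting it before aligning.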

Greg Caporaso

Apr 25, 2017, 6:12:01 PM
to Qiime 1 Forum
Hello, would you be able to share the all_seqs.fasta that you're trying to align? Amanda is correct: this error generally means that there is a disallowed character in your input file (in this case an F). I'd like to try to run this locally to see if I get the same error message.

Thanks! 
Greg

El C

Apr 26, 2017, 3:58:43 AM
to Qiime 1 Forum
Hi Greg, 
Here is the file:
Many thanks
E

>denovo1078 gDNA15_112947

ACTACGGGTGGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCGTGTAGTC

>denovo11849 gDNA20_1051349

CCTACGGGTGGCAGCAGTCGGGAATATTGGACAATGGGGGCAACCCTGATCCAGCCATGCCGCGTGAGTGATGAAGGCCTTCGGGTTGTAAAGCTCTTTTCTCTGTGAGGATGATGACAGTAACAGAGGAATAAGCCCCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGGGCTAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCGCGTAGTC

>denovo12973 gDNA4_2409906

CCTACTGGTGGTTGCCGTTGGTTCTTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCTAGTAGTC

>denovo13868 gDNA18_1409608

CCTACGTGTGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAATCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCTTGTAGTC

>denovo14087 gDNA20_996660

CCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGAGATCGGGAGGAATACCAGTGGCGAAGGCGGCCTTCTGGACTGACACTGACACTGAGGTGCGAAAGCGTGGGTAGCAAACAGGATTAGATACCCTAGTAGTC

>denovo14583 gDNA10_1996827

TCTACGGGTGGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCGAGTAGTC

>denovo14985 gDNA6_3303618

CCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCGCGTAAGTC

>denovo16072 gDNA18_1375582

CCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGACTCCCTGGCCAAAGACTGACGCTCATGTGCGAAAGTGTGGGTAGCGAACAGGATTAGATACCCCCGTAGTC

>denovo16534 gDNA7_3570864

CCTACGGGAGCGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCTGGTAGTC

>denovo16845 gDNA20_958226

CCTACGGGTGGCAGCAGTCGGGAATATTGGACAATGGGGGCAACCCTGATCCAGCCATGCCGCGTGAGTGATGAATGCCTTCGGGTTGTAAAGCTCTTTTCTCTGTGAGGATGATGACAGTAACAGAGGAATAAGCCCCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGGGCTAGCGTTGTTCGGAATTACTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCGCGTAGTC

>denovo17021 gDNA18_1544902

CCTACGGGATGCAGCAGTGGGGAATATTGCACAATGGGGGAAACCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCCGTAGTC

>denovo1859 gDNA16_435287

CCTCCTTTCTTCCTCCTTTTGTCCTTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCTCGTAGTC

>denovo2009 gDNA14_660764

CCTACGGGGTGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAATAACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTTAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACGCTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCTGGTAGTC

>denovo2371 gDNA15_136491

CCTACGGGGGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTTCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCTTGTAGTC

>denovo3624 gDNA18_1355185

CCTACGGTGTGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGATGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCAATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCCGTAGTC

>denovo39 gDNA20_914511

CCTACGGGGGGCAGCAGTCGGGAATATTGGACAATGGGGGCAACCCTGATCCAGCCATGCCGCGTGAGTGATGAAGGCCTTCGGGTTGTAAAGCTCTTTTCTCTGTGAGGATGATGACAGTAACAGAGGAATAAGCCCCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCGGTAGTC

>denovo4334 gDNA10_1997841

TCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGTGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGCCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCGGTAGTC

>denovo5796 gDNA15_4465

TCTACGGGTTGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATCAGATACCCTCGTAGTC

>denovo6173 gDNA20_1070986

CCTACGGGAGGCAGCAGTCGGGAATATTGGACAATGGGGGCAACCCTGATCCAGCCATGCCGCGTGAGTGATGAAGGCCTTCGGGTTGTAAAGCTCTTTTCTCTGTGAGGATGATGACAGTAACAGAGGAATAAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCTCGTAGTC

>denovo6683 gDNA15_10908

CCTACGGGGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCGAGTAGTC

>denovo7165 gDNA15_2159

ACTACGGGGGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCGCGTAGTC

>denovo7590 gDNA15_53122

CCTACGGGGTGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCCGTAGTC

>denovo7760 gDNA16_405362

CCTACTTTTTTCCTCCTTTGGTCCTTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCTGTAGTC

>denovo9392 gDNA16_257131

CCTACGGGGTGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCGGGTAGTC

AF013027

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGGCTCTTCGGAGCTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGATAAAAGCATACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGGCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGNAACGCGAAGAACCTTACAAGGACTTGACATGTCGCGAATCTCAGGNAAACTTGAGAGCGCCTTCGGGAGCGTGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTAGAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAAGGCAGCGAACCCGAGAGGGGGAGCTAATCCCAAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAA

--

AF013030

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGACTCTTCGGAGTTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTAGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCCGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGTGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGACTTGACATGTCGCGAATCTAATGGAAACATGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGAAGAGACTGCCGGTGACAAACCGGAGGAAGGCGGGGATGACGTCAAGTCAGCATGCCCCTTGCGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGTAGCGAACCCGAGAGGGGAAGCCAATCCCCAAAACCCAGTCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAA

--

AF091322

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGAGCTCTTCGGAGCTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGATAAAAGCATACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAGGACCGCGTGGGGGAGGAAAGCTCTAGGGTTATAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTANGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGGCCGCACAAGCGGTGGAGTATGTGGTTTANTTCGATGCAACGCGAAGAACCTTACCAGGACTTGACATGTCGCGAATCTCAGGGAAACTTGAGAGTGCCTTCGGGAGCGTGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTAGAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAAGGCAGCGAACCCGAGAGGGGGAGCTAATCCCAAAAACCCAGNCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGNTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATCACTCTAACCCGAAAGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAA

--

AF013028

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGACTCTTTTGAGTTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCTTAGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAGGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAATAACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTTAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACGCTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGNTTGACATGTCGCGAATCTCAAGGAAACTTGGGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGGAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCTCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAACCCGAGAGGGGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAA

--

AF013029

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGACTCTTCGGAGTTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCCGCTAATCCCGGATGAGCCTTAGGGTAAAAGATAAATTGCCTGGAGATGAGCTTGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGCAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGAAATGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCCGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTTAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCGAAACTGACGCTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCTCAAGGAAACTTGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGGAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCTCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAACCCGAGAGGGGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGAGGGGGATGCCGAAGGCAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAA

--

AF091321

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGGCTCTTCGGAGCTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAGCAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGATAAAAGCATACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCCGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTANTTCGATGNAACGNGAAGAACCTTACCAGGACTTGACATGTCGNGAATCTCAGGGAAACTTGAGAGTGCCTTCGGGAGCGTGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTAGAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAAGGCAGCGAACCCGAGAGGGGGAGCTAATCCCAAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGAGGGGGATNCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAACCGTACCGGAA

--

AF518770

AAGTCGAACGGACTCTTCGGAGTTAGTGNCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCTTAGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCGGATTAGCTAGTTGGTGTGGTAAAAGCATTCCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCAAAAGCGCTTTGCTAGATCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGACTTGACATGTCGCGAATCTAATGGAAACATGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGAAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGTAGCGAACCCGAGAGGGGAAGCCAATCCCCAAAACCCAGTCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAA

--

CP000393

GAGAGTTTGATCCTGGCTCAGGATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGACTCTTCGGAGTTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCTTAGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGACTTGACATGTCGCGAATCTAATGGAAACATGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGAAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGTAGCGAACCCGAGAGGGGAAGCCAATCCCCAAAACCCAGTCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGTGGCTGGATCACCTCCTTT

--

GU238278

ATGCAGTCGAACGGAGTCCTTCGGGACTTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCGTTAGGTAAAAGATTAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGGGGTAAAGGCCTACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGTTCAACTCTGGTCAGGCAGTGGAAACTACGTAGCTAGAGTCTGGTAGGGGCAAAGGGAATTCCCGGTGTAACGTTGAAATGCGTATATATAGGTAAGAACATCGGTGGCGAAAGCGCTTTTGCTAGACCAGAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCTTAGGGAAACTTGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGAACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAACCCGCGAGGAGGAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCTTAA

--

GU238269

AGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACGTAGCTAGAGTCTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGAGATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAGAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCCGGGTGAAAGTCTGGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGAACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAAGCCGCGAGGTGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGCGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCATTTTGTGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGTGGCTGGATCACCTCCTTT

--

GU238276

TCCCGGATGAGCCTTTGGGTAAAAGATTAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGTTCTAGGATTGTAAACCCCTTTTCTTTGGGAAGAAGTTCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCAGTGTAAGTCTGCTGTTAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACACAGCTAGAGTCTGGTAGGGGTAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGAGATCGGGAAGAACATCGGTGGCGAAAGCGCTTTACTAGACCAGAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCCAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCNCAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCCGGGTGAAAGTCTGGAGTGCCTTAGGGAACGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGAACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAAGCCGCGAGGTGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGCGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCATTTTGTGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGTGGCTGGATCACCTCCTTT

--

GU724203

CGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCTAACGGTCAAAGATTAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTAGGAAGAAGTTCTGACGGTACTAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGTGCTCAACACTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCGAAACTGACACTCAGGGACGAAGGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCCGGGTGAAAGTCTGGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCAATCATTAAGTTGGGCACTCTAAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAAACCGCGAGGTGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCTGATGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGT

--

HM585025

GGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATTAATTGCCTGAAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGGGTAAAGGCATACCAAGGCAACGATCGGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGCAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTTCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTCTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGAGATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAGAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCCGGGTGAAAGTCTGGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGAACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAACCCGCAAGGAGGAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCGTTACC

--

HM630164

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGGCTCTTCGGAGCTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATTAATTGCCTGGAGATGAGCTCGCGTCCGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACGTAGCTAGAGTCTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGAGATCGGGAAGAACATCGGTGGCGAAAGCGCATTGCTAGACCAGAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCCGGGTGAAAGTCCGGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCCCTGGGCTACACACGTACTACAATGGCTGGGACAGAGGGCAGCGAACCCGCGAGGAGGAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCTTATGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTG

--

HM630192

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGACTCTTCGGAGTTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCGATGGGTAAAAGATTAATTGCCTGAAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAGGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCAATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACGTAGCTAGAGTCTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGAGATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAGAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCTCGGTGAAAGTCGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGAACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAAGCCGCGAGGTGGAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATAGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCTTAATGGAGGGGATGCCAAGT

--

HQ419199

CGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATTAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTAGGAAGAACAAAATGACGGTACTAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACGTAGCTAGAGTCTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGAGATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAGAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCGGCCGTGCCAAAAACATTGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCTCGGTGAAAGCTGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGTAGCGAACCCGCGAGGGGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTC

--

HQ419200

CGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATTAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGGGGTAAAAGCTTACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTTCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGAAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACGTAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCGAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCTCGGTGAAAGCTGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGTAGCGAACCCGCGAGGGGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTC

--

HQ419197

CGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATTAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTAGGAAGAACAAAATGACGGTACTAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACGTAGCTAGAGTCTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGAGATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTTGACCAGAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCGGCCGTGCCAAAAACATTGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCTCGGTGAAAGCTGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGTAGCGAACCCGCGAGGGGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTC

--

JF262061

GGATGAGCCTAACGGTCAAAGATTAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTAGGGAGAAGTTCTGACGGTACTAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGTGCTCAACACTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGAAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCGAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAGTTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGGCTTGACATGTCGCGAATCCGGGTGAAAGTCTGGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTCGCCATCATTAAGTTGGGCACTCTAAAGGGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGCCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGCAGCGAAGCCGCGAGGTGAAGCCAATCCCCAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAG

--

X70767

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGACTCTTCGGAGTTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCTTAGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGGTAAAAGCATACCAAGGCAACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAGGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGATCTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGRRGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGTAGATGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTCAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTTGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACATTGAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCAGGACTTGACATGTCGCGAATCTAATGGAAACATGAGAGTGCCTTCGGGAGCGCGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTGAAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAGGGTAGCGAACCCGAGAGGGGAAGCCAATCCCCAAAACCCAGTCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGRGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGTGGCTGGATCACCTCCTTT

Jai Ram Rideout

Apr 26, 2017, 7:02:48 PM
to Qiime 1 Forum
Hello,

The file you pasted doesn't appear to be a valid FASTA file, which is required by align_seqs.py. The first portion of the file looks okay but the second portion isn't valid FASTA. Did two files get concatenated accidentally?

Specifically, this is the portion of the file that looks odd:

AF013027

GATGAACGCTGGCGGTCTGCTTAACACATGCAAGTCGAACGGGCTCTTCGGAGCTAGTGGCGGACGGGTGAGTAACGCGTGAGAATCTGCCTTCAGGTCTGGGACAACAGAAGGAAACTTCTGCTAATCCCGGATGAGCCCTTGGGTAAAAGATAAATTGCCTGGAGATGAGCTCGCGTCTGATTAGCTAGTTGGTGTGATAAAAGCATACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGATGAGCAGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATTTTCCGCAATGGGCGAAAGCCTGACGGAGCAAGACCGCGTGGGGGAGGAAAGCTCTAGGGTTGTAAACCCCTTTTCTTTGGGAAGAAGTACTGACGGTACCAAAGGAATCAGCCTCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAGGCAAGCGTTATCCGGAATGATTGGGCGTAAAGCGTCCGCAGGTGGCCATGTAAGTCTGCTGTCAAAACCCAGGGCTTAACTCTGGTCAGGCAGTGGAAACTACAAAGCTAGAGTTCGGTAGGGGCAAAGGGAATTCCCGGTGTAGCGGTGAAATGCGTAGATATCGGGAAGAACATCGGTGGCGAAAGCGCTTTGCTAGACCAAAACTGACACTCAGGGACGAAAGCTAGGGGAGCGAATGGGATTAGATACCCCAGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTTGCCTGTATCGACCCAGGCAGTGCCGTAGCTAACGCGTTAAGTATCCCGCCTGGGGAGTACGCACGCAAGTGTGAAACTCAAAGGAATTGACGGGGGGCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGATGNAACGCGAAGAACCTTACAAGGACTTGACATGTCGCGAATCTCAGGNAAACTTGAGAGCGCCTTCGGGAGCGTGAACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTCTTTAGTTGCCATCATTAAGTTGGGCACTCTAGAGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCAGCATGCCCCTTACGTCCTGGGCTACACACGTACTACAATGGTTGGGACAGAAGGCAGCGAACCCGAGAGGGGGAGCTAATCCCAAAAACCCAGCCTCAGTTCAGATTGCAGGCTGCAACTCGCCTGCATGAAGGAGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATCCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGAAGTTGGCCACGCCCGAAGTCATTACTCTAACCCGAAAGGGAGGGGGATGCCGAAGGTAGGGCTGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTACCGGAA

--

AF013030

GATGAACGCTGGCG...
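
If it helps to locate where the file stops being valid FASTA, here is a rough Python sketch that flags suspicious lines. It is only an illustration under a few assumptions: `find_fasta_problems` is a made-up helper and not part of QIIME, the sequences are assumed to be nucleotide data using IUPAC codes, and blank lines are simply skipped. Accession lines such as AF013027 have no leading ">", so they would presumably be read as sequence data; "F" is not a nucleotide code, which may be where "AlphabetError: F" comes from.

```python
# Hypothetical helper (not part of QIIME): report lines that break basic FASTA structure.
# Assumption: nucleotide sequences using IUPAC codes, with "-" and "." allowed as gaps.
VALID_CHARS = set("ACGTURYKMSWBDHVN-.")


def find_fasta_problems(lines):
    """Return (line_number, message) pairs for lines that do not look like valid FASTA."""
    problems = []
    seen_header = False
    for number, raw in enumerate(lines, start=1):
        line = raw.strip()
        if not line:
            continue  # blank lines are skipped in this sketch
        if line.startswith(">"):
            seen_header = True
            continue
        if not seen_header:
            problems.append((number, "sequence data before any '>' header"))
        if not line.strip("-."):
            # Lines such as "--" contain only gap characters and no residues.
            problems.append((number, "line contains no sequence residues"))
            continue
        bad = sorted(set(line.upper()) - VALID_CHARS)
        if bad:
            problems.append((number, "unexpected characters: " + "".join(bad)))
    return problems


# Small example shaped like the odd portion of the file.
example = [
    ">seq1",
    "ACGTN",
    "AF013027",
    "",
    "GATGAACGCTGGCG",
    "",
    "--",
]
for number, message in find_fasta_problems(example):
    print(number, message)
```

To check the real file, the open file handle can be passed in directly, for example `find_fasta_problems(open("align_otus_w_known_seqs/all_seqs.fasta"))`.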


Best,

Jai

El C

May 3, 2017, 7:16:39 AM
to Qiime 1 Forum
Thanks, Jai. I edited the file to delete that last part, and it worked well.
The two files may have been merged accidentally, as you said.
Thanks

El