$ vsearch --derep_full ERR348713.fasta --output ERR348713.derep.fasta --sizeout
vsearch v1.10.0_osx_x86_64, 16.0GB RAM, 8 cores
https://github.com/torognes/vsearch
Reading file ERR348713.fasta 100%
53888560 nt in 213043 seqs, min 248, max 255, avg 253
Dereplicating 100%
Sorting 100%
21496 unique sequences, avg cluster 9.9, median 1, max 22479
Writing output file 100%
Chimera detection:
$ vsearch --uchime_denovo ERR348713.derep.fasta --chimeras chimera.out.fasta --nonchimeras non.chimera.out.fasta
vsearch v1.10.0_osx_x86_64, 16.0GB RAM, 8 cores
https://github.com/torognes/vsearch
Reading file ERR348713.derep.fasta 100%
5434813 nt in 21496 seqs, min 248, max 255, avg 253
Masking 100%
Sorting by abundance 100%
Counting unique k-mers 100%
Detecting chimeras 100%
Found 8385 (39.0%) chimeras, 12887 (60.0%) non-chimeras,
and 224 (1.0%) borderline sequences in 21496 unique sequences.
Taking abundance information into account, this corresponds to
26953 (12.7%) chimeras, 184839 (86.8%) non-chimeras,
and 1251 (0.6%) borderline sequences in 213043 total sequences.