$ sqlite3 CONTIGS.db '.tables'collections_colors genes_in_contigs kmer_contigscollections_info genes_in_splits kmer_splitscollections_of_contigs genes_in_splits_summary selfcollections_of_splits hmm_hits_in_contigs splits_basic_infocontig_sequences hmm_hits_in_splitscontigs_basic_info hmm_hits_info$ sqlite3 CONTIGS.db '.schema genes_in_contigs'CREATE TABLE genes_in_contigs (prot text, contig text, start numeric, stop numeric, direction text, figfam text, function text, t_phylum text, t_class text, t_order text, t_family text, t_genus text, t_species text);$ sqlite3 CONTIGS.db 'select * from genes_in_contigs where prot = "prot_00007";'prot_00007|204_10M_MERGED.PERFECT.gz.keep_contig_878|7822|8409|f||Guanylate kinase (EC 2.7.4.8)||||||
$ anvi-get-db-table-as-matrix CONTIGS.db --table genes_in_contigsDatabase .....................................: "CONTIGS.db" has been initiated with its 16 tables.Table ........................................: "genes_in_contigs" has been read with 48 entries and 13 columns.Output .......................................: genes_in_contigs.txt$ head genes_in_contigs.txtprot contig start stop direction figfam function t_phylum t_class t_order t_family t_genus t_speciesprot_00001 204_10M_MERGED.PERFECT.gz.keep_contig_878 13 933 fprot_00002 204_10M_MERGED.PERFECT.gz.keep_contig_878 1114 1677 f Translation elongation factor Pprot_00003 204_10M_MERGED.PERFECT.gz.keep_contig_878 1738 2142 f Transcription termination protein NusBprot_00004 204_10M_MERGED.PERFECT.gz.keep_contig_878 2230 3447 f Carbamoyl-phosphate synthase small chain (EC 6.3.5.5)prot_00005 204_10M_MERGED.PERFECT.gz.keep_contig_878 3440 6820 f Carbamoyl-phosphate synthase large chain (EC 6.3.5.5)prot_00006 204_10M_MERGED.PERFECT.gz.keep_contig_878 6820 7761 f Orotidine 5'-phosphate decarboxylase (EC 4.1.1.23)prot_00007 204_10M_MERGED.PERFECT.gz.keep_contig_878 7822 8409 f Guanylate kinase (EC 2.7.4.8)prot_00008 204_10M_MERGED.PERFECT.gz.keep_contig_878 8497 10350 r Dihydroxy-acid dehydratase (EC 4.2.1.9)prot_00009 204_10M_MERGED.PERFECT.gz.keep_contig_878 10409 10855 f DNA-directed RNA polymerase omega subunit (EC 2.7.7.6)
--
Anvi'o Paper: https://peerj.com/articles/1319/
Project Page: http://merenlab.org/projects/anvio/
Code Repository: https://github.com/meren/anvio
---
You received this message because you are subscribed to the Google Groups "Anvi'o" group.
To unsubscribe from this group and stop receiving emails from it, send an email to anvio+un...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/anvio/CAKSgtzf7EYX8R9KAqoGx0akYYbq8RoSQWn6DvitTRGAKpDBPiw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
gene_callers_id contig start stop direction partial source version
0 k99_6 0 312 f 1 prodigal v2.6.2
1 k99_8 2 305 r 1 prodigal v2.6.2
2 k99_9 0 177 r 1 prodigal v2.6.2
3 k99_9 237 297 f 1 prodigal v2.6.2
4 k99_10 2 353 r 1 prodigal v2.6.2
5 k99_11 1 301 r 1 prodigal v2.6.2
6 k99_13 1 379 f 1 prodigal v2.6.2
7 k99_15 0 378 f 1 prodigal v2.6.2
8 k99_19 0 282 r 1 prodigal v2.6.2
To unsubscribe from this group and stop receiving emails from it, send an email to anvio+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/anvio/1922f90b-7818-4f76-96b4-7c39a2c57fa6%40googlegroups.com.
I now have a new question: I only have a single sample, so I was not able to run anvi-merge, which means I do not have any collections to input to anvi-summarize. Is there a way around that?
for split in `sqlite3 SAMPLE-01/PROFILE.db 'select contig from atomic_data_splits;'`; do echo $split | awk '{print $1 "\tall_contigs"}'; done > collection.txt
anvi-import-collection collection.txt -p SAMPLE-01/PROFILE.db -c CONTIGS.db -C default
anvi-summarize -c CONTIGS.db -p SAMPLE-01/PROFILE.db -C default -o summary
My question is how this affects the binning and functional analysis by anvio.
$ less all_contigs-scg_domain.txt
archaea
$ less all_contigs-scg_domain_confidence.txt
-14.26
But what I was looking for would be a taxonomic tree. Am I looking in the right place?
Thanks!
Elizabeth
--
Elizabeth Hénaff, PhD | elizabeth-henaff.net
Postdoctoral Associate
Institute for Computational Biomedicine,
Weill Cornell Medical College
--
Anvi'o Paper: https://peerj.com/articles/1319/
Project Page: http://merenlab.org/projects/anvio/
Code Repository: https://github.com/meren/anvio
---
You received this message because you are subscribed to a topic in the Google Groups "Anvi'o" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/anvio/WGU85P7Gdao/unsubscribe.
To unsubscribe from this group and all its topics, send an email to anvio+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/anvio/CAK5_LafaureFaVwHkMJLBZu465PGGfd%2BW%2BkWEEBW9smPpjFNYg%40mail.gmail.com.
You received this message because you are subscribed to the Google Groups "Anvi'o" group.
To unsubscribe from this group and stop receiving emails from it, send an email to anvio+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/anvio/CAMj1pR%3DcgYTKoO6bnyLYXNV5UBU1Tsm5%2BCVVMD6Q9edzEAEvXA%40mail.gmail.com.
--
Elizabeth Hénaff, PhD | elizabeth-henaff.net
Postdoctoral Associate
Institute for Computational Biomedicine,
Weill Cornell Medical College
To view this discussion on the web visit https://groups.google.com/d/msgid/anvio/CAK5_LacgejeEKb6YHpQD6zyK2jiw3GnteVrALHT9yfsk9sYwTA%40mail.gmail.com.
I did run centrifuge, and imported those results. As a sanity check -- is there a way to checkwhat imports have been made into a contig database?
$ sqlite3 CONTIGS.db 'select * from genes_taxonomy limit 10;'0|11|12|13|14|15|16|17|18|19|1$ sqlite3 CONTIGS.db 'select * from splits_taxonomy limit 10;'204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00001|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00002|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00003|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00004|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00005|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00006|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00007|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00008|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00009|1204_10M_MERGED.PERFECT.gz.keep_contig_878_split_00010|1$ sqlite3 CONTIGS.db 'select * from taxon_names limit 10;'1|||||Bifidobacterium|Bifidobacterium adolescentis(...)
I'm running a lot of tests in parallel and I want to make sure I've got all my steps straight :)
Also, can I import taxonomy/functions after profiling a bam? Or does that need to be done before to be meaningful?