Expresso/3Dcoffee with local pdb database

Esteban López Tavera

Aug 19, 2022, 4:23:57 AM

to Tcoffee

Dear T-coffee team,

I would like to run expresso relying only on a set of pdb files that I have (approx. 150), to align a set of ~1200 sequences. I would like to avoid t-coffee trying to fetch

My guess is I need to run expresso (if I don't want to build a template file myself) and:

- Build a file that can be the input to the option -pdb_db from my set of pdb files.

- Input all the pdb files with the -pdb option.

My questions are:

- What should the -pdb_db input look like? Any advice on how to build it?

- Is there a way of inputting all the pdb files at once instead of individually?

Thanks in advance for your help.

Best,

Esteban

Athanasios Baltzis

Aug 19, 2022, 4:52:35 AM

to tco...@googlegroups.com

Dear Esteban,

If I understand it correctly, you want expresso to search locally for templates only on this set of pdb files.

If this is the case, then you have to create a local blast-able sequence database from this set of pdb files and run the following command:

t_coffee <QUERY_FASTA> -mode expresso -blast=LOCAL -pdb_db=<PDB_DB_PATH - e.g. $HOME/db/myset.fasta> -pdb_type d -pdb_min_sim 95 -pdb_min_cov 90
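
As a rough sketch, the same call can be assembled from a Python script, which helps when the run has to be repeated or logged. The function name and the example paths (query.fasta, db/myset.fasta) are hypothetical; the flags are the ones from the command in this message, with the coverage flag spelled -pdb_min_cov.

```python
import shlex


def build_expresso_command(query_fasta, pdb_db, min_sim=95, min_cov=90):
    """Return the t_coffee argument list for a local-BLAST Expresso run.

    Only flags quoted in this thread are used; the thresholds default
    to the values suggested there.
    """
    return [
        "t_coffee", query_fasta,
        "-mode", "expresso",
        "-blast=LOCAL",
        f"-pdb_db={pdb_db}",
        "-pdb_type", "d",
        "-pdb_min_sim", str(min_sim),
        "-pdb_min_cov", str(min_cov),
    ]


# Hypothetical file names, for illustration only.
cmd = build_expresso_command("query.fasta", "db/myset.fasta")
print(shlex.join(cmd))

# To actually launch the run (requires t_coffee on the PATH):
# import subprocess
# subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems if a path contains spaces.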

Kind regards,

Athanasios Baltzis

PhD Fellow in Bioinformatics | Data scientist

Notredame's lab - Comparative Bioinformatics Group

Centre for Genomic Regulation (CRG), Barcelona (Spain)


Esteban López Tavera

Aug 19, 2022, 7:28:31 AM

to Tcoffee

Thank you, Athanasios.

Yes, precisely.

So the pdb_db file is just a simple fasta with the sequences corresponding to my pdbs?

I will try it. Thanks!

Best,

Esteban

Athanasios Baltzis

Aug 19, 2022, 7:37:21 AM

to tco...@googlegroups.com

It is a FASTA file that points to a BLAST database. This guide may help you build such a database: https://www.ncbi.nlm.nih.gov/books/NBK569841/
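
One possible way to get the FASTA file out of a directory of PDB files is to read the SEQRES records, sketched below in Python. This is only an illustration under a few assumptions: the files end in .pdb and contain SEQRES records, each chain is written as its own entry, and the header convention `>{file name}_{chain}` is a guess modelled on the RCSB pdb_seqres.txt layout, not something confirmed in this thread. The function names are hypothetical.

```python
from pathlib import Path

# Standard residues plus selenomethionine; anything else becomes "X".
THREE_TO_ONE = {
    "ALA": "A", "ARG": "R", "ASN": "N", "ASP": "D", "CYS": "C",
    "GLN": "Q", "GLU": "E", "GLY": "G", "HIS": "H", "ILE": "I",
    "LEU": "L", "LYS": "K", "MET": "M", "PHE": "F", "PRO": "P",
    "SER": "S", "THR": "T", "TRP": "W", "TYR": "Y", "VAL": "V",
    "MSE": "M",
}


def seqres_to_records(pdb_text, pdb_id):
    """Return [(name, sequence)] with one entry per chain in SEQRES."""
    chains = {}
    for line in pdb_text.splitlines():
        # Fixed PDB columns: chain ID in column 12, residues from column 20.
        if line.startswith("SEQRES") and len(line) > 19:
            chain_id = line[11]
            chains.setdefault(chain_id, []).extend(line[19:70].split())
    return [
        (f"{pdb_id}_{chain_id}",
         "".join(THREE_TO_ONE.get(res, "X") for res in residues))
        for chain_id, residues in chains.items()
    ]


def write_fasta(pdb_dir, fasta_path):
    """Write every chain of every *.pdb file in pdb_dir to one FASTA file."""
    count = 0
    with open(fasta_path, "w") as out:
        for pdb_file in sorted(Path(pdb_dir).glob("*.pdb")):
            records = seqres_to_records(pdb_file.read_text(), pdb_file.stem)
            for name, seq in records:
                out.write(f">{name}\n{seq}\n")
                count += 1
    return count


# Example call with hypothetical paths:
# write_fasta("my_pdbs", "db/myset.fasta")
```

The resulting file can then be indexed with makeblastdb (for example `makeblastdb -in myset.fasta -dbtype prot`), as described in the linked guide. Files without SEQRES records produce no entries, and non-protein chains come out as runs of X, so the output is worth checking by eye.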

Best,

Athanasios Baltzis


Esteban López Tavera

Aug 19, 2022, 7:44:47 AM

to Tcoffee

All right, thanks!

I have also created the BLAST database as described in the link you just shared. I assume, then, that the FASTA I give to -pdb_db is the one from which I created the BLAST database, and that everything must be in the same directory so that T-Coffee can find the database. Is that right?

Best,

Esteban

Athanasios Baltzis

Aug 19, 2022, 7:48:03 AM

to tco...@googlegroups.com

Yes, right.
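
A quick sanity check before launching the run is to confirm that the BLAST index files really sit next to the FASTA file. The sketch below assumes a protein database built with makeblastdb and its default output name, which writes files such as myset.fasta.phr, myset.fasta.pin and myset.fasta.psq; the function name and the example path are hypothetical.

```python
from pathlib import Path


def missing_blast_files(fasta_path, extensions=(".phr", ".pin", ".psq")):
    """Return the names of expected BLAST index files that are absent.

    Only the three core protein index files are checked; newer BLAST+
    releases write additional files that are ignored here.
    """
    fasta = Path(fasta_path)
    return [
        fasta.name + ext
        for ext in extensions
        if not Path(str(fasta) + ext).exists()
    ]


# Hypothetical path, for illustration only.
missing = missing_blast_files("db/myset.fasta")
if missing:
    print("Missing BLAST index files:", ", ".join(missing))
else:
    print("BLAST database files found next to the FASTA file.")
```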

Athanasios Baltzis


Esteban López Tavera

Aug 19, 2022, 7:52:24 AM

to Tcoffee

Great, thanks!
