Update hlall.fa or hla.clean.fasta

71 views
Skip to first unread message

Daniel Eriksson

unread,
Apr 28, 2015, 8:39:10 AM4/28/15
to viral-to...@googlegroups.com
Dear group members,

I have tried to assemble an update of hlall.fa (hla.clean.fasta) but downloading and merging the gen.fasta-files from IMGT generates a much smaller file than the original supplied with the ATHLATES download. Are there any instructions on how this can be done properly? How do you update the reference?

Here are the fasta-files I have been trying to merge:

ftp://ftp.ebi.ac.uk/pub/databases/ipd/imgt/hla/fasta/


Thank you for your kind help!

bmp1...@student.lu.se

unread,
Oct 29, 2015, 4:11:52 AM10/29/15
to Broad Viral Tool Users
Hi,

I also had the same question a few days ago. 
However the hla.clean.fasta file contains both the gen.fasta files and nuc.fasta files.
I prepared the hlaall.fa by merging all of these files. In this way I got 15035 sequence-headers. 

hla.clean.fasta : 7.4 Mb, 8230 sequence headers

merged fasta (.nuc +.gen):  13 Mb,15035 sequence headers

Does this answer your question? 

Daniel Eriksson

unread,
Nov 2, 2015, 1:33:49 AM11/2/15
to viral-to...@googlegroups.com
Yes, this solves the problem. Thank you.
It seemed to me that some, a few, nuc/gen sequences are identical. To get athlates running I had to remove the duplicates. 

Thank you again!

--
You received this message because you are subscribed to a topic in the Google Groups "Broad Viral Tool Users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/viral-tool-users/DqJ7mkynDf0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to viral-tool-use...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages