Converting Excel genotype data to VCF or HapMap


Akani Hlungwana

May 24, 2026, 4:27:37 PM
to TASSEL - Trait Analysis by Association, Evolution and Linkage
Dear TASSEL Community,
I hope you are all well.
I currently have a genotype SNP dataset in Excel format, as shown in the attached picture. I would like to kindly ask for guidance on how to convert the dataset into either VCF or HapMap format for downstream analysis in TASSEL and GWAS studies.

Any guidance, examples, or recommended scripts/software would be greatly appreciated.
Thank you very much for your assistance and support.


Screenshot (1).png

Terry Casstevens

May 24, 2026, 4:29:59 PM
to tas...@googlegroups.com
https://bitbucket.org/tasseladmin/tassel-5-source/wiki/UserManual/Load/Load
> --
> You received this message because you are subscribed to the Google Groups "TASSEL - Trait Analysis by Association, Evolution and Linkage" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to tassel+un...@googlegroups.com.
> To view this discussion visit https://groups.google.com/d/msgid/tassel/c2e2b7b8-b1ef-4c1f-9df2-2abdafee4e59n%40googlegroups.com.

Akani Hlungwana

May 25, 2026, 8:06:52 AM
to TASSEL - Trait Analysis by Association, Evolution and Linkage

Hello,

I went through the user manual and saved the Excel document as Text (Tab delimited). However, when I tried to load it into TASSEL, it gave an error. Please see the attached picture for reference.

Screenshot (4).png

Hugo Cuevas

May 25, 2026, 8:13:17 AM
to tas...@googlegroups.com
How many SNPs and samples do you have?

Akani Hlungwana

May 25, 2026, 8:21:30 AM
to TASSEL - Trait Analysis by Association, Evolution and Linkage
The dataset consists of 9,065 SNP markers genotyped across 336 samples.

Terry Casstevens

May 25, 2026, 8:49:39 AM
to tas...@googlegroups.com
If you format it as HapMap, the file extension should be .hmp.txt

If you format it as VCF, the file extension should be .vcf

Hugo Cuevas

May 25, 2026, 10:16:49 AM
to tas...@googlegroups.com
Since it is a small data set, you can prepare the HapMap file in Excel, then copy/paste to save the file as name.hmp.txt.

1 - Transpose the data file to get one row per SNP.
2 - Add the columns and column names of the HapMap format.
3 - Copy/paste to save as name.hmp.txt
4 - Open in TASSEL

Remember that chromosome and bp position must be in ascending order.
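
For anyone who prefers a script to doing this by hand, the steps above could be sketched in Python roughly as follows. This is only an illustration under assumptions that are not from this thread: the sheet has one row per sample and one column per marker, calls are two-letter strings such as "AG" with "NN" for missing, chromosomes are numeric, and a separate marker map gives chromosome and position. The function names to_hapmap_rows and write_hapmap are made up for this example.

```python
import csv

# The 11 fixed HapMap columns that precede the sample columns.
HAPMAP_COLUMNS = ["rs#", "alleles", "chrom", "pos", "strand", "assembly#",
                  "center", "protLSID", "assayLSID", "panelLSID", "QCcode"]


def to_hapmap_rows(samples, markers, calls, positions):
    """Build HapMap rows (header first) from a samples-by-markers table.

    samples   -- list of sample names, one per input row
    markers   -- list of marker names, one per input column
    calls     -- calls[i][j] is the call of samples[i] at markers[j], e.g. "AG"
    positions -- dict: marker name -> (chromosome number, bp position)
    """
    rows = []
    for j, marker in enumerate(markers):
        # Step 1: transpose, so each output row holds one SNP.
        genotypes = [calls[i][j] for i in range(len(samples))]
        # Observed alleles, ignoring the assumed missing code "N".
        alleles = sorted({base for g in genotypes for base in g if base != "N"})
        chrom, pos = positions[marker]
        # Step 2: add the fixed columns; the 7 unused ones are set to NA.
        rows.append([marker, "/".join(alleles) or "NA", chrom, pos]
                    + ["NA"] * 7 + genotypes)
    # Chromosome and bp position in ascending order.
    rows.sort(key=lambda r: (r[2], r[3]))
    return [HAPMAP_COLUMNS + list(samples)] + rows


def write_hapmap(path, rows):
    """Step 3: save as a tab-delimited file, e.g. name.hmp.txt."""
    with open(path, "w", newline="") as handle:
        csv.writer(handle, delimiter="\t", lineterminator="\n").writerows(rows)
```

Calling write_hapmap("name.hmp.txt", to_hapmap_rows(...)) would then give a file to open in TASSEL. If your chromosome names are not plain numbers, the sort key would need adjusting.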

Best,

Hugo

Chandika RG

May 25, 2026, 10:34:52 AM
to TASSEL - Trait Analysis by Association, Evolution and Linkage
Hello Akani,

I would manually convert the sheet to match the HapMap format. 

Copy all the data and paste it transposed into a new Excel sheet. Check the column names of the format here:
https://bitbucket.org/tasseladmin/tassel-5-source/wiki/UserManual/Load/Load
Insert all the required additional columns and set them to NA for each entry.

Save the transformed data with extension .hmp.txt - it is a tab-delimited file. 
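
Before loading, a quick check of the saved file can catch the usual slips (wrong extension, not tab-delimited, missing fixed columns). The sketch below is only a convenience check written for this reply, not part of TASSEL; check_hapmap is a hypothetical helper name, and it assumes the 11 standard HapMap columns come first, followed by the sample columns.

```python
# The 11 fixed HapMap columns expected at the start of the header line.
HAPMAP_COLUMNS = ["rs#", "alleles", "chrom", "pos", "strand", "assembly#",
                  "center", "protLSID", "assayLSID", "panelLSID", "QCcode"]


def check_hapmap(filename, first_line):
    """Return a list of problems found in the file name and header line."""
    problems = []
    if not filename.endswith(".hmp.txt"):
        problems.append("file name does not end in .hmp.txt")
    header = first_line.rstrip("\r\n").split("\t")
    if len(header) == 1:
        # A single field usually means commas or spaces were used instead.
        problems.append("header is not tab-delimited")
    elif header[:11] != HAPMAP_COLUMNS:
        problems.append("first 11 columns are not the standard HapMap columns")
    elif len(header) == 11:
        problems.append("no sample columns after the 11 fixed columns")
    return problems
```

To use it, read the first line of the file with open(filename).readline() and pass it in together with the file name; an empty list means none of these problems were found.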

Best,
Chandika

Akani Hlungwana

May 26, 2026, 2:39:53 AM
to TASSEL - Trait Analysis by Association, Evolution and Linkage
Hello everyone,  
Thank you all for your support — it worked successfully.

Muhammad Atif Wahid

May 26, 2026, 1:50:44 PM
to tas...@googlegroups.com
Consult maizegenetics.
