Hi Prof. Gutenkunst,
I have two populations’ whole genomic resequencing data, and I mapped them to a HIC version reference genome to obtain the vcf file.
Firstly, I tried to use:
dd = dadi.Misc.make_data_dict_vcf (“vcf.file”,”population.txt”) to generate SFS file,
but failed, the ERROR report is that:
“Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/home/smh/miniconda3/lib/python3.9/site-packages/dadi/Misc.py", line 489, in make_data_dict_vcf
popinfo_dict = _get_popinfo(popinfo_file)
File "/home/smh/miniconda3/lib/python3.9/site-packages/dadi/Misc.py", line 692, in _get_popinfo
sample = cols[sample_col]
IndexError: list index out of range”.
I think it may because for the first scaffold of my reference genome is large than 512MB, the same easySFS can’ work. I wonder that what can I do to solve this problem?
Secondly, I find a R script at “
https://github.com/shenglin-liu/vcf2sfs”, and it can give a sfs format data. I tried to use it convert the example file (example/fs_from_data) dadi provided but it can not work. So, I can’t verify if the fs generated are same or similar. Would you please tell me the sfs generated by this script is creditable?
Sincerely
Meng-Huan Song