I know I'm not alone in thinking that GISAID seems to be ahead of the
curve, especially in terms of datasets.
It's possible to download the newick trees, that's all well and good. The raw
data is not (some what expected, mind you), Just thought somebody here might know
answers to following questions:
1) Do registered users get access to the raw data (I have my doubts)
2) How is the raw data from such diverse sources uniformized?
3) which format are most of the labs submitting their data as? (I bet it won't be what the
what the Treetime/Augur/Auspice pipeline wants ... if bioinformaticans didn't have to convert file
format, 50% of their job would be gone. :-D)
Thanks in advance!