Hello Xingbin,
We think Galaxy is probably the best fit for your needs. You can find
the homepage for Galaxy here:
https://usegalaxy.org/
We also have a tool that may be of use to you called the Variant
Annotation Integrator (hgVai):
http://genome.ucsc.edu/cgi-bin/hgVai
hgVai can match up rsIDs with submitted variants -- but it has a hard
upper limit of 100,000 variants per submission, and if too many
additional outputs are selected then the practical limit can be 10,000
(or the query may time out). This would require you to break up your
input into 11 chunks of 100,000 or fewer variants per chunk. You would
need to convert their kgp data to pgSnp format, upload 11 pgSnp files
as custom tracks, and then run hgVai with max variants = 100,000 (and
probably disabling the protein-coding change scores that are enabled
by default) on each of the 11 custom tracks. 11 manual operations is
better than 1000, and you might get some additional useful information
from hgVai, but if you really only want to associate rs IDs, Galaxy is
an easier choice.
If you have any further questions, please reply to
gen...@soe.ucsc.edu. All messages sent to that address are archived on
a publicly-accessible Google Groups forum. If your question includes
sensitive data, you may send it instead to
genom...@soe.ucsc.edu.
Best regards,
Pauline Fujita
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu
> --
>