Hello Yu,
See the Methods section of the description page for the EST track for an
explanation of what may be happening. In particular:
"In general, the 3' ESTs mark the end of transcription reasonably well,
but the 5' ESTs may end at any point within the transcript. Some of the
newer cap-selected libraries cover transcription start reasonably well.
Before the cap-selection techniques emerged, some projects used random
rather than poly-A priming in an attempt to retrieve sequence distant
from the 3' end. These projects were successful at this, but as a side
effect also deposited sequences from unprocessed mRNA and perhaps even
genomic sequences into the EST databases. Even outside of the
random-primed projects, there is a degree of non-mRNA contamination.
Because of this, a single unspliced EST should be viewed with
considerable skepticism."
DB034871 appears to be a 5' EST.
Additionally, here is a paper that looks helpful for explaining the
quality of EST data:
Nagaraj SH, Gasser RB, Ranganathan S. A hitchhiker's guide to expressed
sequence tag (EST) analysis. Brief Bioinform. 2007 Jan;8(1):6-21. Epub
2006 May 23.
http://bib.oxfordjournals.org/cgi/content/abstract/8/1/6
I hope this is helpful.
--
Brooke Rhead
UCSC Genome Bioinformatics Group
> _______________________________________________
> Genome maillist -
Gen...@soe.ucsc.edu
>
http://www.soe.ucsc.edu/mailman/listinfo/genome