Inquiry about '--examine_coding_effect' parameter

101 views
Skip to first unread message

小林賢子

unread,
Aug 26, 2020, 8:25:28 AM8/26/20
to STAR-Fusion
Dear author,  

I would like to ask a question.
STAR-Fusion outputs CDS_LEFT_ID and CDS_RIGHT_ID as the results of '--examine_coding_effect' parameter. How does STAR-Fusion automatically choose each transcript? (I mean, there are a lot of transcripts for each genes. What criteria are applied for choosing transcripts?)

Tetsuko Kobayashi

Brian Haas

unread,
Aug 26, 2020, 9:08:23 AM8/26/20
to 小林賢子, STAR-Fusion
Hi,

It just prioritizes the candidate fusion isoform that produces the longest in-frame coding sequence given the reference annotation structures and the tentative breakpoint.  It doesn't do anything fancy or more intelligent here at the moment.





--
You received this message because you are subscribed to the Google Groups "STAR-Fusion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to star-fusion...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/star-fusion/17e50707-a740-4eab-8fa0-762eedc5ef4cn%40googlegroups.com.


--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas

 

小林賢子

unread,
Aug 26, 2020, 9:10:38 PM8/26/20
to STAR-Fusion
Thanks a lot.

2020年8月26日水曜日 22:08:23 UTC+9 Brian Haas:

Mario N

unread,
Sep 4, 2020, 10:49:09 AM9/4/20
to STAR-Fusion
Hello,

About this option, I have trouble to understand the "est_J" and the "est_S" column. It seems to be related to the JunctionReadCount and SpanningFragCount. Can you explain it to me ?

Best,

Mario

Brian Haas

unread,
Sep 4, 2020, 12:45:03 PM9/4/20
to Mario N, STAR-Fusion
hi,

These are adjusted counts taking into account multimappings.  For example, if there's a spanning frag that's assigned to multiple different isoforms with different breakpoints, the est-val involves a fractional assignment of that read to the isoform based on likelihood given all the other read mappings and proposed expression (FFPM) value.

This is a new addition in recent releases.

best,

~b

Mario N

unread,
Sep 5, 2020, 5:38:59 AM9/5/20
to STAR-Fusion
I seem to catch the idea but i'm still a bit confused.

From the data I have, the junctions reads are always equal to est_J. Does it mean that I have a single breakpoint on these fusions ?
On the other hand, the spanning reads are most of the time different of the est_S (eg :  210 / 196.60 or 216 /16.16).
So in the first case I have a major isoform and in the another one a lot a differents isoforms ?

Thanks again for the help,

Best,

Mario

Brian Haas

unread,
Sep 5, 2020, 9:12:50 AM9/5/20
to Mario N, STAR-Fusion
The spanning reads are reassigned to the isoform that's most likely the correct one, but they're compatible as evidence with the other one.

Think of it as having the 'raw counts' and then the 'estimated counts' as you would in isoform expression quantification data, in case that helps.

Mario N

unread,
Sep 7, 2020, 3:37:00 AM9/7/20
to STAR-Fusion
Oh okay, It's a bit clearer .

Thanks for the enlightenment.

Best,

Mario
Reply all
Reply to author
Forward
0 new messages