Hi Lu,I have a few more questions about how bartender clusters across time points --- in particular, the frequencies of my clustered barcodes are fluctuating a lot between time points, including some barcodes that appear only at a single intermediate time point but are absent at all other time points. I'm wondering if it's some artifact of the clustering.
First, when I use bartender_combiner_com, should I list input files forward in time or backward in time? I was a bit confused about whether it should be forward or backward, since the documentation on github says it should be forward, but the algorithm seems to process them in reverse order, in which case it is considering the least-diverse set of barcodes (the last time point) first. Shouldn't the algorithm consider the first time point first, and then add unmatched clusters from t to the list at t + 1 (with zero reads at that time point)?
Related, I'm wondering if it would be better to simply pool the reads from all time points together, clustering them, and then determining how many reads from each cluster came from each time point, rather than clustering at each time point separately and then matching.
Thank you very much for all your help!Michael
--
You received this message because you are subscribed to the Google Groups "Bartender" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bartenderRandomBarcode+unsubscri...@googlegroups.com.
To post to this group, send email to bartenderRandomBarcode@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bartenderRandomBarcode/0675e068-959b-4621-9286-c51be25270bf%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Thanks, Lu. Using the UMI feature to map pooled reads back to their original time points is exactly what I thinking, but I am a bit confused how to do it. For example, let Pooled_barcode.txt be my master set of barcodes pooled across time points:
AAAAAAA,t0_1
AAACCCC,t0_2
AAAAAAA,t1_1
AAACCCC,t1_2
The UMIs at the end of each line indicate both the time point (t0 or t1) and a unique ID number within that time point (1 or 2). I then cluster this set of barcodes together. The output file Pooled_barcode.csv contains information on how each unique read mapped to a cluster:
Unique.reads,Frequency,Cluster.ID
AAACCCC,2,1
AAAAAAA,2,2
The problem is that while I know that two pooled reads mapped to cluster 1, I don't know which reads these were. So I suppose I can go back to the original list of pooled barcodes (Pooled_barcode.txt) and go through each barcode, one by one, look it up in the list of unique reads and mapped clusters (Pooled_barcode.csv), and determine the frequency of clusters in each time point. This last part --- looking up each original barcode one-by-one in the list of clusters --- seems rather time-consuming, but is there any other way to do it?
Many thanks,
Michael
To post to this group, send email to bartenderRa...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bartenderRandomBarcode/0675e068-959b-4621-9286-c51be25270bf%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
----Sincerely,Lu
You received this message because you are subscribed to the Google Groups "Bartender" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bartenderRandomBarcode+unsub...@googlegroups.com.
To post to this group, send email to bartenderRandomBarcode@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bartenderRandomBarcode/ed1abf46-0098-464f-b5cf-44b20b4d1b58%40googlegroups.com.