Hello,
I've been playing around with Garnett with mixed success to identify cells in a dataset. I'm left with a few questions as to what the best practices are.
Most marker files I've seen only use a very small number of genes for 'expressed'. However, I'm trying to differentiate two similar subtypes of cell from a greater population, and using the bare minimum of markers hasn't proven very usefull for telling the two apart. This has lead me to add more and more marker genes to try and tell them apart. Is it counterproductive to use so many genes, or do more genes help build a more accurate picture?
Also with subtypes, when I set a cell as a subtype of another cell, it adds all the subtype markers to the main type markers in check_markers. Does this mean that it is using the subtype markers to search for the maintype?
Does Garnett use the main type markers to search for the subtype? For example, if I specify that atrial is a subtype of CM, does it determine which cells are CMs first then decide which ones are atrial? Does it search for atrial as the list of markers for CM + the markers for atrial?
How do I add a key to plot_cells? I have a lot of colors on this plot, but I don't understand what colors are which cell type. Also, can I change the assigned colors? One cell group in particular has been assigned a shade of pinkish purple that's too close to Unknown's purplish pink.
Thanks,
Robert Alpin