Hi,
I'm new to Garnett and had a few questions after trying to run it on my own data set which is 33694 genes x 2059 cells.
My marker file which I have formatted to the required Garnett format contains 20 cell definitions (after removing several that were overlapped or not found in db).
When I run train_classifier, half of the cell types are dropped because there are not enough training cells.
1) Is there anything that I should do so that I can retain more of these cell types. The cells in my data set have been manually labelled previously using the cell type definitions so I am wondering why so many were dropped.
2) I read in a previous post that the same input data set used for the training step can be used for the classifying step and I wanted to confirm this is appropriate (normally training and classifying data sets should be separate?).
Thanks for your help,
shui