Dear Tao,
thank you very much. I understand what K-mean means
So, if i understand correct....whatever normalization I will use, if I repeat the analysis I will always have different results?
That means that you have to repeat your analysis many times in order to have a picture which fits to your expectation?
Isn't scarry that you have different pictures each time you do the analysis with the same input?
Isn't a way to define the way the clusters are made? Like I said in the previous email, for example to have at one cluster the peaks which have a specific distance from TSS (for example greater that 1000bp from TSS) and another cluster to have peaks with distance less than 1000bp from TSS.
So if i want to see the distribution of my peaks relevant to TSS, is it logical to repeat the clustering method many times until I have the best picture? How can I define what number is the best to choose for the number of clusters?
Thank you in advance
I will wait for your answer as soon as possible as I really have to make some pictures with seqminner
Best regards
Petros