Hi all,
In ClueGO " the size of the nodes is mentioned to correspond to the statistical significance of the terms associated with it.
I wish to clarify few things are how the size of the node is derived.
1) The type of statistics used to derive the size of the nodes.
2) is it right to say the terms in the nodes are overrepresented or enriched for a specific background e.g., for all the human genes.
2) Does the reference set options (1) selected ontologies reference set (2) predefined reference set, (3) Custom reference set, influence the size of the node.
3) in the reference set options if I select the option "selected ontologies reference set" please clarify what will be the ontologies that ClueGO will use as the reference set. Does it correspond to all the genes in the human genome?
4) Does the software allow for no background gene list. If there is an option to select this how could this be fed into the software to perform the analysis? In this case, again does the software take all human genes as the background for calculations.
Thank you,
Nara