Hi Nora,
I will follow up your reply in this thread here, so that it might benefit others who might have the same questions.
As I mentioned in my earlier reply, in GENE2FUNC, it obtains the overlap between the genes in the gene set and the background genes. The file for the background gene is currently not shared on the Download page on FUMA but if you need it, please send an email and I can send it to you.
in R:
library(data.table)
bkgenes = fread("ENSG.genes.txt")
chromatin_gs = fread("GOBP_PROTEIN_LOCALIZATION_TO_CENP_A_CONTAINING_CHROMATIN.txt")
> chromatin_gs
entrezID
<int>
1: 1058
2: 121504
3: 3012
4: 554313
5: 8294
6: 8335
7: 8359
8: 8360
9: 8361
10: 8362
11: 8363
12: 8364
13: 8365
14: 8366
15: 8367
16: 8368
17: 8370
18: 8970
bkgenes_entrezID = unique(bkgenes$entrezID) #since you specify all, no filtering was done
length(bkgenes_entrezID)
[1] 24304
chromatin_gs_entrezID = unique(chromatin_gs$entrezID)
length(chromatin_gs_entrezID)
[1] 18
common = intersect(bkgenes_entrezID, chromatin_gs_entrezID)
common
[1] 554313 1058 8335 8970
Thus, there are 4 genes that are common between the genes in the gene set and the background genes.
Best,
Tanya