3. all "Unknown"-typed data nodes with a specific data source for all except from the Reactome collection
The second set it related to the Reactome Convertor and how some things are converted into GPML. However, I did note too there are unknown-typed datanotes with ChEBI identifiers. Something that may be worthwhile checking out.
The first and third sets are starting points for curation. In the third set, I limit the output to these sources: Wikidata, ChEBI, Uniprot-TrEMBL, and Ensembl. It has been suggested that some annotation we can do in an automated way, which may be feasible for nodes with the latter two data sources. For Wikidata and ChEBI it is less straightforward, and I would recommend manual curation for these.
I would suggest people to record DataNodes of a type that we currently do not have. Are there some types used relatively frequently but for which we currently do not have a type (current types: Metabolite, Protein, GeneProduct, Rna, Complex). We already identified "Dna" as missing, but there may be others.
Grtz,
Egon