Accession Numbers bulk download change

Francesco Talo

Mar 31, 2020, 10:11:42 AM
to Europe PMC Developer Forum
Dear users,

We would like to announce that, starting tomorrow, the format of the Accession Numbers export (ftp://ftp.ebi.ac.uk/pub/databases/pmc/TextMinedTerms/) will change.

Currently, the files contain only accession numbers identified through text mining in full-text articles.
Each line in the files contains three pieces of information: accession number, PMCID and PMID.

ArrayExpress,PMCID,PMID
E-AFMX-10,PMC1523219,16608515


From tomorrow, we will export accession numbers identified through text mining in both abstracts and full-text articles.
As a result, each line of the files will contain four pieces of information: accession number, PMCID, SRC and EXT_ID.

ArrayExpress,PMCID,SRC,EXT_ID
E-AFMX-10,PMC1523219,16608515,MED

Where the article has no corresponding full-text version, the PMCID value will be empty.
Both SRC and EXT_ID are needed to identify the article, because accessions can also be found in abstracts from sources other than PubMed (e.g. patents, guidelines, theses).
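
For anyone updating a script that reads these files, here is a minimal Python sketch of parsing the new four-column lines. It is only an illustration: the function names `read_accessions` and `article_key` and the lower-case field names are made up for this example. Note that the header above lists SRC before EXT_ID, while the sample row shows the numeric ID before MED, so the sketch takes the column order as a parameter; please check it against the actual files.

```python
import csv
import io


def read_accessions(text, columns=("accession", "pmcid", "src", "ext_id")):
    """Parse the text of one export file into a list of dicts.

    `columns` names the four positions on each line. The default follows
    the announced header order; this is an assumption, so pass a different
    order if the real files differ.
    """
    reader = csv.reader(io.StringIO(text))
    next(reader, None)  # skip the header line
    records = []
    for row in reader:
        if len(row) != len(columns):
            continue  # skip blank or malformed lines
        # Empty values (e.g. a missing PMCID) become None.
        records.append(
            {name: (value.strip() or None) for name, value in zip(columns, row)}
        )
    return records


def article_key(record):
    """Identify an article by its (SRC, EXT_ID) pair, which is always present."""
    return (record["src"], record["ext_id"])


# The sample row from this post has the numeric ID before the source code,
# so that order is passed explicitly here.
sample = "ArrayExpress,PMCID,SRC,EXT_ID\nE-AFMX-10,PMC1523219,16608515,MED\n"
for rec in read_accessions(sample, columns=("accession", "pmcid", "ext_id", "src")):
    print(article_key(rec), rec["accession"], rec["pmcid"])
```

Keying on the (SRC, EXT_ID) pair rather than on PMCID means that abstract-only records, where PMCID is empty, are still uniquely identified.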

The dataset will continue to contain one file per database. Please be aware that the file names will also change slightly.

We will post an update tomorrow once the change has been completed.

Please let us know if you have any questions about this.

Best Regards,
Europe PMC team

Francesco Talo

Apr 2, 2020, 4:57:19 AM
to Europe PMC Developer Forum
Dear users,

We can confirm that the change has been made.
The new accession number files are available at ftp://ftp.ebi.ac.uk/pub/databases/pmc/TextMinedTerms/
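
Since the file names have changed slightly, one way to discover the new names is to list the directory rather than hard-code them. The following is a rough sketch using Python's standard `ftplib`, assuming the server accepts anonymous FTP logins; the function name `list_export_files` and its `ftp_factory` parameter are illustrative only.

```python
from ftplib import FTP

HOST = "ftp.ebi.ac.uk"
DIRECTORY = "/pub/databases/pmc/TextMinedTerms/"


def list_export_files(host=HOST, directory=DIRECTORY, ftp_factory=FTP):
    """Return the sorted file names found in the export directory.

    `ftp_factory` exists only so the connection class can be swapped out
    (for example in tests); by default it is ftplib.FTP.
    """
    with ftp_factory(host) as ftp:
        ftp.login()          # anonymous login (assumed to be allowed)
        ftp.cwd(directory)
        return sorted(ftp.nlst())
```

Calling `list_export_files()` with no arguments connects to the server and returns whatever names are in the directory at that moment, so a script can pick up each database's file without assuming the new naming scheme.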

Best Regards
Europe PMC team