Which data I would like to see open?
The EU's Publication Office publishes
datasets
containing information on all research projects funded by the European
Union under the subsequent framework programmes for research and
innovation, currently Horizon 2020. These are extracted from the CORDIS
information system, used by the Commission to administer the programme:
receive applications, validate the eligibility of applicants and so on.
I would like the Commission to complete those datasets with information on the non-funded proposals.
What benefit would access to that information bring?
Here
I would make three arguments. The first argument is one of
market-making. One of the goals of EU-funded research is to foster
collaboration ties between universities and the private sector (with a
particular attention to SMEs). Knowing the full history of submissions
would help newcomers into the framework programme to choose its partners
wisely. If I am an SME looking to participate in a research programme, I
would prefer to partner up with university A, who produces relatively
few proposals but curates them well, and so has a high success rate.
University B produces very many proposals; while it might have the same number of successes as University A, its success rate
is lower. Since I am only going to present the one proposal, which is
useful for my business, I would like to know the success rates of
potential partners. This is not possible without access to the data of
unsuccessful proposals.
The second argument is
one of research. Having this information would allow the research
community to study how its probability of success depends on several
variables. Again, to know this it is necessary to have information on
the non-funded proposals.
The third argument is
one of general transparency. These data, like all administrative data,
should be available by default. It is the Commission who should argue
for them not to be released.
So why are these data not public?
I
asked (informally) the Publication Office, and they came up with a IPR
argument. Unsuccessful proposals are the intellectual property of the
applicants. This, however, could be easily solved by stripping away all
own content from the data. Even for successful proposals, the CORDIS
dataset only publishes the abstracts.
What do I need help with?
The EU has multiple channels to ask for information. This is quite confusing. For these data, I find: