“Open Research Data: Implications for Science and Society”, Warsaw, Poland, May 28–29, 2015. The conference was organized by the Open Science Platform — an initiative of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw. pon.edu.pl @OpenSciPlatform #ORD2015
Report
Share
Report
Share
1 of 29
More Related Content
Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman implications of open data
1. Open Research Data:
Present and planned EC
Policy
Jean-Claude Burgelman, S. Luber, R. Von Schomberg, D.
Spichtinger, W.Lusoli
Head of Unit
European Commission
DG RTD/A6
Keynote
ORD Conference Open Research Data: Implications for
Science and Society - Warsaw May 2015
2. A new Commission (2014-19)
Andrus Ansip, Vice-President, Digital Single Market
Carlos Moedas, Commissioner for
Research, Science and Innovation
Günther Oettinger, Commissioner
for Digital Economy and Society
3. Commissioner view
"Open Science, of which Open Access is an
important part, will be vital to ensuring
European progress and prosperity in the
future"
(Speech at NETHER, January 26, 2015)
4. Open Research Data
• ORD refers to making research data freely
available for reuse beyond the purpose for which
they were originally collected.
• Making Research data freely available aid further
discovery, make scientific process more cost
efficient and reliable.
• ORD is part of a broader change: data driven
science underpinning Open Science
5. • A systemic change in the modus operandi of
science and research
• Affecting the whole research cycle and its
stakeholders
Open Research Data - Open Science
8. Research and
Innovation
ORD and Data Driven Science: Big
Data in Open Science
• Traditional modus operandi for Science: scientific experiments
and observations generate data to test Scientific Hypotheses.
- many limitations (empirical blackholes)
• New opportunities due to “big data”:
- The digitisation of science (e.g. DNA sequencing)
- The internet of everything
• Data Driven Science is the application of Big Data in Science.
• Enormous opportunities:
- 4th paradigm in science (inductive, computional)
- “here are the data, where is the hypothesis?”
- Potential to reboot completely SSH (“social physics”)
• Serious policy issues
9. Research and
Innovation
Data Driven Science: Big Data in
Science
Most important policy issues for DDS to take off:
• TDM
• Open Access
• Copyright
• Data protection
• Cloud
10. The EC wants to optimise the impact of
public funded research
• At European level (FP7 & Horizon 2020)
• At Member State level
One way to get there: open access
• to peer-reviewed scientific publications
• to research data
Expected benefits:
• Better, more transparant & efficient science Open Science & RRI
• Faster uptake of research leading to faster & better innovation &
economic growth Innovation Union
11. The ORD policies of the European
Commission are threefold.
The EC as Policy maker
• It proposes EU legislation & legislates with other EU
institutions
• It invites Member States to act
The EC as Funding agency
• It sets its own access and dissemination rules for EC-funded
research
The EC Capacity builder
• It funds projects that support EC/EU policy
ORD Policy developed jointly in DG RTD and CNECT, with
input from the R&I family
12. EC as funder - Open access in
Horizon 2020
• Mandatory for publications
• Pilot for data
13. H2020: OA to publications
FP7
• Green open access pilot in 7
areas of FP7 with 'best effort'
stipulation
• Allowed embargoes: 6/12
months
• Gold open access costs
eligible for reimbursement as
part of the project budget
while the project runs
Horizon 2020
• Obligation to provide OA, either
through the Green or Gold way
in all areas
• Allowed embargoes: 6/12 months
• Gold open access costs eligible
for reimbursement as part of the
project budget while the project
runs & post-grant support
being piloted
• Authors encouraged to retain
copyright and grant licences
instead
14. H2020: Pilot on Open Research
Data
• Certain areas
• Voluntary
• Opt out of the pilot
• Opt in for other areas
• DMP provisions
15. H2020: Pilot on Open Research Data
Areas of the 2014-2015 Work Programme participating in the Open
Research Data Pilot are:
• Future and Emerging Technologies
• Research infrastructures – part e-Infrastructures
• Leadership in enabling and industrial technologies – ICT
• Societal Challenge: Secure, Clean and Efficient Energy – part Smart cities
and communities
• Societal Challenge: Climate Action, Environment, Resource Efficiency and
Raw materials – except raw materials
• Societal Challenge: Europe in a changing world – inclusive, innovative and
reflective Societies
• Science with and for Society
Projects in other areas can participate on a voluntary basis (already
new areas to be added for 2016-17 WP
16. Projects may opt out of the Pilot on ORD, if:
• The project will not generate / collect any data
• Conflict with obligation to protect results
• Conflict with confidentiality obligations
• Conflict with security obligations
• Conflict with rules on protection of personal data
• The achievement of the action’s main objective would be
jeopardised by making specific parts of the research data
openly accessible (to be explained in data management
plan)
H2020: Pilot on Open Research Data
17. H2020: Pilot on Open Research Data
Types of data concerned:
• Data needed to validate the results presented in scientific publications
("underlying data")
• Other data as specified in DMP (=up to projects)
Beneficiaries participating in the Pilot will:
• Deposit this data in a research data repository of their choice
• Take measures to make it possible to access, mine, exploit, reproduce
and disseminate free of charge
• Provide information about tools and instruments at the disposal of the
beneficiaries and necessary for validating the results (where possible,
provide the tools and instruments themselves)
18. H2020: Pilot on Open Research Data:
Data Management Plan
• DMPs are NOT part of the proposal evaluation: to be
delivered within the first six months of the project and
updated as needed
• DMP’s mandatory for all projects participating in the Pilot, optional
for others
• DMP questions:
•What data will be collected / generated?
•What standards will be used / how will metadata be generated?
•What data will be exploited? What data will be shared / made
open?
•How will data be curated and preserved?
19. H2020: Pilot on ORD: take-up in first calls
Basis: 3054 Horizon 2020 proposals
Calls in core-areas: opt out 24.2% (442 of
1824 proposals) – range from 9,1-29,1%
Other areas: voluntary opt in 27.2% (334 of
1230 proposals) – range from 9 to 50%
Preliminary but encouraging results
20. EC as policy maker on OA: currently
ongoing…
• Analysis of Uptake of ORD Pilot in signed grant
agreements (versus proposals)
• DMP implementation: investigating best-practice; tools to
be developed (2015)
• Top-notch monitoring of OA policies is crucial for further
policy development
21. EC as capacity builder on OA:
coordination and support actions
(ongoing - FP7 funded)
PASTEUR4OA (Open Access Policy Alignment Strategies for European
Union Research) Started 2014
FOSTER (Foster Open Science Training for European Research)
Started 2014
RECODE - (Policy Recommendations for Open Access to Research
Data in Europe) – 2013, finishing
OpenAIRE/OpenAIRE+: supporting the implementation of Open
Access in Europe (publications and data)
Infrastructure projects(with OA components), e.g. GEO/GEOSS,
ELIXIR…
22. EC as policymaker - Open access
policies across the EU
Preliminary findings from
(i) NPR reporting template of 13 EU MS & 1 Associated Country
(ii) Results of 2014 ERA Progress Report
General findings
1. Mostly soft measures rather than legislation: exceptions exist
2. OA to publications > than OA to data. Progress as to infrastructures for
data (repositories), but openness still quite complex an issue and not
addressed in many countries (for data)
3. Bigger and “richer” countries have more comprehensive OA policies and
OA enabling infrastructures, as well as tend to lead or participate more
actively in OA networking initiatives
4. Nevertheless, smaller or less federated countries have the advantage of
easier coordination and better synergistic capabilities
23. ORD in Poland
• OCEAN – a new national research datacentre with focuses on
Big and Open Data is being built. Its tasks will include large
scale data mining as well as long term preservation of
research results.
• There are about 100 digital libraries which contain scientific
papers and several classical scientific repositories.
• Poland is in the FOSTER (Facilitate Open Science Training for
European Research) project, where one of the areas covered
is to provide training on open access to research data.
24. EC as policymaker – A European Open Science
Cloud?
• Rationale and first ideas
• Work in progress.
• Not to be quoted
25. Science 2.0 consultation (July-Sept 2014 + validation WS)
o ~ 85 % agree to some extent that data infrastructures are a bottleneck
o Spontaneous position papers from research stakeholders
Possible actions
1. Mandate the development of common interfaces and data standards
2. Coordinate at European Level the funding/ maintenance and
interoperability of research infrastructures
3. Support the development of a European Open Science Cloud for
research
European
Open Science Agenda
26. A European Open Science Cloud: part of Europe´s ambition to support
the transition to Open Science and make the most of data-driven
science.
o European scientists strongly stated the need for a research data
that is cost-effective, preserves privacy and is IPR-conscious (Science
consultation).
o The cloud provides all EU researchers a virtual environment with
seamless services for data storage, management, analysis and re-use,
disciplines.
o The cloud will federate existing and emerging horizontal and
infrastructures, effectively bridging todays fragmentation and ad-
o The cloud adds value - scale, data-driven science, inter-disciplinarity,
knowledge to innovation - and leverages current and past
investment (10b per year by MS, two decades EU investment).
European
Open Science Cloud
27. Lifesciences
Lead users… Scientific communities …long tail
Physics
Earthsciences
Economics
Social
sciences
Scaleofscientificactivity(data-drivenscience)
Applied-engineering
……
Humanities Citizen science
European
Open Science Cloud
28. Data
layer
Service
layer
Governanc
e
layer
Lifesciences
Lead users… Scientific communities …long tail
Physics
Earthsciences
Economics
Social
sciences
Scaleofscientificactivity(data-drivenscience)
High performance computing
Data fusion across disciplines
Big data analytics
Privacy and personal data protection
… …
Data discovery and catalogue
Data manipulation and export
Data access and re-use
Trust
Leverage of MS investment
Legacy and sustainability
IPR protection
Federation
Applied-engineering
……
Humanities
Data storage
Citizen science
European
Open Science Cloud
Bottom-up governance
29. In summary
• The EC is a strong funder, policy maker and capacity builder
with regard to ORD in particular and OA in general
• It sees it as part of an irreversible change in the modus
operandi of science: open science
• A lot is ongoing and planned
• Much more needs to be done if we want Europe and its
science stakeholders to capitalise on the opportunities ORD,
OA and OS offer.
• The EC follows a bottom up and stakeholder driven apporach