Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
SlideShare a Scribd company logo
Open Research Data:
Present and planned EC
Policy
Jean-Claude Burgelman, S. Luber, R. Von Schomberg, D.
Spichtinger, W.Lusoli
Head of Unit
European Commission
DG RTD/A6
Keynote
ORD Conference Open Research Data: Implications for
Science and Society - Warsaw May 2015
A new Commission (2014-19)
Andrus Ansip, Vice-President, Digital Single Market
Carlos Moedas, Commissioner for
Research, Science and Innovation
Günther Oettinger, Commissioner
for Digital Economy and Society
Commissioner view
"Open Science, of which Open Access is an
important part, will be vital to ensuring
European progress and prosperity in the
future"
(Speech at NETHER, January 26, 2015)
Open Research Data
• ORD refers to making research data freely
available for reuse beyond the purpose for which
they were originally collected.
• Making Research data freely available aid further
discovery, make scientific process more cost
efficient and reliable.
• ORD is part of a broader change: data driven
science underpinning Open Science
• A systemic change in the modus operandi of
science and research
• Affecting the whole research cycle and its
stakeholders
Open Research Data - Open Science
Analysis
Publication
ReviewConceptualisation
Data
gathering
Open
access
Scientific
blogs Collaborative
bibliographies
Alternative
Reputation
systems
Citizens
science
Open
code
Open
workflows
Open
annotation
Open
data
Pre-
print
Data-
intensive
6
Sci-
starter.com
Runmycode
.org
ArXiv
Roar.eprints.
org
Impact Story
Altmetric.com
Mendeley.com
Academia.edu
Researchgate.com
Openannotation.org
Datadryad.org
Myexperiment.org
Figshare.com
An emerging
ecosystem of
services and
standards
It's real!
Research and
Innovation
O
P
E
N
S
C
I
E
N
C
E
OA Research data
Data-intensive
Science
Data-intensive Science
OA to
research
publications
TDM
O
P
E
N
S
C
I
E
N
C
E
BIG DATA
OA to research data
ORD “small” part
of Open Data
Research and
Innovation
ORD and Data Driven Science: Big
Data in Open Science
• Traditional modus operandi for Science: scientific experiments
and observations generate data to test Scientific Hypotheses.
- many limitations (empirical blackholes)
• New opportunities due to “big data”:
- The digitisation of science (e.g. DNA sequencing)
- The internet of everything
• Data Driven Science is the application of Big Data in Science.
• Enormous opportunities:
- 4th paradigm in science (inductive, computional)
- “here are the data, where is the hypothesis?”
- Potential to reboot completely SSH (“social physics”)
• Serious policy issues
Research and
Innovation
Data Driven Science: Big Data in
Science
Most important policy issues for DDS to take off:
• TDM
• Open Access
• Copyright
• Data protection
• Cloud
The EC wants to optimise the impact of
public funded research
• At European level (FP7 & Horizon 2020)
• At Member State level
One way to get there: open access
• to peer-reviewed scientific publications
• to research data
Expected benefits:
• Better, more transparant & efficient science  Open Science & RRI
• Faster uptake of research leading to faster & better innovation &
economic growth  Innovation Union
The ORD policies of the European
Commission are threefold.
The EC as Policy maker
• It proposes EU legislation & legislates with other EU
institutions
• It invites Member States to act
The EC as Funding agency
• It sets its own access and dissemination rules for EC-funded
research
The EC Capacity builder
• It funds projects that support EC/EU policy
ORD Policy developed jointly in DG RTD and CNECT, with
input from the R&I family
EC as funder - Open access in
Horizon 2020
• Mandatory for publications
• Pilot for data
H2020: OA to publications
FP7
• Green open access pilot in 7
areas of FP7 with 'best effort'
stipulation
• Allowed embargoes: 6/12
months
• Gold open access costs
eligible for reimbursement as
part of the project budget
while the project runs
Horizon 2020
• Obligation to provide OA, either
through the Green or Gold way
in all areas
• Allowed embargoes: 6/12 months
• Gold open access costs eligible
for reimbursement as part of the
project budget while the project
runs & post-grant support
being piloted
• Authors encouraged to retain
copyright and grant licences
instead
H2020: Pilot on Open Research
Data
• Certain areas
• Voluntary
• Opt out of the pilot
• Opt in for other areas
• DMP provisions
H2020: Pilot on Open Research Data
Areas of the 2014-2015 Work Programme participating in the Open
Research Data Pilot are:
• Future and Emerging Technologies
• Research infrastructures – part e-Infrastructures
• Leadership in enabling and industrial technologies – ICT
• Societal Challenge: Secure, Clean and Efficient Energy – part Smart cities
and communities
• Societal Challenge: Climate Action, Environment, Resource Efficiency and
Raw materials – except raw materials
• Societal Challenge: Europe in a changing world – inclusive, innovative and
reflective Societies
• Science with and for Society
Projects in other areas can participate on a voluntary basis (already
new areas to be added for 2016-17 WP
Projects may opt out of the Pilot on ORD, if:
• The project will not generate / collect any data
• Conflict with obligation to protect results
• Conflict with confidentiality obligations
• Conflict with security obligations
• Conflict with rules on protection of personal data
• The achievement of the action’s main objective would be
jeopardised by making specific parts of the research data
openly accessible (to be explained in data management
plan)
H2020: Pilot on Open Research Data
H2020: Pilot on Open Research Data
Types of data concerned:
• Data needed to validate the results presented in scientific publications
("underlying data")
• Other data as specified in DMP (=up to projects)
Beneficiaries participating in the Pilot will:
• Deposit this data in a research data repository of their choice
• Take measures to make it possible to access, mine, exploit, reproduce
and disseminate free of charge
• Provide information about tools and instruments at the disposal of the
beneficiaries and necessary for validating the results (where possible,
provide the tools and instruments themselves)
H2020: Pilot on Open Research Data:
Data Management Plan
• DMPs are NOT part of the proposal evaluation: to be
delivered within the first six months of the project and
updated as needed
• DMP’s mandatory for all projects participating in the Pilot, optional
for others
• DMP questions:
•What data will be collected / generated?
•What standards will be used / how will metadata be generated?
•What data will be exploited? What data will be shared / made
open?
•How will data be curated and preserved?
H2020: Pilot on ORD: take-up in first calls
Basis: 3054 Horizon 2020 proposals
Calls in core-areas: opt out 24.2% (442 of
1824 proposals) – range from 9,1-29,1%
Other areas: voluntary opt in 27.2% (334 of
1230 proposals) – range from 9 to 50%
 Preliminary but encouraging results
EC as policy maker on OA: currently
ongoing…
• Analysis of Uptake of ORD Pilot in signed grant
agreements (versus proposals)
• DMP implementation: investigating best-practice; tools to
be developed (2015)
• Top-notch monitoring of OA policies is crucial for further
policy development
EC as capacity builder on OA:
coordination and support actions
(ongoing - FP7 funded)
PASTEUR4OA (Open Access Policy Alignment Strategies for European
Union Research) Started 2014
FOSTER (Foster Open Science Training for European Research)
Started 2014
RECODE - (Policy Recommendations for Open Access to Research
Data in Europe) – 2013, finishing
OpenAIRE/OpenAIRE+: supporting the implementation of Open
Access in Europe (publications and data)
Infrastructure projects(with OA components), e.g. GEO/GEOSS,
ELIXIR…
EC as policymaker - Open access
policies across the EU
Preliminary findings from
(i) NPR reporting template of 13 EU MS & 1 Associated Country
(ii) Results of 2014 ERA Progress Report
General findings
1. Mostly soft measures rather than legislation: exceptions exist
2. OA to publications > than OA to data. Progress as to infrastructures for
data (repositories), but openness still quite complex an issue and not
addressed in many countries (for data)
3. Bigger and “richer” countries have more comprehensive OA policies and
OA enabling infrastructures, as well as tend to lead or participate more
actively in OA networking initiatives
4. Nevertheless, smaller or less federated countries have the advantage of
easier coordination and better synergistic capabilities
ORD in Poland
• OCEAN – a new national research datacentre with focuses on
Big and Open Data is being built. Its tasks will include large
scale data mining as well as long term preservation of
research results.
• There are about 100 digital libraries which contain scientific
papers and several classical scientific repositories.
• Poland is in the FOSTER (Facilitate Open Science Training for
European Research) project, where one of the areas covered
is to provide training on open access to research data.
EC as policymaker – A European Open Science
Cloud?
• Rationale and first ideas
• Work in progress.
• Not to be quoted
Science 2.0 consultation (July-Sept 2014 + validation WS)
o ~ 85 % agree to some extent that data infrastructures are a bottleneck
o Spontaneous position papers from research stakeholders
Possible actions
1. Mandate the development of common interfaces and data standards
2. Coordinate at European Level the funding/ maintenance and
interoperability of research infrastructures
3. Support the development of a European Open Science Cloud for
research
European
Open Science Agenda
A European Open Science Cloud: part of Europe´s ambition to support
the transition to Open Science and make the most of data-driven
science.
o European scientists strongly stated the need for a research data
that is cost-effective, preserves privacy and is IPR-conscious (Science
consultation).
o The cloud provides all EU researchers a virtual environment with
seamless services for data storage, management, analysis and re-use,
disciplines.
o The cloud will federate existing and emerging horizontal and
infrastructures, effectively bridging todays fragmentation and ad-
o The cloud adds value - scale, data-driven science, inter-disciplinarity,
knowledge to innovation - and leverages current and past
investment (10b per year by MS, two decades EU investment).
European
Open Science Cloud
Lifesciences
Lead users… Scientific communities …long tail
Physics
Earthsciences
Economics
Social
sciences
Scaleofscientificactivity(data-drivenscience)
Applied-engineering
……
Humanities Citizen science
European
Open Science Cloud
Data
layer
Service
layer
Governanc
e
layer
Lifesciences
Lead users… Scientific communities …long tail
Physics
Earthsciences
Economics
Social
sciences
Scaleofscientificactivity(data-drivenscience)
High performance computing
Data fusion across disciplines
Big data analytics
Privacy and personal data protection
… …
Data discovery and catalogue
Data manipulation and export
Data access and re-use
Trust
Leverage of MS investment
Legacy and sustainability
IPR protection
Federation
Applied-engineering
……
Humanities
Data storage
Citizen science
European
Open Science Cloud
Bottom-up governance
In summary
• The EC is a strong funder, policy maker and capacity builder
with regard to ORD in particular and OA in general
• It sees it as part of an irreversible change in the modus
operandi of science: open science
• A lot is ongoing and planned
• Much more needs to be done if we want Europe and its
science stakeholders to capitalise on the opportunities ORD,
OA and OS offer.
• The EC follows a bottom up and stakeholder driven apporach

More Related Content

Open Research Data: Present and planned EC Policy, Jean-Claude Burgelman implications of open data

  • 1. Open Research Data: Present and planned EC Policy Jean-Claude Burgelman, S. Luber, R. Von Schomberg, D. Spichtinger, W.Lusoli Head of Unit European Commission DG RTD/A6 Keynote ORD Conference Open Research Data: Implications for Science and Society - Warsaw May 2015
  • 2. A new Commission (2014-19) Andrus Ansip, Vice-President, Digital Single Market Carlos Moedas, Commissioner for Research, Science and Innovation Günther Oettinger, Commissioner for Digital Economy and Society
  • 3. Commissioner view "Open Science, of which Open Access is an important part, will be vital to ensuring European progress and prosperity in the future" (Speech at NETHER, January 26, 2015)
  • 4. Open Research Data • ORD refers to making research data freely available for reuse beyond the purpose for which they were originally collected. • Making Research data freely available aid further discovery, make scientific process more cost efficient and reliable. • ORD is part of a broader change: data driven science underpinning Open Science
  • 5. • A systemic change in the modus operandi of science and research • Affecting the whole research cycle and its stakeholders Open Research Data - Open Science
  • 7. Research and Innovation O P E N S C I E N C E OA Research data Data-intensive Science Data-intensive Science OA to research publications TDM O P E N S C I E N C E BIG DATA OA to research data ORD “small” part of Open Data
  • 8. Research and Innovation ORD and Data Driven Science: Big Data in Open Science • Traditional modus operandi for Science: scientific experiments and observations generate data to test Scientific Hypotheses. - many limitations (empirical blackholes) • New opportunities due to “big data”: - The digitisation of science (e.g. DNA sequencing) - The internet of everything • Data Driven Science is the application of Big Data in Science. • Enormous opportunities: - 4th paradigm in science (inductive, computional) - “here are the data, where is the hypothesis?” - Potential to reboot completely SSH (“social physics”) • Serious policy issues
  • 9. Research and Innovation Data Driven Science: Big Data in Science Most important policy issues for DDS to take off: • TDM • Open Access • Copyright • Data protection • Cloud
  • 10. The EC wants to optimise the impact of public funded research • At European level (FP7 & Horizon 2020) • At Member State level One way to get there: open access • to peer-reviewed scientific publications • to research data Expected benefits: • Better, more transparant & efficient science  Open Science & RRI • Faster uptake of research leading to faster & better innovation & economic growth  Innovation Union
  • 11. The ORD policies of the European Commission are threefold. The EC as Policy maker • It proposes EU legislation & legislates with other EU institutions • It invites Member States to act The EC as Funding agency • It sets its own access and dissemination rules for EC-funded research The EC Capacity builder • It funds projects that support EC/EU policy ORD Policy developed jointly in DG RTD and CNECT, with input from the R&I family
  • 12. EC as funder - Open access in Horizon 2020 • Mandatory for publications • Pilot for data
  • 13. H2020: OA to publications FP7 • Green open access pilot in 7 areas of FP7 with 'best effort' stipulation • Allowed embargoes: 6/12 months • Gold open access costs eligible for reimbursement as part of the project budget while the project runs Horizon 2020 • Obligation to provide OA, either through the Green or Gold way in all areas • Allowed embargoes: 6/12 months • Gold open access costs eligible for reimbursement as part of the project budget while the project runs & post-grant support being piloted • Authors encouraged to retain copyright and grant licences instead
  • 14. H2020: Pilot on Open Research Data • Certain areas • Voluntary • Opt out of the pilot • Opt in for other areas • DMP provisions
  • 15. H2020: Pilot on Open Research Data Areas of the 2014-2015 Work Programme participating in the Open Research Data Pilot are: • Future and Emerging Technologies • Research infrastructures – part e-Infrastructures • Leadership in enabling and industrial technologies – ICT • Societal Challenge: Secure, Clean and Efficient Energy – part Smart cities and communities • Societal Challenge: Climate Action, Environment, Resource Efficiency and Raw materials – except raw materials • Societal Challenge: Europe in a changing world – inclusive, innovative and reflective Societies • Science with and for Society Projects in other areas can participate on a voluntary basis (already new areas to be added for 2016-17 WP
  • 16. Projects may opt out of the Pilot on ORD, if: • The project will not generate / collect any data • Conflict with obligation to protect results • Conflict with confidentiality obligations • Conflict with security obligations • Conflict with rules on protection of personal data • The achievement of the action’s main objective would be jeopardised by making specific parts of the research data openly accessible (to be explained in data management plan) H2020: Pilot on Open Research Data
  • 17. H2020: Pilot on Open Research Data Types of data concerned: • Data needed to validate the results presented in scientific publications ("underlying data") • Other data as specified in DMP (=up to projects) Beneficiaries participating in the Pilot will: • Deposit this data in a research data repository of their choice • Take measures to make it possible to access, mine, exploit, reproduce and disseminate free of charge • Provide information about tools and instruments at the disposal of the beneficiaries and necessary for validating the results (where possible, provide the tools and instruments themselves)
  • 18. H2020: Pilot on Open Research Data: Data Management Plan • DMPs are NOT part of the proposal evaluation: to be delivered within the first six months of the project and updated as needed • DMP’s mandatory for all projects participating in the Pilot, optional for others • DMP questions: •What data will be collected / generated? •What standards will be used / how will metadata be generated? •What data will be exploited? What data will be shared / made open? •How will data be curated and preserved?
  • 19. H2020: Pilot on ORD: take-up in first calls Basis: 3054 Horizon 2020 proposals Calls in core-areas: opt out 24.2% (442 of 1824 proposals) – range from 9,1-29,1% Other areas: voluntary opt in 27.2% (334 of 1230 proposals) – range from 9 to 50%  Preliminary but encouraging results
  • 20. EC as policy maker on OA: currently ongoing… • Analysis of Uptake of ORD Pilot in signed grant agreements (versus proposals) • DMP implementation: investigating best-practice; tools to be developed (2015) • Top-notch monitoring of OA policies is crucial for further policy development
  • 21. EC as capacity builder on OA: coordination and support actions (ongoing - FP7 funded) PASTEUR4OA (Open Access Policy Alignment Strategies for European Union Research) Started 2014 FOSTER (Foster Open Science Training for European Research) Started 2014 RECODE - (Policy Recommendations for Open Access to Research Data in Europe) – 2013, finishing OpenAIRE/OpenAIRE+: supporting the implementation of Open Access in Europe (publications and data) Infrastructure projects(with OA components), e.g. GEO/GEOSS, ELIXIR…
  • 22. EC as policymaker - Open access policies across the EU Preliminary findings from (i) NPR reporting template of 13 EU MS & 1 Associated Country (ii) Results of 2014 ERA Progress Report General findings 1. Mostly soft measures rather than legislation: exceptions exist 2. OA to publications > than OA to data. Progress as to infrastructures for data (repositories), but openness still quite complex an issue and not addressed in many countries (for data) 3. Bigger and “richer” countries have more comprehensive OA policies and OA enabling infrastructures, as well as tend to lead or participate more actively in OA networking initiatives 4. Nevertheless, smaller or less federated countries have the advantage of easier coordination and better synergistic capabilities
  • 23. ORD in Poland • OCEAN – a new national research datacentre with focuses on Big and Open Data is being built. Its tasks will include large scale data mining as well as long term preservation of research results. • There are about 100 digital libraries which contain scientific papers and several classical scientific repositories. • Poland is in the FOSTER (Facilitate Open Science Training for European Research) project, where one of the areas covered is to provide training on open access to research data.
  • 24. EC as policymaker – A European Open Science Cloud? • Rationale and first ideas • Work in progress. • Not to be quoted
  • 25. Science 2.0 consultation (July-Sept 2014 + validation WS) o ~ 85 % agree to some extent that data infrastructures are a bottleneck o Spontaneous position papers from research stakeholders Possible actions 1. Mandate the development of common interfaces and data standards 2. Coordinate at European Level the funding/ maintenance and interoperability of research infrastructures 3. Support the development of a European Open Science Cloud for research European Open Science Agenda
  • 26. A European Open Science Cloud: part of Europe´s ambition to support the transition to Open Science and make the most of data-driven science. o European scientists strongly stated the need for a research data that is cost-effective, preserves privacy and is IPR-conscious (Science consultation). o The cloud provides all EU researchers a virtual environment with seamless services for data storage, management, analysis and re-use, disciplines. o The cloud will federate existing and emerging horizontal and infrastructures, effectively bridging todays fragmentation and ad- o The cloud adds value - scale, data-driven science, inter-disciplinarity, knowledge to innovation - and leverages current and past investment (10b per year by MS, two decades EU investment). European Open Science Cloud
  • 27. Lifesciences Lead users… Scientific communities …long tail Physics Earthsciences Economics Social sciences Scaleofscientificactivity(data-drivenscience) Applied-engineering …… Humanities Citizen science European Open Science Cloud
  • 28. Data layer Service layer Governanc e layer Lifesciences Lead users… Scientific communities …long tail Physics Earthsciences Economics Social sciences Scaleofscientificactivity(data-drivenscience) High performance computing Data fusion across disciplines Big data analytics Privacy and personal data protection … … Data discovery and catalogue Data manipulation and export Data access and re-use Trust Leverage of MS investment Legacy and sustainability IPR protection Federation Applied-engineering …… Humanities Data storage Citizen science European Open Science Cloud Bottom-up governance
  • 29. In summary • The EC is a strong funder, policy maker and capacity builder with regard to ORD in particular and OA in general • It sees it as part of an irreversible change in the modus operandi of science: open science • A lot is ongoing and planned • Much more needs to be done if we want Europe and its science stakeholders to capitalise on the opportunities ORD, OA and OS offer. • The EC follows a bottom up and stakeholder driven apporach

Editor's Notes

  1. See table with more detailed info