OpenAIRE Graph: steadily riding the wild wave of Open Science

“Having the vision to create and deliver an open, up-to-date, and global “map of science” across disciplines and countries, from December 2019 onwards, we started providing the OpenAIRE Graph (until recently called the OpenAIRE Research Graph), one of the largest and most heterogeneous collections of scholarly metadata for research products (e.g., articles, datasets, software), other research entities (e.g., projects, institutions, communities), and the links between them. This initiative culminates over ten years of work by OpenAIRE in the domain of scholarly communication to facilitate and advocate for the free flow and sharing of research products and related metadata across researchers, communities, institutions, companies, and policymakers. As a result of this community-driven and technological effort, today, our Graph aggregates and interlinks hundreds of millions of metadata records from tens of thousands of data sources trusted by scientists.

The Graph is being updated bi-weekly and its contents are available for download and re-use as CC-BY via an API, while an open snapshot is released every six months on In addition, the principles, data, and vision of the Graph are community-governed: OpenAIRE AMKE that implements and delivers the Graph, is a non-profit legal entity connecting 49 members that represent research and academic organisations who are committed to Open Science and steer activities in their countries (read our Strategy 2023-2025). OpenAIRE AMKE’s participatory governance structure ensures the Graph’s endorsement, adoption, operation, and sustainability among its members, countries, and research communities. Finally, the underlying infrastructure has recently adhered to the POSI principles.  

The Graph APIs count today 500Mi+ accesses per year via OpenAIRE portals and as third-party services requests. Elsevier’s Scopus and SciVal rely on the APIs, as well as European and worldwide institutional repositories, European Commission (EC’s Participants Portal SYGMA), ORCID, other funders around the globe, researchers, companies, and scholarly services. Furthermore, the Graph will be a key EOSC resource by providing the EOSC with: (i) a catalogue of all research products, core in fostering Open Science and establishing its practices in the daily research activities, and (ii) Open Science monitoring tools, to measure trends and impact of Open Science and funding across communities and Nations. Conceived as a public and transparent good, populated out of data sources trusted by scientists, the Graph aims at bringing discovery, monitoring, and assessment of science back into the hands of the scientific community….”

Scholarly Infrastructure: a Latin American perspective

“With the digitization of scholarship and the rise of the open research movement, new models and outputs of science communication have emerged  beyond the journal article. Scholarly communications is shifting towards the “record of versions”, rather than just a One True “version of record”, where persistent identifiers and their metadata enable recognition, linking and discoverability of a wide range of outputs regardless of where those are housed. It is worth noting the importance of infrastructure in connecting all outputs and resources throughout the research lifecycle (such as research data, software, samples, etc.) to better understand and evaluate the contributions to research, and support their recognition. Many of the organizations providing this kind of foundational infrastructure have been established as non profit community governed and sustained initiatives (Crossref, DataCite, ROR), and are committed to the Principles of Open Scholarly Infrastructure. …

Opposite to what happens in the Global North, in Latin America open access is not an “alternative” model but has long been mainstream. We have a long-term history of Open Access without APCs and are publicly funded. There are few reasons for this, one of which being the high cost of journal subscriptions in the 90s, which worked as a big motivator for the creation of free to access/publish electronic journals.

Despite this Latin American open access tradition, in countries like Colombia, APC payments are increasing. Many voices in the community argue that transformative agreements might threaten the current local ecosystem as the more funds that are allocated for APCs diminish the investment in shared infrastructure and tools.

When it comes to infrastructure, just being open might not be enough; operating infrastructure is not simple and requires investment, capacity building, maintenance, and dedicated staff committed that can ensure accessible, inclusive, and responsive tools. Resilience and sustainability are very sizable challenges that need to be addressed via governance….

A local solution in Latin America for this has been to have independent and self-sustainable organizations that not only publish research results but also fill a role in offering infrastructure and innovation options, training to improve research publishing and dissemination practices, allowing local communities to operate according to the state of the art in scholarly communications, such as SciELO (established in 1998 and celebrating its 25th anniversary with an international conference this year), Redalyc (2002), La Referencia (2010)….


In the end, open goes beyond access and  it’s indispensable for our community to question and rethink the ownership and diversity of research infrastructure. There is an urgent need to reclaim scholarly infrastructure if we want to pursue the benefit of the majority instead of the profit of few. There are many ways to play a more proactive role in steering research infrastructure: (choosing and) using open community-led infrastructure and services, through institutional membership, sharing use cases and feedback for improvement, participating in governance and working groups and more.”

POSI business plan by ChatGPT | 2022-01-04

“Prompt: Write a new business plan for Elsevier that incorporates the Principles for Open Scholarly Infrastructure (POSI).

ChatGPT response: Executive Summary: Elsevier is a global publisher of scientific, technical, and medical (STM) research, with a long history of publishing high-quality research and a strong reputation in the scholarly community. However, in recent years, there has been growing demand for open access to research and a shift towards more open and collaborative approaches to scholarly communication. In response to this shift, Elsevier is introducing a new business plan that incorporates the Principles for Open Scholarly Infrastructure (POSI).

Key Elements of the New Business Plan: Increased support for open access: Elsevier will increase its support for open access publishing, including the establishment of a new open access journal platform and the expansion of its open access book program…….”

DOAJ commits to the Principles of Open Scholarly Infrastructure (POSI) | DOAJ News Service

This is a post by our Managing Director, Joanna Ball.

DOAJ is proud to be a community-driven infrastructure. It is one of the most well-established open infrastructures, having grown and developed considerably since its founding by Lars Bjørnshauge in 2003. Its criteria have been adopted as an unofficial gold standard in Open Access journal publishing across the community. 

I am pleased to share the news that we are committing to the Principles of Open Scholarly Infrastructure (POSI). The principles align with our core values, mission and identity as an open, global and trusted infrastructure.



Better Together: Open new possibilities with Open Infrastructure

“Crossref, DataCite and ORCID work together to provide foundational open infrastructure that is integral to the global research ecosystem. We offer unique, persistent identifiers (PIDs) — Crossref and DataCite DOIs for research outputs and ORCID iDs for people — alongside collecting comprehensive, open metadata that is non-proprietary, accessible, interoperable, and available across borders, disciplines, and time.

As sustainable community-driven scholarly infrastructure providers ORCID, Crossref and Datacite, guarantee data provenance and machine-readability. Persistent identifiers combined with open, standardized, and machine readable metadata enable reliable and robust connections to be made between research outputs, organizations, individuals and much more, as well as being beneficial to others who build services and tools on top of the open infrastructure we provide making content more discoverable.

Join us for a webinar on the 27th June at 7am UTC/ 9am CEST / 5pm AEST where we will discuss:
– Who we are
– What we mean by Open Scholarly Infrastructure
– How our organizations work together for the benefit of the scholarly community
– How the Principles of Open Scholarly Infrastructure (POSI) help to build trust and accountability as well as ensuring we are around for the long term….”

CORE: Our commitment to The Principles of Open Scholarly Infrastructure – CORE

The Principles of Open Scholarly Infrastructure (POSI) offer a set of guidelines by which open scholarly infrastructure organisations and initiatives that support the research community can be operated and sustained. In this post, we demonstrate CORE’s commitment to adhere to these principles and show our current progress in achieving these aims. The principles are divided into three main categories; Governance, Sustainability and Insurance:

POSI fan tutte | Crossref

by Geoffrey Bilder

Just over a year ago, Crossref announced that our board had adopted the Principles of Open Scholarly Infrastructure (POSI).

It was a well-timed announcement, as 2021 yet again showed just how dangerous it is for us to assume that the infrastructure systems we depend on for scholarly research will not disappear altogether or adopt a radically different focus. We adopted POSI to ensure that Crossref would not meet the same fate.


Blog – Europe PMC: Europe PMC adopts the Principles of Open Scholarly Infrastructure

“As a long-standing service and infrastructure provider in the open science ecosystem, Europe PMC supports the Principles of Open Scholarly Infrastructure (POSI). We welcome the momentum gathering behind this initiative to promote the need to support and sustain the open infrastructure.

Europe PMC has been a part of the public and open infrastructure for over 15 years and is run and managed by EMBL-EBI (which is part of the pan-European organisation of EMBL). It is funded by 34 international funders and is community-driven, open infrastructure, set in the context of key global open data resources such as the European Nucleotide Archive (INSDC), the wwPDB and the European Genome-Phenome Archive. All of these resources exist for the public good, led by scientific need and international collaborations, and have open governance structures and a commitment to long-term sustainability. Together with PMC USA, Europe PMC is a part of the PubMed Central International archive network, which plays an integral part in fulfilling shared goals to enable international open science. Europe PMC has been selected as an ELIXIR Core Data Resource, which means that it is of fundamental importance to the wider life-science community and the long-term preservation of biological data….”

Transparency | OurResearch

“We think that organizations working for Open should be sure that they’re being open themselves–with their code and data, and with the details of their operation. We’re doing our best to live up to that, and this page is part of that effort. If you’ve got feedback, drop us a line!

All monetary figures are in US dollars.”

DataCite’s Commitment to The Principles of Open Scholarly Infrastructure

“DataCite was founded in 2009 on the principle of being an open stakeholder governed community that is open to participation from organizations around the world. Today, that continues to be true. Although our services have expanded, we continue to remain grounded to our roots. DataCite’s umbrella was formed with the aim to safeguard common standards worldwide to support research, thereby facilitating compliance with the rules of good scientific practice. DataCite’s identifier registration, Data File, and services are foundational components of the scholarly ecosystem. As the ecosystem continues to evolve, governance, sustainability and living-will insurance have become increasingly important components of the open infrastructure.

Recently several open scholarly infrastructure organizations and initiatives have adopted The Principles of Open Scholarly Infrastructure. DataCite has conducted its own audit against the principles and would like to affirm our commitment to upholding these….”

Dryad’s Commitment to the Principles of Open Scholarly Infrastructure | Dryad news and views

“Researchers, institutions, funders, and publishers have relied on Dryad for over 10 years to support open data publishing. Throughout, we have taken our responsibility as open infrastructure seriously. As a small nonprofit working in a crowded and complicated scholarly communications landscape, it has been our honor to serve as an exemplar for what is possible when you remain committed to the mission and to building coalitions with like minded organizations in order to achieve success.  

Since our founding, members of our board and our team have been involved in constructing best practices for open infrastructure organizations like ours. One foundational set was published as the Principles of Open Scholarly Infrastructure. Since these principles were first published as a blog post in 2015, our team has worked closely with the original authors and additional collaborators to ensure Dryad aligns internal processes and external commitments to reflect these principles. …”

The Principles of Open Scholarly Infrastructure

Open source – All software required to run the infrastructure should be available under an open source license. This does not include other software that may be involved with running the organisation.
Open data (within constraints of privacy laws) – For an infrastructure to be forked it will be necessary to replicate all relevant data. The CC0 waiver is best practice in making data legally available. Privacy and data protection laws will limit the extent to which this is possible
Available data (within constraints of privacy laws) – It is not enough that the data be made “open” if there is not a practical way to actually obtain it. Underlying data should be made easily available via periodic data dumps.
Patent non-assertion – The organisation should commit to a patent non-assertion covenant. The organisation may obtain patents to protect its own operations, but not use them to prevent the community from replicating the infrastructure….”