The Semantic Scholar Open Data Platform

The volume of scientific output is creating an urgent need for automated tools to help scientists keep up with developments in their field. Semantic Scholar (S2) is an open data platform and website aimed at accelerating science by helping scholars discover and understand scientific literature. We combine public and proprietary data sources using state-of-theart techniques for scholarly PDF content extraction and automatic knowledge graph construction to build the Semantic Scholar Academic Graph, the largest open scientific literature graph to-date, with 200M+ papers, 80M+ authors, 550M+ paper-authorship edges, and 2.4B+ citation edges. The graph includes advanced semantic features such as structurally parsed text, natural language summaries, and vector embeddings. In this paper, we describe the components of the S2 data processing pipeline and the associated APIs offered by the platform. We will update this living document to reflect changes as we add new data offerings and improve existing services.

Opinion: Why we’re becoming a Digital Public Good — and why we aren’t | Devex

“A few months ago, Medtronic LABS made the decision to open source our digital health platform SPICE, and pursue certification as a Digital Public Good. DPGs are defined by the Digital Public Good Alliance as: “Open-source software, open data, open AI models, open standards, and open content that adhere to privacy and other applicable laws and best practices, do no harm by design, and help attain the Sustainable Development Goals.” The growing momentum around DPGs in global health is relatively new, coinciding with the launch of the U.N. Secretary General’s Roadmap for Digital Cooperation in 2020. The movement aims to put governments in the driver’s seat, promote better collaboration among development partners, and reduce barriers to the digitization of health systems.”

European Commission provides funding to improve Open Access publishing landscape

“From January 2023, the University of Coimbra will be involved in another Open Science project: the CRAFT-OA project (“Creating a Robust Accessible Federated Technology for Open Access”) involves 23 partners in 14 European countries and will last for 36 months. The project is funded under the Horizon Europe framework programme, aiming to evolve and strengthen the institutional publishing landscape of Diamond Open Access (Diamond OA): no fees for authors or readers.

By offering tangible services and tools for the entire journal publishing lifecycle, CRAFT-OA will empower local and regional platforms and service providers to extend, professionalise and achieve greater interoperability with other scientific information systems for content and platforms. These developments will help researchers and publishers involved in publishing.

The project focuses on four action strands to improve the Diamond OA model:

(1) Providing technical improvements for journal platforms and journal software.

(2) Building communities of practice to promote overall infrastructure improvement

(3) Increase the visibility, discoverability and recognition of Diamond OA publishing

(4) Integrate Diamond OA publishing with the European Open Science Cloud (EOSC) and other large-scale data aggregators….”

Economics and Equity in the Development of Open Research Europe – Munin conference presentation | Zenodo

“Open Research Europe (ORE) is the open access peer-reviewed publishing platform offered by the European Commission as an optional service to Horizon 2020 and Horizon Europe beneficiaries at no cost to them. The platform enables researchers to publish open access without paying out of their research budgets and while complying with their open access obligations.

These are the outputs from an interactive session run by Rob Johnson at the 17th Munin Conference on 30 November 2022 to gather delegates’ feedback on the future operationalisation of Open Research Europe as a collective publishing enterprise.

These outputs supplement the following paper and report: …”

What Do We Want in a Research Platform of the Future? (Session 620 at MLA 2023) – Notes from the CIT

“Instead of just critiquing major information technology (IT) platforms that higher-ed institutions commit to, panelists “lead with the solution.” They set forth visions, plans, and ethical, sustainable principles for the research IT of the future.”

Open Communication Platforms – Google Docs

“The purpose of this document is to organise ideas about open communication platforms for big team science and open research coordination. It can serve as a primer for those looking to set up a platform, or for ideas when developing new platforms. Please add yourself to the Document Contributors list if you make a contribution (feel free to edit anything). …”

Continental Platform | University of Cape Town

“The continental platform allows the African research community to take ownership of creating and sharing its own scholarly content, which contributes to the growth and development of local research for African society. The publishing platform addresses the challenge of Africa’s low production by practicing diamond open access, that is knowledge is free to access for the reader and the author does not pay to publish. In this model, diamond open access is a community-based publishing alternative model that disrupts the commercial publishing system. This shift returns the control of publishing back to the researcher community; free from third-party publishers imposing their restrictions to access.

The continental platform consist of open access journals and open access monographs and textbooks.

If any African institution would like to publish open journals or open monographs/textbooks on this continental platform, please contact Jill Claassen, Section Manager: Scholarly Communication and Research…”

How Africa is overcoming ‘knowledge colonialism’ – 360

“But as the world starts to question why access to knowledge is controlled by a small group of corporate publishers, African scientists are developing their own platform for sharing research findings. It’s sparking interest from other regions around the world. 

The stakes could not be higher. In exchange for making scientific papers free to read — open access — the corporate publishers will often charge an up-front fee to the scientist. This is a win-win for the publishers, who secure payment either way. 

Many nations have negotiated country-wide deals with publishers allowing their scientists to publish and read articles. During these negotiations in South Africa, one publisher inadvertently let slip that in determining the up-front fee for a South African scientist, the publisher weighed up whether the journal had a high ‘impact factor’ and the number of articles South Africa was likely to publish as open access.

The calculations mean a South African scientist is paying almost 200 percent more in up-front fees than a UK scientist (or their institution). If South African scientists have access to read high impact journals (while not necessarily publishing in them) via such nation-wide agreements then it pushes up the up-front fee….”

Home – Galaxy Community Hub

“Galaxy is an open-source platform for FAIR data analysis that enables users to:

Use tools from various domains (that can be plugged into workflows) through its graphical web interface.
Run code in interactive environments (RStudio, Jupyter…) along with other tools or workflows.
Manage data by sharing and publishing results, workflows, and visualizations.
Ensure reproducibility by capturing the necessary information to repeat and understand data analyses….”

“Creating a Robust Accessible Federated Technology for Open Access” (CRAFT-OA): European Commission grants substantial funding to improve institutional publishing for science

CRAFT-OA launch

The project “Creating a Robust Accessible Federated Technology for Open Access” (CRAFT-OA), carried out by 23 experienced partners from 14 European countries, coordinated by the University of Göttingen, Germany will start in January 2023 and run for 36 months. Funded within the Horizon Europe Framework Programme (HORIZON Europe), the project aims to equally evolve and strengthen the Diamond Open Access (Diamond OA, no fees towards authors or readers) institutional publishing landscape. By offering tangible services and tools for the entire life cycle of journal publishing CRAFT OA empowers local and regional platforms and service providers to upscale, professionalise and reach stronger interoperability with other scientific information systems for content and platforms. These developments will help researchers and editors involved in publishing.

The project focuses on four strands of action to improve the  Diamond OA model: (1) Provide technical improvements for journal platforms and journal software (2) Build communities of practice to foster overall infrastructure improvement (3) Increase visibility, discoverability and recognition for Diamond OA publishing (4) Integrate Diamond OA publishing with the European Open Science Cloud (EOSC) and other large-scale data aggregators. Consortium partners in CRAFT-OA bring their long-standing engagement in institutional publishing and infrastructure and are committed to sustaining and developing capacities in the field. CRAFT-OA will deliver technical tools, training events, training materials, information, and services for the Diamond OA institutional publishing environment. It will foster communities of practice with the capacity to sustain the project improvements over time.

Margo Bargheer, CRAFT-OA Coordinator, University of Göttingen:

There are countless engaged open access journals out there, making a point to offer Diamond Open Access options to their communities. With our project, they will benefit from shared developments and shared services, but most of all from shared knowledge around professional institutional publishing and stronger networks to reach resilience within their own operation.

EU-Projects support scholarly publishing

CRAFT-OA is linked with other European projects supporting Diamond Open Access, especially the 3-years DIAMAS project (Developing Institutional Open Access Publishing Models to Advance Scholarly Communication). As CRAFT-OA mainly supports Diamond Open Access publishing by providing a technology update, the DIAMAS project supports Diamond Open Access on a non-technical level by building up a capacity centre and a community. The PALOMERA project (Policy Alignment of Open Access Monographs in the European Research Area) investigates institutional scholarly communication as well. Still, it concentrates on contrary to journals on books and especially policies for books. It launches in January 2023 and will run for two years.

Consortium and skills

CRAFT-OA’s 23 consortium partners from 14 European countries are all engaged in institutional publishing and infrastructure, and committed to sustaining and developing capacities in the field. A wide variety of skills and expertise is represented via the consortium partners participating in the project: 

For more information please contact the coordinator Margo Bargheer, bargheer[a]sub.uni-goettingen.de


EU Funding

Open Science – A Croatian Perspective | Open Science Talk

Abstract:  Jadranka Stojanovski discusses the evolution of library support for open science from a Croatian perspective. From her vantage point as (former) library director of the Ru?er Boškovi? Institute and associate professor at the department of information science of the University of Zadar, Stojanovski has been a pioneer in establishing services exploiting the possibilities offered by new information technologies since the 1990s. Many of her activities have been connected to broad European collaborative projects such as OpenAIRE, OASPA, and EOSC.

The Croatian approach has been a very proactive one. Already in 1997, the CROSBI was launched, a combined national scientific bibliography and repository for Green Open Access documents. Although deposition of articles and other research documents is entirely voluntary, CROSBI now carries metadata on more than 725,000 documents, a large proportion of which are available in fulltext. Alongisde CROSBI, there are also several institutions running their own institutional repositories. There is now extensive collaboration between these services in the form of DABAR (‘beaver’ in English), aiming to enhance the interoperability and findability of documents stored in the various repositories. Stojanovski has also been involved in setting up an inventory on Who’s Who in Science in Croatia as well as a database on scientific equipment, Šestar (‘pair of compasses’).

Set up in 2005, the HR?AK (‘hamster’) platform for Croatian scientific and professional journals has been a massive success. Less than twenty years after its inception, it now carries more than 500 scholarly journals and series of conference proceedings, nearly all of which are Diamond Open Access (i.e., free to the reader and with no author-facing publishing charges). Roughly 150 of these journals receive annual subsidies from the government, the rest are fully based on voluntary work from individual editors and the institutions or learned societies they represent. Only around 25 HR?AK journals charge Article Processing Charges. The Social Sciences and Humanities are particularly well represented on the platform, with many journals publishing in Croatian despite the lack of an official language policy in favor of Croatian as a scholarly language. The University Computing Centre in Zagreb (SRCE, ‘heart’) is responsible for the technical development of HR?AK, which is based on seamless interconnection between in-house developed software and open-source software for editorial processes, primarily Open Journal Systems.

A national Research Data Policy or, better still, a general Open Science Policy is highly desirable, Stojanovski argues. Infrastructure is in place, but usage will undoubtedly rise significantly as soon as open science practices become mandatory.

Alongside Dominic Tate (episode 43) and Pierre Mounier (episode 44), Jadranka was a keynote speaker at the 17th Munin Conference on Scholarly Publishing. This interview was first published online on December 13, 2022.

Stockholm University’s publishing platform | recording of September 2022 interviews with developers

“Soon Stockholm University is to launch its own publishing platform where researchers can publish articles with open peer review and with an open license. A pilot of the platform will be released during Open Access Week 2022, where 50 researchers affiliated with Stockholm University will participate and try the tool for a few months, before it is made available to others as well. Interviewed in this video clip is Wilhelm Widmark, Senior Advisor to the President with operational responsibility for Open Science at Stockholm University, and Abeni Wickham, founder of SciFree and developer of the platform. The interview was originally published in the staff web cast Panorama in September 2022, led by Stefan Nyman at the Communication Team at Stockholm University….”