ORCID’s 2022 Public Data File Now Available – ORCID

“ORCID was founded on a set of 10 principles, some of which directly mirror the goals of the Open Access initiative. In particular, our 7th founding principle states: All data contributed to ORCID by researchers or claimed by them will be available in standard formats for free download (subject to the researchers’ own privacy settings) that are updated once a year and released under a CC0 waiver. This is why we publish our annual public data file, as we do each year, usually during Open Access Week. 

Our 2021 data file was downloaded 14,299 times and received one citation. In 2020, the data file contributed to the data visualization that showed the digital footprint of Covid-19 research in an astounding map produced by the Research Graph Foundation. 

The file is available in XML format, however, if you prefer JSON, you can use our ORCID Conversion Library available in our Github repository. This Java application enables the generation of JSON from XML in the default version ORCID schema format.

The data is divided into 12 subsets for easier download and use. The first set contains the full record summary for each record. The other 11 contain the activities for each record, including full work data. We also have an article for those who need help working with bulk data.


We look forward to seeing how the research community will take advantage of this free, open source of data that is an asset to the research ecosystem. Do you have plans to use the public data file? Let us know by contacting us at comms@orcid.org or Tweet us @ORCID_org to let us know. ”

Webinar: Dr. Chris Gilliard aka HyperVisible on Educational Surveillance. 20 Oct 2022, 6pm (EDT)| The Feminist and Accessible Publishing, Communications, + Tech Series @Eventbrite

Dr. Chris Gilliard (@hypervisible) is a leading critic of surveillance technology, digital privacy, and the problematic ways that tech intersects with race and social class. He will talk about the digital forms of surveillance that are coming into schools, colleges, and universities.

Dr. Chris Gilliard is a writer, professor and speaker. His scholarship concentrates on digital privacy, and the intersections of race, class, and technology. He is an advocate for critical and equity-focused approaches to tech in education. His work has been featured in The Chronicle of Higher Ed, EDUCAUSE Review, Fast Company, Vice, and Real Life Magazine. He was recently a a research fellow with the Technology and Social Change Research Project at Harvard Kennedy School’s Shorenstein Center.

This event is part of the 4th Season of the Feminist and Accessible Publishing and Communications Technologies Speaker and Workshop Series (https://www.feministandaccessiblepublishingandtechnology.com), organized by Dr. Alex Ketchum.

Our series was made possible thanks to our sponsors: SSHRC, the Institute for Gender, Sexuality, and Feminist Studies (IGSF), the DIGS Lab, Milieux, Initiative for Indigenous Futures, MILA, Dean of Arts Grant, ReQEF, and more (see our website!)

There is no fee required to attend this event. We will provide professional captions in english. This event will NOT be recorded and NOT bemade available on our website after the event. However, you can watch other past events at: https://www.feministandaccessiblepublishingandtechnology.com/p/videos.html


Navigating Risk in Vendor Data Privacy Practices: An Initial Analysis of Elsevier’s ScienceDirect

“As libraries spend more and more resources licensing platforms, the terms around how vendors treat user data have become complex and difficult to understand. This poses serious concerns given the ever-increasing incentive for vendors to monetize this data—many in ways fundamentally at odds with libraries’ commitment to privacy. Protecting user privacy is a challenge for libraries trying to navigate through the vague and abstruse vendor contracts and policies. To assist libraries in this challenge, SPARC has partnered with Becky Yoose of LDH Consulting Services to analyze vendor contracts and privacy policies to provide libraries a better understanding of the potential risks they pose to user privacy….”

Statement by Library Futures and SPARC on Wiley E-Textbook Withdrawal | SPARC

In late August, at the start of the Fall 2022 school semester, Wiley Publishing Company abruptly withdrew 1,379 multidisciplinary titles from Proquest, a vendor for university ebook collections around the world. As a result, librarians and faculty members in the United States and internationally have scrambled to identify alternative textbook options for their students as the pandemic amplified the trouble with restrictive licensing and e-textbooks.

Library Futures and SPARC strongly condemn this action by Wiley, which seriously hinders students’ access to equitable, affordable course materials. The full list of titles and public contact information for their authors was compiled by Johanna Anderson of #ebookSOS.



Open science and data sharing in trauma research: Developing a trauma-informed protocol for archiving sensitive qualitative data. – PsycNET

Abstract:  Objective: The open science movement seeks to make research more transparent, and to that end, researchers are increasingly expected or required to archive their data in national repositories. In qualitative trauma research, data sharing could compromise participants’ safety, privacy, and confidentiality because narrative data can be more difficult to de-identify fully. There is little guidance in the traumatology literature regarding how to discuss data-sharing requirements with participants during the informed consent process. Within a larger research project in which we interviewed assault survivors, we developed and evaluated a protocol for informed consent for qualitative data sharing and engaging participants in data de-identification. Method: We conducted qualitative interviews with N = 32 adult sexual assault survivors regarding (a) how to conduct informed consent for data sharing, (b) whether participants should have input on sharing their data, and (c) whether they wanted to redact information from their transcripts prior to archiving. Results: No potential participants declined participation after learning about the archiving mandate. Survivors indicated that they wanted input on archiving because the interview is their story of trauma and abuse and it would be disempowering not to have control over how this information was shared and disseminated. Survivors also wanted input on this process to help guard their privacy, confidentiality, and safety. None of the participants elected to redact substantive data prior to archiving. Conclusions: Engaging participants in the archiving process is a feasible practice that is important and empowering for trauma survivors. (PsycInfo Database Record (c) 2022 APA, all rights reserved)

Reasons for qualitative psychologists to share human data – Karhulahti – British Journal of Social Psychology – Wiley Online Library

Abstract:  Qualitative data sharing practices in psychology have not developed as rapidly as those in parallel quantitative domains. This is often explained by numerous epistemological, ethical and pragmatic issues concerning qualitative data types. In this article, I provide an alternative to the frequently expressed, often reasonable, concerns regarding the sharing of qualitative human data by highlighting three advantages of qualitative data sharing. I argue that sharing qualitative human data is not by default ‘less ethical’, ‘riskier’ and ‘impractical’ compared with quantitative data sharing, but in some cases more ethical, less risky and easier to manage for sharing because (1) informed consent can be discussed, negotiated and validated; (2) the shared data can be curated by special means; and (3) the privacy risks are mainly local instead of global. I hope this alternative perspective further encourages qualitative psychologists to share their data when it is epistemologically, ethically and pragmatically possible.


Journal of Medical Internet Research – A Study of Publicly Available Resources Addressing Legal Data-Sharing Barriers: Systematic Assessment

Abstract:  Background:

United States data protection laws vary depending on the data type and its context. Data projects involving social determinants of health often concern different data protection laws, making them difficult to navigate.

Objective:We systematically aggregated and assessed useful online resources to help navigate the data-sharing landscape.

Methods:We included publicly available resources that discussed legal data-sharing issues with some health relevance and published between 2010 and 2019. We conducted an iterative search with a common string pattern using a general-purpose search engine that targeted 24 different sectors identified by Data Across Sectors for Health. We scored each online resource for its depth of legal and data-sharing discussions and value for addressing legal barriers.

Results:Out of 3710 total search hits, 2721 unique URLs were reviewed for scope, 322 received full-text review, and 154 were selected for final coding. Legal agreements, consent, and agency guidance were the most widely covered legal topics, with HIPAA (The Health Insurance Portability and Accountability Act), Family Educational Rights and Privacy Act (FERPA), Title 42 of the Code of Federal Regulations Part 2 being the top 3 federal laws discussed. Clinical health care was the most prominent sector with a mention in 73 resources.

Conclusions:This is the first systematic study of publicly available resources on legal data-sharing issues. We found existing gaps where resources covering certain laws or applications may be needed. The volume of resources we found is an indicator that real and perceived legal issues are a substantial barrier to efforts in leveraging data from different sectors to promote health.

Mündiges Datensubjekt statt Laborratte: Rechtsschutz gegen Wissenschaftstracking | Jahrbuch Technikphilosophie

 by Felix Reda

Bei der Debatte um das Wissenschaftstracking stand bislang vor allem die Sensibilisierung für den Datenschutz im Vordergrund. Das ist ein wichtiger erster Schritt, denn nur wenn Forschende sich darüber bewusst sind, dass ihr Forschungsverhalten Klick für Klick überwacht und kommerziell verwertet wird, können sie sich dafür engagieren, dieser Praxis Einhalt zu gebieten. Doch wie so oft bei Datenschutzthemen droht sich Fatalismus breitzumachen, wenn die Debatte in der Problembeschreibung steckenbleibt.

Viel zu wenige Universitäten bieten ihren Forschenden proaktiv eine eigene, datenschutzsensible Software-Infrastruktur an, die kollaboratives wissenschaftliches Arbeiten auch institutionenübergreifend ermöglichen würde. Große Teile der wissenschaftlichen Literatur sind ausschließlich über die Portale der kommerziellen Wissenschaftsverlage verfügbar, die mit verwirrenden Cookie-Bannern aufwarten. Allein sich einen Überblick zu verschaffen, welche Daten ein Konzern wie Elsevier über einen gespeichert hat, ist ein aufwändiges Unterfangen[1]. Im ohnehin schon stressigen Forschungsalltag ist es unrealistisch, dass einzelne Forschende sich selbst vor dem Tracking durch diese Unternehmen schützen, indem sie deren Produkte meiden.


OSF Preprints | Open science practices in psychiatric genetics: a primer

Abstract:  Open science is a set of practices to ensure that all research elements are transparently reported and freely accessible for all to learn, assess, and build on. Psychiatric genetics has led among the health sciences in implementing some open science practices in common study designs, such as replication as part of genome-wide association studies. However, while additional open science practices could be embedded in genetics research to further improve its quality and accessibility, guidelines for doing so are limited. They are largely not specific to data, privacy, and research conduct challenges in psychiatric genetics. Here, we present a primer of open science practices in psychiatric genetics for multiple steps of the research process, including deciding on a research topic with patients/non-academic collaborators, equitable authorship and citation practices, considerations in designing a replicable, reproducible study, pre-registrations, open data, and privacy issues. We provide tips for creating informative figures, using inclusive, precise language, and following reporting standards. We also discuss considerations in working with non-academic research collaborators (citizen scientists) and outline ways of disseminating research through preprints, blogs, social media, and accessible lecture materials. Finally, we provide a list of extra resources to support every step of the research process.


UPC member states to vote on full access to judgments – JUVE Patent

“On 8 July, the UPC [Unified Patent Court] Administrative Committee will vote on the final draft of the Rules of Procedure, proposing public access to all judgments and orders. …

Participating EU member states first signed off on the Unified Patent Court in 2013. However, ongoing GDPR developments have threatened the transparency of its administration and judicial output. Now a revised final version of the UPC Rules of Procedure stipulates that the public will have access to the content of all decisions and orders.

Judges will remain responsible for redacting any confidential or personal data before formally issuing the decision or order.

The proposals follow multiple professional bodies demanding that, in the interest of transparency, the Rules of Procedure should stipulate full public access to all documents. As such, the committee will consider the Rules of Procedure revisions, along with issues such as which judges will preside over UPC cases, on and from 8 July 2022. All in all, the day will be crucial for the court’s development….”

Many researchers say they’ll share data — but don’t

“Most biomedical and health researchers who declare their willingness to share the data behind journal articles do not respond to access requests or hand over the data when asked, a study reports1. …

But of the 1,792 manuscripts for which the authors stated they were willing to share their data, more than 90% of corresponding authors either declined or did not respond to requests for raw data (see ‘Data-sharing behaviour’). Only 14%, or 254, of the contacted authors responded to e-mail requests for data, and a mere 6.7%, or 120 authors, actually handed over the data in a usable format. The study was published in the Journal of Clinical Epidemiology on 29 May….

Puljak’s results square with those of a study that Danchev led, which found low rates of data sharing by authors of papers in leading medical journals that stipulate all clinical trials must share data2. …

Past research suggests that some fields, such as ecology, embrace data sharing more than others. But multiple analyses of COVID-19 clinical trials — including some from Li4,5 and Tan6 — have reported that anywhere from around half to 80% of investigators are unwilling or not planning to share data freely….

To encourage researchers to prepare their data, Li says, journals could make data-sharing statements more prescriptive. They could require authors to detail where they will share raw data, who will be able to access it, when and how.


Funders could also raise the bar for data sharing. The US National Institutes of Health, in an effort to curb wasteful, irreproducible research, will soon mandate that grant applicants include a data-management and sharing plan in their applications. Eventually, they will be required to share data publicly….”

A blockchain-based infection tracing and notification system by non-fungible tokens – ScienceDirect

Abstract:  SARS-CoV2 pandemic is heavily affecting our lives. Many actions have been undertaken to slow down its expansion and, among the others, contact tracing applications are the less invasive to monitor the spread of the virus. The idea behind contact tracing is to track contacts between people by the exchange of identifiers, not linked to individuals, exploiting the use of Bluetooth Low Energy (BLE) technology to estimate the duration and proximity of contacts. The data collected in this way is used for the sole purpose of notifying a potential contact with an infected person without revealing their identity and location. This paper presents a contact tracing protocol based on blockchain technology that exploits smart contracts for reporting contacts at risk of contagion. The novelty of the proposed solution is the use of Non Fungible Tokens (NFT) to guarantee user privacy through a decentralized approach, equipped with a reliable non-proprietary notification mechanism that allows public access to anonymous infections data.


Michele Gibney Joins SPARC as Visiting Program Officer for Privacy & Surveillance – SPARC

“SPARC is pleased to welcome Michele Gibney as a Visiting Program Officer for Privacy & Surveillance. Michele is the Head of Publishing and Scholarship Support at University of the Pacific in California, where she manages the institutional repository, Scholarly Commons, and will spend about a quarter of her time in the newly established role.”

Fast Track Action Committee on Advancing Privacy Preserving Data Sharing and Analytics Roundtable Series

“The Fast Track Action Committee on Advancing Privacy Preserving Data Sharing and Analytics invites you to participate in a virtual roundtable series to be held via Zoom from 1:00 p.m. – 3:00 p.m. ET on June 7, 9, and 10, 2022.

Privacy-preserving approaches present a technological opportunity to harness the power of data and data analysis techniques in a manner that protects privacy and intellectual property. This can enable more collaboration across entities, sectors, and borders to help tackle shared challenges, such as health care, climate change, financial crime, human trafficking, and pandemic response.

Acknowledging this opportunity, the White House Office of Science and Technology Policy and the Networking and Information Technology Research and Development Program launched the Fast Track Action Committee on Advancing Privacy Preserving Data Sharing and Analytics. This group is tasked with development of a cohesive national strategy to advance the research, development, and adoption of privacy-preserving data sharing and analytics technologies….”

New Guidance to Ensure Federally Funded Research Data Equitably Benefits All of America | The White House

“To help ensure that access is shared equitably by all Americans, the White House Office of Science and Technology Policy (OSTP) has been working for nearly a decade to ensure that Federal agencies with research and development budgets of at least $100 million develop plans to deposit Federally funded data into online digital repositories.

To continue this effort, today OSTP is releasing the report Guidance on Desirable Characteristics of Data Repositories for Federally Funded Research. This guidance contains clearly defined desirable characteristics for two classes of online research data repositories: a general class appropriate for all types of Federally funded data—including free and easy access—and a specific class that has special considerations for the sharing of human data, including additional data security and privacy considerations. Federal agencies can use this guidance to provide more consistent information to their research communities about sharing Federally funded data with the public. 

Agencies can also use this guidance to ensure uniformity as they invest in their own digital repository infrastructure and to make their research data resources more findable, accessible, interoperable, and reusable. It is expected that this guidance will not be static, but rather, will be updated as needed, as new modes of data storage and management emerge and agency needs evolve.  Ultimately, this guidance—along with the agency efforts detailed in OSTP’s recent report to Congress—will help make Federally funded research data more accessible to the American public. The release of this guidance is one of many steps that OSTP is taking to advance equitable delivery of research and strengthen Federal public access policies….”