Abstract: The data paper is becoming a popular way for researchers to publish their research data. The growing numbers of data papers and journals hosting them have made them an important data source for understanding how research data is published and reused. One barrier to this research agenda is a lack of knowledge as to how data journals and their publications are indexed in the scholarly databases used for quantitative analysis. To address this gap, this study examines how a list of 18 exclusively data journals (i.e., journals that primarily accept data papers) are indexed in four popular scholarly databases: the Web of Science, Scopus, Dimensions, and OpenAlex. We investigate how comprehensively these databases cover the selected data journals and, in particular, how they present the document type information of data papers. We find that the coverage of data papers, as well as their document type information, is highly inconsistent across databases, which creates major challenges for future efforts to study them quantitatively, which should be addressed in the future.
“Until recently, MDPI and Frontiers were known for their meteoric rise. At one point, powered by the Guest Editor model, the two publishers combined for about 500,000 papers (annualized), which translated into nearly USD $1,000,000,000 annual revenue. Their growth was extraordinary, but so has been their contraction. MDPI has declined by 27% and Frontiers by 36% in comparison to their peak.
Despite their slowdown, MDPI and Frontiers have become an integral part of the modern publishing establishment. Their success reveals that their novel offering resonates with thousands of researchers. Their turbulent performance, however, shows that their publishing model is subject to risk, and its implementation should acknowledge and mitigate such risk….”
Abstract: As part of the data-driven paradigm and open science movement, the data paper is becoming a popular way for researchers to publish their research data, based on academic norms that cross knowledge domains. Data journals have also been created to host this new academic genre. The growing number of data papers and journals has made them an important large-scale data source for understanding how research data is published and reused in our research system. One barrier to this research agenda is a lack of knowledge as to how data journals and their publications are indexed in the scholarly databases used for quantitative analysis. To address this gap, this study examines how a list of 18 exclusively data journals (i.e., journals that primarily accept data papers) are indexed in four popular scholarly databases: the Web of Science, Scopus, Dimensions, and OpenAlex. We investigate how comprehensively these databases cover the selected data journals and, in particular, how they present the document type information of data papers. We find that the coverage of data papers, as well as their document type information, is highly inconsistent across databases, which creates major challenges for future efforts to study them quantitatively. As a result, we argue that efforts should be made by data journals and databases to improve the quality of metadata for this emerging genre.
“In the early 2000s, my role at Google was running web indexing: the system that crawls the web, making pages and content discoverable and accessible through search. Nowadays, there’s an assumption that looking for something via Google searches everything, but that wasn’t the case in the early days. Part of my role was to expand the index by reaching out to many different types of organizations – government, business, publishers – to make sure their web sites were included in the index.
A key group among these was scholarly publishers hosting journals and conferences. Having grown up on a university campus, scholarly articles had been all around and I wanted to make sure that they were as easy to find as everything else.
As a part of this, I reached out to HighWire to explore the possibility of indexing the hosted journals. I remember our first call in the Fall of 2002 with John Sack, Todd McGee and several others. A few quick calls, a couple of meetings in person and we were off….”
“We are indexing all kinds of academically relevant resources – journals, institutional repositories, digital collections etc. – which provide an OAI interface and use OAI-PMH for providing their contents (learn more about OAI at the Open Archives Initiative or Wikipedia). In case your source does not provide an OAI interface, upload your documents to aggregators like DataCite or Zenodo, to subject repositories like RePEC or add your open access journal to DOAJ. We are indexing these sources regularly.
However, the best way to get your documents indexed by BASE is to provide an OAI interface. We have compiled some golden rules that might be helpful to optimize your OAI interface. If your OAI interface complies with these rules, we can assure fast and smooth indexing of your source. Data from your source will be presented completely and in the best possible way.
“In recent years, the level of availability has reached a tipping point, whereby at least half of the articles published become available in open access within 12 to 18 months of their publication….This report compares established commercial databases—namely, the Web of Science and Scopus—with a bibliographic database that has been produced with the goal of facilitating the retrieval of gold and green2 open access articles published in peer-reviewed journals. In addition to examining the strengths and limitations of large-scale measurement, this report performs a number of measures, particularly at the country and academic-field levels. It also examines the question of whether articles available in open access are more highly cited that those available strictly with a subscription….The evidence presented in this report shows that at least two-thirds of the articles published between 2011 and 2014 and having at least one U.S. author can be downloaded for free as of August 2016. In the case of Brazil, the proportion reaches 75%. More broadly, the vast majority of the large scholarly publishing countries have more than 50% of their articles published from 2010 to 2014 freely available for download in gold and/or green gratis open access. Examining the availability of articles by domains of scholarly activity shows that health sciences has the most articles available for free (at least 59% of the articles published in 2014 could be read for free in 2016), followed by the natural sciences (55%), applied sciences (47%), economic and social sciences (44%), and arts and humanities (24%)….Whereas current data suggests that gold OA is prevalent in health sciences, green dominates the natural sciences, applied sciences, and economic and social sciences. In the humanities, green and gold are more or less on the same level. …There is evidence that articles available in green OA are overall the most highly cited….”
“This reportdetails population-level measurements of the open access (OA) availability of publications indexed in two bibliometric databases—the Web of Science (WoS) by Clarivate Analytics and Scopus by Elsevier. This was achieved by matching the bibliometric database populations to the 1science database to determine the availability of the papers in OA form.
A comparative analysis of the recall and precision levels of the 1science database was performed using Scopus and the WoS. This helped to characterize the 1science database. Two policy-relevant indicators were selected for in-depth analyses: country affiliation of authors on publications, and scientific disciplines. These indicators were selected because they are very frequently used in bibliometric studies, including those performed by the National Science Foundation (NSF), and they appear in the NSF’s Science and Engineering Indicators….”
“Brazil stands out on the international landscape when it comes to open access, a movement launched in the early 2000s with the aim of making scientific output freely available online. According to data compiled by Spanish research group Scimago, 33.5% of the Brazilian articles indexed in the Scopus database in 2016 were published in journals whose content is free to read online as soon as it is published, under a model known as the “golden road.” This is the largest proportion among the 15 nations with the highest volume of scientific output recorded on Scopus. Brazil is also top of the list of nations with the highest number of open access scientific journals (see charts).”
At ScienceOpen, we offer a range of next-generation indexing services. This includes a package especially for institutes, to help them gain the maximum visibility and re-use of the articles their researchers publish.