Grambank

“Grambank was constructed in an international collaboration between the Max Planck institutes in Leipzig and Nijmegen, the Australian National University, the University of Auckland, Harvard University, Yale University, the University of Turku, Kiel University, Uppsala University, SOAS, the Endangered Languages Documentation Programme, and over a hundred scholars from around the world. Grambank is designed to be used to investigate the global distribution of features, language universals, functional dependencies, language prehistory and interactions between language, cognition, culture and environment. The Grambank database currently covers 2,467 language varieties, capturing a wide range of grammatical phenomena in 195 features, from word order to verbal tense, nominal plurals, and many other well-studied comparative linguistic variables. Grambank’s coverage spans 215 different language families and 101 isolates from all inhabited continents. The aim is for Grambank to ultimately cover all languages for which a grammar or sketch grammar exists. Grambank is part of Glottobank, a research consortium that involves work on complementary databases of lexical data, paradigms, numerals and sound patterns in the world’s languages. Grambank can be used in concert with other databases, such as those in Glottobank and D-PLACE, to deepen our understanding of our history and communicative capabilities.”

Language Diversity in Scholarly Publishing – COKI

“we have mapped the 122 million objects in Crossref up to the end of May 2022 to languages (based on titles and abstracts, where available) and done an initial analysis. The results are a mix of the expected and surprising….

Not surprisingly, English dominates the literature (although with a slowly dropping proportion) with other European languages following including German, French, Spanish and then Portuguese, with Bahasa Indonesian as the next largest language. Spanish and Portuguese grew strongly over the period with Portuguese growing from around 7,000 outputs captured in 2000 to over 150,000 in 2021, reflecting the rise of Brazil as a research powerhouse, and the effectiveness of SciELO as a dissemination platform over that period. Indonesian shows massive growth, probably in part reflecting improved coverage of Crossref metadata over this period along with the massive growth of Indonesian publishing efforts….

Open access shows substantial differences across languages. Perhaps even more importantly, our ability to classify open access types is leading to issues across different languages. Indonesian is a great example. Currently we use DOAJ as the marker of a “completely OA journal” (and we differ from Unpaywall in this at the moment). Many Indonesian journals are not in DOAJ and therefore show as “hybrid”. Unpaywall is also not always able to pick up license information so full OA journals that are not in DOAJ may also get characterized as “bronze”. In Portuguese it is likely that a large proportion of “hybrid” is actually fully OA journals published through SciElO. Categories of open access publishing in Hungarian, Polish, Turkish and many other languages are also likely to need closer examination. We used DOAJ to identify non-APC journals as well and this is likely undercounting this category for Indonesian, Turkish, Portuguese and Spanish outputs. 

Nonetheless, we observe high proportions of articles in non-APC journals in Spanish and Portuguese (attesting to the success of the diamond OA model in Latin America), as well as in a number of other languages, including Nordic languages, many Eastern European languages, and others. Overall, when looking at 2020-2022, for English articles in DOAJ journals, 21% are in non-APC journals, but for articles in languages other than English, this percentage is a massive 86%. Non-APC models appear to dominate the landscape for non-English full OA journals. And amongst English language articles in OA journals (as defined by registration in DOAJ) the APC model definitely dominates. As is often the case, innovation rooted in community needs is more common away from traditional centres of prestige.

Some countries with high levels of open access in English have comparatively low levels in the local language. This is the case for the Netherlands and to some extent France and Germany. This is most likely related to disciplinary differences in what is published in English (with a bias towards STEM and higher levels of OA) vs local language (with a bias towards HSS subjects). By contrast Nordic languages and Norwegian in particular show high levels of open access with an emphasis on APC-free OA journals, likely as a result of local initiatives to fund the conversion of national language journals (which tend to focus on HSS) to open access. The Hr?ak central portal providing support for Croatian journals is another example with Croatian also showing a similar pattern….”

Lakota elders helped a white man preserve their language. Then he tried to sell it back to them.

“ay Taken Alive had been fighting for this moment for two years: At his urging, the Standing Rock Sioux Tribal Council was about to take the rare and severe step of banishing a nonprofit organization from the tribe’s land. 

The Lakota Language Consortium had promised to preserve the tribe’s native language and had spent years gathering recordings of elders, including Taken Alive’s grandmother, to create a new, standardized Lakota dictionary and textbooks. 

 

But when Taken Alive, 35, asked for copies, he was shocked to learn that the consortium, run by a white man, had copyrighted the language materials, which were based on generations of Lakota tradition. The traditional knowledge gathered from the tribe was now being sold back to it in the form of textbooks.  

“No matter how it was collected, where it was collected, when it was collected, our language belongs to us. Our stories belong to us. Our songs belong to us,” Taken Alive, who teaches Lakota to elementary school students, told the tribal council in April. …”

Case Study of Open Access practices: Limitations and Opportunities in Public Libraries in Nigeria | by Isaac Oloruntimilehin | Creative Commons: We Like to Share | Mar, 2022 | Medium

“Online Public Access Catalogue (OPAC)…

Institutional Repositories…

The Nigerian Copyright Act…

Expressions of Folklore…

Nigerian Language Oral History Documentation Project…

Challenges…”

 

Language extinction triggers the loss of unique medicinal knowledge | PNAS

Abstract:  The United Nations proclamation of 2022–2032 as the International Decade of Indigenous Languages aims to raise global awareness about their endangerment and importance for sustainable development. Indigenous languages contain the knowledge that communities have about their surrounding plants and the services they provide. The use of plants in medicine is a particularly relevant example of such ecosystem services. Here, we find that most medicinal knowledge is linguistically unique—i.e., known by a single language—and more strongly associated with threatened languages than with threatened plants. Each indigenous language is therefore a unique reservoir of medicinal knowledge—a Rosetta stone for unraveling and conserving nature’s contributions to people.

 

The Global Extinction of Languages Is Threatening a Vital Type of Human Knowledge

“As human languages are driven to extinction around the world, a verbal encyclopedia of medical knowledge is on the brink of being forgotten.

Among 12,495 medicinal uses for plants in indigenous communities, new research has found over 75 percent of those plants are each tied to just one local language. If these unique words trickle out of use, so too may the knowledge they contain….

Language extinction is a tragic phenomenon that’s been occurring worldwide, as languages spoken by precious few people are replaced by larger ones. Roughly one language ceases to be spoken every four months, and 3,054 languages are currently endangered around the world….

The vast majority of plant species in the study were found to have medical properties described in just one indigenous language, many of which are themselves endangered….

In North America, for instance, the authors found waning indigenous languages held 86 percent of all unique knowledge on plant medicine. In the northwest Amazon, on the other hand, 100 percent of medicinal plant knowledge is restricted to languages on the edge of extinction. …”

Contemporary China Centre Blog » The Hidden Language Policy of China’s Research Evaluation Reform

“In February, China’s Ministries of Education and of Science and Technology released two documents that reshaped the research landscape: “Some Suggestions on Standardizing the Use of SCI Paper Indexes” and “Some Measures to Eliminate the Bad Orientation of ‘Papers Only’.” Elaborating the academic reform that President Xi has pursued since 2016, they provide the first detailed steps for dramatically reducing the role of the Science Citation Index (SCI) in evaluating Chinese research….

For twenty years, the SCI—a prestige listing of “high impact” scientific journals—controlled the careers of Chinese researchers. It and various derived indices are commonly used for university rankings and research evaluation (the UK, for example, uses SCI-derived data to allocate funding), but China relied on the SCI to an unusual degree. There, quotas for publishing in SCI journals governed hiring and advancement, pay bonuses, and even graduation from doctoral programs. In using the SCI as a “gold standard,” Chinese administrators sought to increase productivity, enhance national prestige, and benchmark the closure of gaps between China’s research sector and cutting-edge work internationally.

To a significant extent, these goals have been met. China has risen rapidly up international rankings, and Chinese research productivity routinely exceeds the world average (Li & Wang, 2019). Since 2016, China has been the world’s largest producer of published research, accounting for over a third of all global activity (Xie & Freeman, 2018, p. 2). …

So why change a winning formula? The Ministries’ announcements have focused on eliminating perverse incentives created by over-reliance on the SCI, which saw researchers prioritizing quantity over quality, nepotistically inflating citation counts, and falling prey to predatory journals. The Chinese government has, accordingly, allocated tens of millions of dollars to initiatives for improving Chinese journal quality and combating corrupt publishing practices. At the same time, commentators have noted the potential cost savings of de-centering SCI metrics….”

What Is Global Health: Science and Practice Doing to Address Power Imbalances in Publishing? | Global Health: Science and Practice

“We [at Global Health] are committed to removing publication barriers that can disproportionately impact authors based in LMICs. Recognizing that journal fees can be a major impediment to article submission and publication, especially for researchers in LMICs,11 we continue to make GHSP a no-fee, open-access journal. Furthermore, in instances where English language barriers could hinder publication opportunities, our editorial team has worked with authors to address language barriers and is committed to continuing this practice.”

Disadvantages in preparing and publishing scientific papers caused by the dominance of the English language in science: The case of Colombian researchers in biological sciences

Abstract:  The success of a scientist depends on their production of scientific papers and the impact factor of the journal in which they publish. Because most major scientific journals are published in English, success is related to publishing in this language. Currently, 98% of publications in science are written in English, including researchers from English as a Foreign Language (EFL) countries. Colombia is among the countries with the lowest English proficiency in the world. Thus, understanding the disadvantages that Colombians face in publishing is crucial to reducing global inequality in science. This paper quantifies the disadvantages that result from the language hegemony in scientific publishing by examining the additional costs that communicating in English creates in the production of articles. It was identified that more than 90% of the scientific articles published by Colombian researchers are in English, and that publishing in a second language creates additional financial costs to Colombian doctoral students and results in problems with reading comprehension, writing ease and time, and anxiety. Rejection or revision of their articles because of the English grammar was reported by 43.5% of the doctoral students, and 33% elected not to attend international conferences and meetings due to the mandatory use of English in oral presentations. Finally, among the translation/editing services reviewed, the cost per article is between one-quarter and one-half of a doctoral monthly salary in Colombia. Of particular note, we identified a positive correlation between English proficiency and higher socioeconomic origin of the researcher. Overall, this study exhibits the negative consequences of hegemony of English that preserves the global gap in science. Although having a common language is important for science communication, generating multilinguistic alternatives would promote diversity while conserving a communication channel. Such an effort should come from different actors and should not fall solely on EFL researchers.

 

2020 Workshop Multilingualism – AEUP – Association of European University Presses

“This workshop on multilingualism will bring publishers, translators, librarians, young researchers and interested stakeholders together, to exchange experiences on multilingualism and challenges with open access in academic publishing regarding multilingualism and discuss lobbying strategies….

Balance of localization and globalization: right balance of international and local relevance of research & impact of research
Translation: translation and translators as “invisible contribution” or recognized in their own right, authority of automated translations
Open access specific challenges around multilingualism
Hegemonial challenges around languages:
– linguistic hegemony and dominance in multilingual settings
– minorities and native languages as a cultural trait
– dominance of latin alphabet requirements (aggregators in scientific communication)
– dominance of the concept of universal linguae franca such as English….”

New wiki project – Abstract Wikipedia – will boost content across languages – Neowin

“Wikimedia Foundation has announced a new project that proposes a new way to generate encyclopedic content in a multilingual fashion. Abstract Wikipedia will allow more contributors and more readers to share more knowledge in more languages.

The Wikimedia Foundation is an American non-profit organization founded by Jimmy Wales as a way to fund Wikipedia and its sibling projects. This is the foundation’s first new project in over seven years.

The project was first proposed in a 22-page paper by Denny Vrande?i?, founder of Wikidata, earlier this year. He had floated a new idea that would allow contributors to create content using abstract notation which could then be translated to different natural languages, balancing out content more evenly, no matter the language you speak.

He suggested a project that could be used by anyone in the world to enter information as abstract notation, and then a tool called Wikilambda would host a collection of functions that could turn the notation into natural language text. Per him, the project wouldn’t require a major breakthrough in current knowledge of natural language generation or lexical knowledge representation….”

New wiki project – Abstract Wikipedia – will boost content across languages – Neowin

“Wikimedia Foundation has announced a new project that proposes a new way to generate encyclopedic content in a multilingual fashion. Abstract Wikipedia will allow more contributors and more readers to share more knowledge in more languages.

The Wikimedia Foundation is an American non-profit organization founded by Jimmy Wales as a way to fund Wikipedia and its sibling projects. This is the foundation’s first new project in over seven years.

The project was first proposed in a 22-page paper by Denny Vrande?i?, founder of Wikidata, earlier this year. He had floated a new idea that would allow contributors to create content using abstract notation which could then be translated to different natural languages, balancing out content more evenly, no matter the language you speak.

He suggested a project that could be used by anyone in the world to enter information as abstract notation, and then a tool called Wikilambda would host a collection of functions that could turn the notation into natural language text. Per him, the project wouldn’t require a major breakthrough in current knowledge of natural language generation or lexical knowledge representation….”

Long read | Science needs to inform the public. That can’t be done solely in English | LSE Covid-19

“Science, when communicated exclusively in English, risks not fully meeting its third mission, which is to inform the public. Never before have we seen this phenomenon as intensified as it has been throughout the COVID-19 pandemic. Ideally health-related studies, measures, and responses can be produced and examined by scientists, professionals, governing authorities, and individuals with the benefit of time. In the case of COVID-19, scientific communities have been called upon to assert new knowledge that will satisfy a remarkably urgent dual mission. Doing that only in English will leave many people behind, write Zehra Taskin (Adam Mickiewicz University), Guleda Dogan (Hacettepe University), Emanuel Kulczycki (Adam Mickiewicz University), and Alesia Ann Zuccala (University of Copenhagen)….”

Long read | Science needs to inform the public. That can’t be done solely in English | LSE Covid-19

“Science, when communicated exclusively in English, risks not fully meeting its third mission, which is to inform the public. Never before have we seen this phenomenon as intensified as it has been throughout the COVID-19 pandemic. Ideally health-related studies, measures, and responses can be produced and examined by scientists, professionals, governing authorities, and individuals with the benefit of time. In the case of COVID-19, scientific communities have been called upon to assert new knowledge that will satisfy a remarkably urgent dual mission. Doing that only in English will leave many people behind, write Zehra Taskin (Adam Mickiewicz University), Guleda Dogan (Hacettepe University), Emanuel Kulczycki (Adam Mickiewicz University), and Alesia Ann Zuccala (University of Copenhagen)….”

What Open means in the face of inequity & inequality – Google Slides

“It’s not only about getting the licensing right — it’s also about overcoming linguistic barriers, put resources in place, build the technical infrastructures that are flexible enough to adapt to diverse contexts”