“Digital archives are transforming the Humanities and the Sciences. Digitized collections of newspapers and books have pushed scholars to develop new, data-rich methods. Born-digital archives are now better preserved and managed thanks to the development of open-access and commercial software. Digital Humanities have moved from the fringe to the center of academia. Yet, the path from the appraisal of records to their analysis is far from smooth. This book explores crossovers between various disciplines to improve the discoverability, accessibility, and use of born-digital archives and other cultural assets….
Introduction
Seiten 7 – 28
Chapter 1: Artificial Intelligence and Discovering the Digitized Photoarchive
Seiten 29 – 60
Chapter 2: Web Archives and the Problem of Access: Prototyping a Researcher Dashboard for the UK Government Web Archive
Seiten 61 – 82
Chapter 3: Design Thinking, UX and Born-digital Archives: Solving the Problem of Dark Archives Closed to Users
Seiten 83 – 108
Chapter 4: Towards Critically Addressable Data for Digital Library User Studies
Seiten 109 – 130
Chapter 5: Reviewing the Reviewers: Training Neural Networks to Read Peer Review Reports
Seiten 131 – 156
Chapter 6: Supervised and Unsupervised: Approaches to Machine Learning for Textual Entities
Seiten 157 – 178
Chapter 7: Inviting AI into the Archives: The Reception of Handwritten Recognition Technology into Historical Manuscript Transcription
Seiten 179 – 204
AFTERWORD: Towards a new Discipline of Computational Archival Science (CAS)
Seiten 205 – 218 …
[From the Introduction:]
The closure of libraries, archives and museums due to the COVID-19 pandemic has highlighted the urgent need to make archives and cultural heritage materials accessible in digital form. Yet too many born-digital and digitized collections remain closed to researchers and other users due to privacy concerns, copyright and other issues. Born-digital archives are rarely accessible to users. For example, the archival emails of the writer Will Self at the British Library are not listed on the Finding Aid describing the collection, and they are not available to users either onsite or offsite. At a time when emails have largely replaced letters, this severely limits the amount of content openly accessible in archival collections. Even when digital data is publicly available (as in the case of web archives), users often need to physically travel to repositories to consult web pages. In the case of digitized collections, copyright can also be a major obstacle to access. For instance, copyrightprotected texts are not available for download from HathiTrust, a not-for-profit collaborative of academic and research libraries preserving 17+ million digitized items (including around 61% not in the public domain)….
It is important to recognize that “dark” archives contain vast amounts of data essential to scholars – including email corres