Digital Cultural Heritage Papers - 2024 Roundup
A roundup of papers, essays, articles, book, blogposts, reports etc published this year I've read (ok, sometimes just scanned) for topics I'm currently interested in, i.e. mostly digital cultural heritage or AI/web/computing related. And some random other stuff.
I reserve the right to return in the future to add other papers published this year that I missed, as my interests change. There are also many other papers I should have added here from earlier in the year, I may add more if I have another burst of enthusiam to trawl through my Zotero.
Infrastructure
Christopher Smith - On funding arts and humanities infrastructure in the UK (with a teaser for 2025 plans!) - https://anatomiesofpower.wordpress.com/2024/12/31/funding-arts-and-humanities/
Waters, D.J. (2023) ‘The emerging digital infrastructure for research in the humanities’, International Journal on Digital Libraries, 24(2), pp. 87–102. Available at: https://doi.org/10.1007/s00799-022-00332-3.
Peter Wells The National Data Library should help people deliver trustworthy data services (Dec 2024) - https://peterkwells.com/2024/12/18/the-national-data-library-should-help-people-deliver-trustworthy-data-services/
Linked Data
Sanderson, R. (2024) ‘Implementing Linked Art in a Multi-Modal Database for Cross-Collection Discovery’, Open Library of Humanities, 10(2). Available at: https://doi.org/10.16995/olh.15407.
Data Wrangling
Beyond HTTP APIs: the case for database dumps in Cultural Heritage - https://literarymachin.es/beyond-api-data-dumps/
Computational Text Analysis
Lit, C. van and Roorda, D. (2024) ‘Neither Corpus Nor Edition: Building a Pipeline to Make Data Analysis Possible on Medieval Arabic Commentary Traditions’, Journal of Cultural Analytics, 9(3). Available at: https://doi.org/10.22148/001c.116372.
(conf paper anstract) "Exploring Zero-Shot Named Entity Recognition in Multilingual Historical Travelogues Using Open-Source Large Language Models" - https://clin34.leidenuniv.nl/abstracts/exploring-zero-shot-named-entity-recognition-in-multilingual-historical-travelogues-using-open-source-large-language-models/
AI
DeepMind - A new golden age of discovery - https://deepmind.google/public-policy/ai-for-science/ - "In this essay, we take a tour of how AI is transforming scientific disciplines from genomics to computer science to weather forecasting. Some scientists are training their own AI models, while others are fine-tuning existing AI models, or using these models’ predictions to accelerate their research"
ODI A data for AI taxonomy - https://theodi.org/news-and-events/blog/a-data-for-ai-taxonomy/ - "[...] we set out to develop a taxonomy of the data involved in developing, using and monitoring foundation AI models and systems. It is a response to the way that the data used to train models is often described as if a static, singular blob, and to demonstrate the many types of data needed to build, use and monitor AI systems safely and effectively."
A Large Language Model walks into an archive... https://cblevins.github.io/posts/llm-primary-sources/
VLM Art Analysis by Microsoft Florence-2 and Alibaba Cloud Qwen2-VL - https://huggingface.co/blog/PandorAI1995/vlm-art-analysis-by-florence-2-b-and-qwen2-vl-2b
OCR Processing and Text in Image Analysis with Florence-2-base and Qwen2-VL-2B - https://huggingface.co/blog/PandorAI1995/ocr-processing-text-in-image-analysis-vlm-models
OpenAI - Introducing SimpleQA - "Factuality is a complicated topic because it is hard to measure—evaluating the factuality of any given arbitrary claim is challenging, and language models can generate long completions that contain dozens of factual claims. In SimpleQA, we will focus on short, fact-seeking queries, which reduces the scope of the benchmark but makes measuring factuality much more tractable." - https://openai.com/index/introducing-simpleqa/
Digital Humanities
The Bloomsbury Handbook to the Digital Humanities - https://www.bloomsbury.com/uk/bloomsbury-handbook-to-the-digital-humanities-9781350452572/#
Digital Editions
On Automating Editions The Affordances of Handwritten Text Recognition Platforms for Scholarly Editing - https://scholarlyediting.org/issues/41/on-automating-editions/
3D Printing
Volpe, Y. et al. (2014) ‘Computer-based methodologies for semi-automatic 3D model generation from paintings’, International Journal of Computer Aided Engineering and Technology, 6(1), p. 88. Available at: https://doi.org/10.1504/IJCAET.2014.058012.
Web Development
"[...] this investigation into JavaScript-first frontend culture and how it broke US public services has been released in four parts." - https://infrequently.org/2024/08/the-landscape/
Conference Proceedings
SWIB24 - Semantic Web in Libraries - https://swib.org/swib24/programme.html
Computational Humanities Research CH 2024 - https://ceur-ws.org/Vol-3834/
Vis4DH 2024 - Didn't happen ?
Journals
New Journals I've come across (or re-discovered):
Interdisciplinary Digital Engagement in Arts & Humanities (IDEAH) - https://ideah.pubpub.org/
Public Humanities - https://www.cambridge.org/core/journals/public-humanities
RIDE - "RIDE is an open access review journal dedicated to digital editions and resources" - https://ride.i-d-e.de/
DH Benelux Journal - "DH Benelux Journal is the official journal of the DH Benelux community, which fosters collaboration between researchers in the digital humanities in Belgium, Luxembourg and the Netherlands. " - https://journal.dhbenelux.org/
Journal of Open Research Software - "The Journal of Open Research Software (JORS) features peer reviewed Software Metapapers describing research software with high reuse potential." - https://openresearchsoftware.metajnl.com/
Transformations - A DARIAH Journal is a multilingual journal created in 2024 by the European research infrastructure DARIAH ERIC. This journal is an ongoing publication with thematic issues in Digital Humanities, humanities, social sciences, and the arts. The journal is particularly interested in the use of digital tools, methods, and resources in a reproducible approach. It welcomes scientific contributions on collections of data, workflows and software analysis. - https://transformations.episciences.org/