The string2string library is an open-source tool that offers a comprehensive suite of efficient algorithms for a broad range of string-to-string problems. It includes both traditional algorithmic solutions and recent advanced neural approaches to address various problems in pairwise string alignment, distance measurement, lexical and semantic search, and similarity analysis. Additionally, the library provides several helpful visualization tools and metrics to facilitate the interpretation and analysis of these methods.
Quo vadam? Start here! / How to Use Scrapbox? / Social Networks (Current) / Social Networks (Research) / Initials / Arrows in Manuscripts / Cats and Manuscripts / Standoff Annotation (TEI) / Marginal
The Lat-Epig interface allows you to query the EDCS and save the search result in a TSV file and plot the results on a map of the Roman Empire without any prior knowledge of programming. - GitHub - mqAncientHistory/Lat-Epig: The Lat-Epig interface allows you to query the EDCS and save the search result in a TSV file and plot the results on a map of the Roman Empire without any prior knowledge of programming.
Beautiful visualizations of how language differs among document types. - GitHub - JasonKessler/scattertext: Beautiful visualizations of how language differs among document types.
The research of ancient written artefacts results in an ever-increasing amount of digital data in different forms, ranging from raw images of artefacts to automatically generated data from advanced acquisition techniques. The manual analysis of this data is typically time consuming and can be subject to human error and bias. Therefore, a set of Pattern Analysis Software Tools (PAST) has been developed for the automatic analysis of visual and tabular patterns in the research data from the study of ancient written artefacts. These software tools have been developed by Hussein Mohammed to facilitate a more efficient study of written artefacts and to help scholars benefit from the rapid advancements in the fields of pattern analysis and artificial intelligence. Furthermore, these tools can provide new insights which can only be derived from the statistical analysis of research data. Each tool in PAST is developed and tested in close collaboration with experts from relevant fields of research in order to ensure its usability and applicability to actual research questions.
The BERT for Humanists project is developing resources to enable DH scholars to explore how BERT-like models can be used in their research and teaching. Find an annotated bibliography of research papers and tools, a glossary of relevant terms, code tutorials, and information about our virtual workshop in June 2021.
CRMtex is an extension of CIDOC CRM created to support the study of ancient documents by identifying relevant textual entities and by modelling the scientific process related with the investigation of ancient texts and their features in order to foster integration with other cultural heritage research fields, such as archaeology and history.
In this step-by-step tutorial, you'll learn how to use spaCy. This free and open-source library for natural language processing (NLP) in Python has a lot of built-in capabilities and is becoming increasingly popular for processing and analyzing data in NLP.
M. Springstein, S. Schneider, C. Althaus, and R. Ewerth. MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10 - 14, 2022
, page 1107--1116. ACM, (2022)
D. Erdmann, and K. Vogel. Digital Turn und Historische Bildungsforschung. Bestandsaufnahme und Forschungsperspektiven
, Julius Klinkhardt, Bad Heilbrunn, (2022)
V. Alexiev, and I. Nikolova. Data Sharing, Holocaust Documentation and the Digital Humanities. Best Practices, Benefits, Case Studies (DSDH 2017)
, Venice, Italy, (June 2017)
E. Losh, and J. Wernimont (Eds.).
. Debates in the digital humanities University of Minnesota Press, Minneapolis, Minn., (2018)Includes bibliographical references and index.