Digital Technologies Expo
How can researchers and students in Chinese and Korean Studies best take advantage of existing biographical databases, historical geographic datasets, and dictionaries when analyzing their own text collections or sources gathered from full-text databases? In this workshop Professor De Weerdt will introduce the thinking behind, and the functionality of, MARKUS and associated platforms designed for the customized, semi-automated annotation of texts and for the analysis and visualization of the resulting data.
The first part of the workshop focuses on the main functionality of the MARKUS platform and offers a step-by-step overview of supported research processes: primary source text discovery and import from textual databases, automated and manual mark-up of default named entities and user-generated tags, keyword discovery, batch mark-up, linked Chinese, Korean, and Manchu reference materials, data curation, content filtering, and data export. We will also look at additional features, including the automated learning module, in which MARKUS can be trained to obtain higher accuracy and greater recall, as well as relational markup, metadata markup and tag conversion, and text reuse markup in COMPARATIVUS.
In the second part of the workshop we will discuss how the various types of data marked up in MARKUS and COMPARATIVUS can be explored and mapped in associated textual analysis and data visualization platforms such as Palladio (designed at the Stanford Humanities and Design Research Lab), PLATIN (designed at the Max Planck Institute for the History of Science), and DOCUSKY (designed at National Taiwan University by Hsiang Jieh, Tu Hsieh-Chang et al.).
MARKUS and COMPARATIVUS are platforms designed by Brent Ho and Hilde De Weerdt at Leiden University with modules contributed by Mees Gelein, Tu Hsieh-Chang, and Miao Shengfa.
This workshop will cover methods for both close and distant reading of digital Chinese (in both full-form and simplified characters) and Korean texts. These methods are broadly applicable to a variety of languages and across disciplines.