Digital Technologies Expo
Rapid Online Search Engine for Scanned Materials (ROSES) is an online platform that provides keyword-based searching in scanned materials while requiring minimal typing labor and allowing fast, easy deployment. Text digitization in major parts of Asia faces entrenched challenges, including: a) underdeveloped OCR systems with relatively high error rates for most scripts, b) costly, unsupported, and ineffective procedures for verifiable double keying, c) complex TEI standards that require expertise and are costly to implement, and d) non-existent or non-standardized platforms for searching and textual excavation in digitized texts across languages. ROSES was developed to bypass some of these problems by offering an easy solution for rapid browsing in two kinds of scanned materials: 1) alphabetized dictionaries and lexicons, and 2) indexed books. ROSES can be deployed for printed texts written in any Asian language and requires minimal resources and configuration.
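This abstract does not describe how ROSES is implemented. As an illustration only, here is a minimal sketch of one way keyword lookup in a scanned alphabetized dictionary could work without OCR or full keying. It assumes that only the first headword of each scanned page is typed in, and that a binary search over those headwords finds the page to display. The names `PAGE_INDEX` and `find_page`, the sample headwords, and the page numbers are all hypothetical, and plain string comparison stands in for whatever collation order a given script requires.

```python
from bisect import bisect_right

# Hypothetical sparse index: (first headword on the page, scan page number),
# listed in the dictionary's own alphabetical order.
PAGE_INDEX = [
    ("agni", 1),
    ("deva", 14),
    ("karma", 37),
    ("nadi", 52),
    ("yoga", 80),
]


def find_page(query, index=PAGE_INDEX, key=lambda word: word):
    """Return the scan page on which `query` would appear.

    `key` maps a word to its sort key; the identity default is an
    assumption that only suits scripts whose collation order matches
    plain string comparison.
    """
    keys = [key(headword) for headword, _ in index]
    # Last page whose first headword sorts at or before the query.
    position = bisect_right(keys, key(query)) - 1
    # Queries that sort before the first headword fall on the first page.
    position = max(position, 0)
    return index[position][1]


if __name__ == "__main__":
    for word in ("dharma", "karma", "zebra"):
        print(word, "-> page", find_page(word))
```

Under these assumptions, only one typed entry per scanned page is needed, which is in keeping with the minimal typing labor described above.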
In this presentation I will introduce ROSES to the scholarly community for the first time and discuss the history of the project, its coding principles, its potential scholarly uses, and its possibilities for easy deployment. In addition, I will discuss the advantages of using ROSES for browsing scanned materials over full digitization, the kinds of textual materials best suited to ROSES deployment, and the specific areas of research that could benefit from it.
ROSES Online: https://www.digitalroses.net