ICDAR 2024 Competition on Handwriting Recognition of Historical Ciphers

Handwritten Text Recognition (HTR) in low-resource scenarios (i.e. when the amount of labeled data is scarce) is a challenging problem. This is particularly true for historical encrypted manuscripts, commonly known as ciphers, which contain secret messages and were typically used in military or dipl...

ver descrição completa

Detalhes bibliográficos
Autores: Fornés Bisquerra, Alicia|||0000-0002-9692-5336, Chen, Jialuo|||0000-0002-7808-6567, Torras, Pau|||0000-0003-0327-9046, Badal Pérez-Alarcón, Carles|||0000-0001-8948-9946, Megyesi, Beáta|||0000-0002-4838-6518, Waldispühl, Michelle|||0000-0002-5603-6083, Kopal, Nils, Lasry, George
Tipo de documento: capítulo de livro
Data de publicação:2024
País:España
Recursos:Universitat Autònoma de Barcelona
Repositório:Dipòsit Digital de Documents de la UAB
Idioma:inglês
OAI Identifier:oai:ddd.uab.cat:325042
Acesso em linha:https://ddd.uab.cat/record/325042
https://dx.doi.org/urn:doi:10.1007/978-3-031-70552-6_20
Access Level:Acceso aberto
Palavra-chave:Handwritten Text Recognition
Historical Documents
Cipher Recognition
Competition
Descrição
Resumo:Handwritten Text Recognition (HTR) in low-resource scenarios (i.e. when the amount of labeled data is scarce) is a challenging problem. This is particularly true for historical encrypted manuscripts, commonly known as ciphers, which contain secret messages and were typically used in military or diplomatic correspondence, records of secret societies, or private letters. To hide their contents, the sender and receiver created their own secret method of writing. The cipher alphabets often include digits, Latin or Greek letters, Zodiac and alchemical signs, combined with various diacritics, as well as invented ones. The first step in the decryption process is the transcription of these manuscripts, which is difficult due to the great variation in handwriting styles and cipher alphabets with a limited number of pages. Although different strategies can be considered to deal with the insufficient amount of training data (e.g., few-shot learning, self-supervised learning), the performance of available HTR models is not yet satisfactory. Thus, the proposed competition, which includes ciphers with a large number of symbol sets and scribes, aims to boost research in HTR in low-resource scenarios.