LT4HALA 2026
--Home-- --CFP-- --EvaLatin-- --EvaHan-- --EvaCun-- --Program-- --Organization--
9:00-9:15 Welcome
9:15-10:30 Oral Session 1
- 9:15-9:30 - Vladimir Polomac and Silvie Cinkova, Morphological Annotation of Old Serbian in Universal Dependencies
- 9:30-9:45 - Aleš Manuel Manuel Papáček and Zdeněk Žabokrtský, Tracing Morph Origins in Czech: A Computational Approach to Morph-Level Etymology
- 9:45-10:00 - Ellinor Lindqvist, Eva Pettersson and Joakim Nivre, Uncovering Work from Words: LLM-Based Information Extraction from Historical Petitions
- 10:00-10:15 - Costanza Marini, Gianluca Casagrande, Alessio Palmero Aprosio and Claudia Principe, Extracting Volcanological Knowledge from Historical Texts: A Language-Technology Pipeline for Diachronic Geovisualization
- 10:15 - 10-30 - Manuel Favaro, Elisa Guadagnini, Eva Sassolini, Marco Biffi and Simonetta Montemagni, When Lexicographic Quotations Become a Corpus: To Deduplicate or Not to Deduplicate?
10:30-11:30 Poster Session + Coffee Break
- Elisha Rosensweig, Yitzchak Lindenbaum, Hillel Gershuni, Vered Raziel-Kretzmer, Daniel Caine and Avi Shmidman, A New State-of-the-Art BERT Model for Judeo-Arabic
- Iglika Nikolova-Stoupak, Maxime Amblard and Frédérique Rey, BEReshiT: an Ancient Hebrew Model based on DictaBERT
- Hang Zhu, Mitoki Ohara, Rei Kikuchi, Kanako Komiya, Masayuki Asahara and Sachi Kato, Automatic Detection of Metaphorical Expressions in Classical Japanese Using WLSP-Enhanced BERT
- Shmuel LIebeskind, Maayan Zhitomirsky-Geffet, Binyamin Katzoff, Nati Ben-Gigi and Jonathan Schler, Domain-Aware Error Correction for Citation NER in Medieval Hebrew Responsa
- Colin Swaelens, Francesco Mambrini and Marco Passarotti, From Lemmas to Links: A Lemma Bank for Ancient Greek
- Wenhui Cui and Phillip Benjamin Ströbel, Across Generations: A Comparative Analysis of NER for Latin Inscriptions from Classical Machine Learning to LLMs
- Irene Miani, Gregory Darwin and Sara Stymne, POS Tagging with Generative LLMs for Historical Germanic Low-Resource Languages: An Evaluation Against Fine-Tuned BERT
- Nicklas Sindlev Anderesn, Byron Macdougall, Tariq Yousef and Aglae Pizzone, From Manuscript to Model: Developing HTR for Medieval Greek
- Marijke Beersmans, Evelien de Graaf, Julie Nijs, Valeria Irene Boano, Alek Keersmaekers, Mark Depauw, Tim Van de Cruys and Margherita Fantoli, I, RE:Claudius 256: Towards Linking Classical Latin Person Mentions to a Domain-specific Knowledge Base
- Guan-Yu Tseng, Chunki Lim, Chih-Han Lin, Tung-Le Pan, Yu-Chieh Wang, Lang-Jing Yeh and Shu-Kai HSIEH, Capturing Ancient Chinese Sense Induction with Automatic Pipelines
11:30-12:15 Oral Session 2
- 11:30-11:45 - Evgeniya Korovina, A Computational Evaluation of Syllabic Hypotheses for Rongorongo: Evidence from N-gram Analysis
- 11:45-12:00 - Beata Megyesi, Rune Rattenborg, Benedek Láng, Michelle Waldispühl and Mihály Héder, Building a Corpus and Database for Rare and Undeciphered Scripts
- 12:00-12:15 - Tal Bernstein, Shai Gordin and Letizia Cerqueglini, A Layered Annotation Workflow for Semitic Epigraphy
12:15-13:00 EvaLatin
- 12:15-12:25 - Federica Iurescia, Marco Passarotti and Rachele Sprugnoli, Overview of the Dependency Parsing Task at EvaLatin 2026
- 12:25-12:35 - Luc Pommeret, Thibault Wagret and Jules Deret, THIVLVC: Retrieval Augmented Dependency Parsing for Latin
- 12:35-12:45 - Valeria Irene Boano, Eleonora Litta and Matteo Romanello, Overview of the Named Entity Recognition Task at EvaLatin 2026
- 12:45-12:55 - Callum Chan, Transfer Learning for Named Entity Recognition of Classical Latin through LLM Prompting
- 12:55-13:00 - QA
13:00-14:00 Lunch Break
14:00-15:15 EvaHan
- Li Bin, Opening Remarks
- Feng Zhiwei, Invited talk
- Wang Dongbo, Overview of EvaHan2026: The First International Evaluation on Ancient Chinese OCR and Layout Analysis
- KeYan Liang, Meiling Liu, A Multi-Stage System for Ancient Chinese OCR and Layout Understanding in the EvaHan2026 Shared Task
- Chaokun Zhang, Xin Wen, Tongtong Zhou, A Multi-Modal Recognition Framework for Ancient Books Integrating DoRA-DPO Text Recognition and YOLO Layout Analysis
- Yihuan Yin, Qian Zhao, Beijing Normal University at EvaHan 2026: Enhancing Ancient Chinese Character Recognition and Layout Analysis via VLM Fine-Tuning and Linguistic Post-Processing
- Qi Fan, Jieming Hu, Chen Ye, A Dual-Modality Framework for Ancient Document Layout Analysis and Text Recognition
- Chenrui Zheng, EvaHan 2026 Ancient Books Multimodal OCR and Layout Analysis System Technical Report
- Yuchun Meng, A Parameter-Efficient and Data-Centric Framework for Ancient Chinese Text
- Xia Tian, Liu Yulong, Wang Yilin, Yang Yumeng, Cai Dongheng, Tan Yuyang,Yang Menghui, LVLM Optimization for Ancient Chinese Book Image Analysis with Task-specific Augmentation and Instruction Tuning
- Chengfei Li, Yunjie Zhang, Xiaoyi Li, Changshun Quan, Taihe Cao, Bin Liu, Data-Centric Strategies for Ancient Chinese Text Recognition: Augmentation, Annotation Refinement, and Style Transfer in EvaHan 2026
- Colin Brisson, Ayoub Kahfy, Frédéric Constant, Marc Bui, AnandaSky: A Vision–Language Model for Line-Level Transcription of Historical Sinographic Documents
- Liqi He, Qiwei Li, Ziye Yang, Zuchao Li, Multimodal Ancient Document Parsing: Technical Report for EvaHan2026 Competition
- Huizi Zhou, Yuhan Shu, Multi-Task Learning Trade-offs in Vision–Language Models for Ancient Chinese OCR: An Empirical Analysis of Parameter-Efficient Adaptation
- Denise Atzori, Marie Bizais-Lillig, Mathias Garnier, Maxime Létoffé, Charles Planque, Tianjie Yin, Chahan Vidal-Gorène, Building Character(s): Synthetic Data and In-Context Learning Strategies for Few-Shot Ancient Chinese Recognition
- Closing Remarks
15:15-16:00 Oral Session 3
- 15:15-15:30 - Lucas Consolin Dezotti, Marco Passarotti, Federica Iurescia and Giovanni Moretti, The UD_Latin-PROIEL as Linked Open Data: Integrating a Latin Treebank into the LiLa Knowledge Base
- 15:30-15:45 - Shibingfeng Zhang, Edoardo Caraffa, Annafelicia Zuffrano, Maddalena Modesti and Giovanni Colavizza, Language Models for the Restoration of Latin Legal Manuscripts
- 15:45-16:00 - Luca Brigada Villa, Marco Passarotti, Chiara Zanchi, Riccardo Ginevra, Erica Fratellini and Eleonora Litta, Evaluating Hierarchical Aggregation and LLM-Based Matching for Synset Selection in Ancient Greek
16:00-17:00 Poster Session + Coffee Break
- Thomas Koppens and Claudia Borg, Miktub: A Manuscript Dataset of Historical Maltese for Handwritten Text Recognition
- Teresa Paccosi and Marijn Koolen, Smelling the Past: Investigating Historical Models for Olfactory Event Extraction
- Heiki-Jaan Kaalep, Contemporizing 20-th Century Estonian
- Charlene Ellul, Vanessa Buhagiar, Claudia Borg and Charlie Abela, Cost-Aware Pre-Annotation Strategies for Nested NER in Historical Latin Notarial Deeds
- Paola Marongiu and Eva Sassolini, From Lemmatization to Legal Terminology: Assessing an Hybrid Pipeline on Justinian’s Digest
- Sara Stymne, UppsalaNLP at EvaLatin 2026: Multilingual parsing for Latin
- Maria Mihaela Trusca, Mark Depauw, Violet Soen, Ine de Daele, Kevin Verbruggen and Tim Van de Cruys, Contextual Probing for Low-Resource Named Entity Recognition in Latin
- Luisa Ripoll-Alberola, Classificatio Sine Iactu – That Is, Zero-Shot NERC in Latin
- Hiroshi Matsuda, Extending omnes flores for the EvaLatin 2026 Dependency Parsing Tasks
17:00-17:45 Oral Session 4
- 17:00-17:15 - Thibault Clérice, Rachel Bawden, Anthony Glaise, Ariane Pinche and David Smith, Pre-Editorial Normalization for Automatically Transcribed Medieval Manuscripts in Old French and Latin
- 17:15-17:30 - Pontus Henningsson, Eva Pettersson and Erik Lenas, OldBERTur: Named Entity Recognition For Medieval Icelandic
- 17:30-17:45 - Nasma Chaoui and Richard Khoury, Neural Machine Translation for Coptic-French: Strategies for Low-Resource Ancient Languages
17:45-18:00 Closing
Back to the Main Page