LT4HALA 2026
--Home-- --CFP-- --EvaLatin-- --EvaHan --EvaCun-- --Program-- --Organization--
CALL FOR PAPERS
- Website: https://circse.github.io/LT4HALA/
- Submission page: TBA
- Date: TBA
- Place: co-located with LREC 2026, May 11-16, Palma, Mallorca (Spain)
DESCRIPTION
LT4HALA 2026 is a one-day workshop that seeks to bring together scholars who are developing and/or are using Language Technologies (LTs) for historically attested languages, so to foster cross-fertilization between the Computational Linguistics community and the areas in the Humanities dealing with historical linguistic data, e.g. historians, philologists, linguists, archaeologists and literary scholars. LT4HALA 2026 follows LT4HALA 2020, 2022, 2024 that were organized in the context of LREC 2020, LREC 2022 and LREC-COLING 2024, respectively. Despite the current availability of large collections of digitized texts written in historical languages, such interdisciplinary collaboration is still hampered by the limited availability of annotated linguistic resources for most of the historical languages. Creating such resources is a challenge and an obligation for LTs, both to support historical linguistic research with the most updated technologies and to preserve those precious linguistic data that survived from past times.
Relevant topics for the workshop include, but are not limited to:
- handling spelling variation;
- detection and correction of OCR errors;
- creation and annotation of linguistic resources (both lexical and textual);
- deciphering;
- morphological/syntactic/semantic analysis of textual data;
- adaptation of tools to address diachronic/diatopic/diastratic variation in texts;
- teaching ancient languages with LTs;
- NLP-driven theoretical studies in historical linguistics;
- NLP-driven analysis of literary ancient texts;
- evaluation of LTs designed for historical and ancient languages;
- LLMs for the automatic analysis of ancient texts;
- the role of digital infrastructures, such as CLARIN, in supporting research based on language resources for historical and ancient languages.
SHARED TASKS
LT4HALA 2026 will also host:
- the 4th edition of EvaLatin, a campaign entirely devoted to the evaluation of NLP tools for Latin. This new edition will focus on two tasks: dependency parsing and named entity recognition. Dependency parsing will be based on the Universal Dependencies framework.
- the 4th edition of EvaHan, the campaign for the evaluation of NLP tools for Ancient Chinese. EvaHan 2026 will focus on Ancient Chinese OCR (Optical Character Recognition) Evaluation.
- the 2nd edition of EvaCun, the campaign for the evaluation of Ancient Cuneiform Languages, with a shared task on Akkadian to English Machine Translation.
SUBMISSIONS
Submissions should be 4 to 8 pages in length and follow the LREC stylesheet (see below). The maximum number of pages excludes potential ethics Statements and discussion on limitations, acknowledgements and references, as well as data and code availability statements. Appendices or supplementary material are not permitted during the initial submission phase, as papers should be self-contained and reviewable on their own.
Papers must be of original, previously unpublished work. Papers must be anonymized to support double-blind reviewing. Submissions thus must not include authors’ names and affiliations. The submissions should also avoid links to non-anonymized repositories: the code should be either submitted as supplementary material in the final version of the paper, or as a link to an anonymized repository (e.g., Anonymous GitHub or Anonym Share). Papers that do not conform to these requirements will be rejected without review.
Submissions should follow the LREC stylesheet, that is available on the LREC 2026 website on the Author’s kit page.
Each paper will be reviewed by three independent reviewers.
Accepted papers will appear in the workshop proceedings, which include both oral and poster papers in the same format. Determination of the presentation format (oral vs. poster) is based solely on an assessment of the optimal method of communication (more or less interactive), given the paper content.
As for EvaLatin and EvaHan, participants will be required to submit a technical report for each task (with all the related sub-tasks) they took part in. Technical reports will be included in the proceedings as short papers: the maximum length is 4 pages (excluding references) and they should follow the LREC 2026 official format. Reports will receive a light review (we will check for the correctness of the format, the exactness of results and ranking, and overall exposition). All participants will have the possibility to present their results at the workshop. Reports of the shared tasks are not anonymous.
WORKSHOP IMPORTANT DATES
- 24 February 2026: submissions due
- 17 March 2026: reviews due
- 20 March 2026: notifications to authors
- 27 March 2026: camera-ready (PDF) due
Identify, Describe and Share your LRs!
When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones).
Back to the Main Page