The project Comparing Romance Languages through History (CoRaLHis): building a multilingual parallel diachronic corpus (13th-18th C.) aims to provide a thorough understanding of how languages diverge from each other over time in the field of Romance languages. In order to do so, it presents a consortium of already existing corpora, which are integrated into a multilingual digital corpus of comparable texts from the 13th to 18th C. for Romania’s three main areas (Gallo-, Italo- and Ibero-Romance), in order to allow a large-scale and fine-grained empirically analysis of linguistic change in the different Romance languages.
=== PARTNERS === [alphabetical order]
|BDSud(It), Biblioteca Digitale del Sud del Sud (italoromanzo)||[Southern Italian Dialects]|
|MIDIA, Morfologia dell'Italiano in DIAcronia||[Italian]|
|BFM, Base de Français Médieval||[French]|
|CICA, Corpus informatitzat del català antic||[Catalan]|
|OLDES - Hispanic Seminary of Medieval Studies||[Spanish]|
|Corpus Histórico do Português Tycho Brahe||[Portuguese]|
|Team Corpus of Swiss legal texts||[French (many texts), Italian (quite a lot), Rhaeto-Romance (very few)]|
|TMAO, Trésor Manuscrit Ancien Occitan||[Occitan]|
A formal agreement will be signed with the person in charge of each of the above-mentioned corpora in order to establish the terms and conditions of the collaboration, with a special regard on the legal issues derived form the use of texts.
After the first in-person meeting in Zurich in November 2019, a second (online, due to the pandemic situation) meeting took place in April 2020, where several details were settled on and a concrete action plan was agreed on by the partners.
For more specific information on the languages, genres and number of words of the corpus, see here.
Projet Emergence funded by the Idex Super de l'Alliance Sorbonne Université. More information here