The following parameters are used to select a sub-corpus or to segment the corpus into different sub-corpora (by genre, century, dialect, etc.).
Researchers can use other meta-data to refine their queries:
Author
Title
Date of composition
Date of manuscript
Date of the printed of the first edition
Geographical origin of the text
Bibliographic data
Reference of the manuscript
Size of the text (number of words)
Origin of the corpus, funding
PoS-TAGGING
List of morpho-syntactic tags for CoRaLHis
ADJ = Adjective
ADV = Adverb
CON = Conjunction
DET = Determiner
NOM = Noun
PRE = Preposition
PRO = Pronoun
VER = Verb
PUN = Punctuation
INJ = Interjection