MuLa2022

Corpus of Contemporary Latgalian Texts 2022

The corpus was created by expanding MuLa2012 with texts from new sources edited according to the Latgalian orthography adopted in 2007.

Publication to be cited:
S. Martena, A. Briška, N. Naua
Latgaliešu valodas korpuss citu Eiropas mazāk lietoto valodu kontekstā: korpusa raksturojums, lietojums un potenciālā iespējošana
Letonica, 208-224, 2022
Corpus size: 2M words (2.8M tokens)
Development period: 2020–2022
Developers: Rezekne Academy of Technologies, Institute of Mathematics and Computer Science UL
Funding: State Research Programme "Digital Resources for Humanities" (VPP-IZM-DH-2020/1-0001)