MuLa2022  Search Word Frequency List

Corpus of Contemporary Latgalian Texts 2022

The corpus consists of certain proportions of various Latgalian published texts (1988–2021) with accompanying metadata about the author, place and year of the publication, as well as information about the type and genre of the text. The corpus was created by significantly expanding MuLa2012.

A. Briška, I. Ziņģe, R. Darģis, K. Pokratniece, S. Martena, A. Kļavinska, A. Juško-Štekele
Corpus of Contemporary Latgalian Texts 2022 (MuLa2022)
CLARIN-LV digital library, 2022
Corpus size 2M words (2.8M tokens)
Development period 2020–2022
Developers Rezekne Academy of Technologies, Institute of Mathematics and Computer Science UL
Funding State Research Programme "Digital Resources for Humanities" (VPP-IZM-DH-2020/1-0001)