MuLa2022

Corpus of Contemporary Latgalian Texts 2022

The corpus comprises published Latgalian texts of various kinds (1988–2021), included in set proportions. Each text carries metadata on its author, place and year of publication, and text type and genre. The corpus was created by substantially expanding the earlier MuLa2012 corpus.

Citation
Publication
S. Martena, A. Briška, N. Naua
Latgaliešu valodas korpuss citu Eiropas mazāk lietoto valodu kontekstā: korpusa raksturojums, lietojums un potenciālā iespējošana
Letonica, pp. 208–224, 2022
Data
A. Briška, I. Ziņģe, R. Darģis, K. Pokratniece, S. Martena, A. Kļavinska, A. Juško-Štekele
Corpus of Contemporary Latgalian Texts 2022 (MuLa2022)
CLARIN-LV digital library, 2022
http://hdl.handle.net/20.500.12574/72
Corpus size: 2M words (2.8M tokens)
Development period: 2020–2022
Developers: Rezekne Academy of Technologies; Institute of Mathematics and Computer Science, University of Latvia
Funding: State Research Programme "Digital Resources for Humanities" (VPP-IZM-DH-2020/1-0001)
CLARIN: http://hdl.handle.net/20.500.12574/72