Corpus of Contemporary Latgalian Texts 2012

The corpus brings together published Latgalian texts of various kinds (1988–2012) in set proportions. Each text is accompanied by metadata on the author and on the place and time of publication.

I. Sperga, K. Pokratniece, A. Briška
Corpus of Contemporary Latgalian Texts 2012 (MuLa2012)
CLARIN-LV digital library, 2013
Corpus size: 1M words (1.3M tokens)
Development period: 2011–2013
Developers: Institute of Mathematics and Computer Science UL, Rezekne Academy of Technologies
Funding: Latvian-Lithuanian Cross Border Cooperation program, “Development of Research Infrastructure for Education in the Humanities in Eastern Latvia and Lithuania” (HipiLatLit)