
Corpus of Early Written Latvian Texts

The historical corpus consists of printed sources and manuscript transcripts from the 16th–18th cc., each included in toto. It supports studies in the history of the Latvian language, its lexis, morphology, and syntax, and serves as the basis of the ‘Historical Dictionary of Latvian (16th–17th cc.)’. To provide advanced search possibilities, the texts (mainly word roots) have been normalised into conventional modern spelling. The main principles are set out in the ‘Guidelines on the normalisation of early Latvian texts (16th–18th cc.) into modern spelling’.

Citation
Publication
E. Andronova, A. Fridenberga, L. Pretkalnina, R. Silina-Pinke, E. Skruzmane, A. Trumpa, P. Vanags
New Possibilities for Exploring Early Latvian Texts: Switching to the NoSketchEngine
Baltic Journal of Modern Computing, 12(4), 548-559, 2024
Data
E. Andronova, M. Baltiņa, A. Frīdenberga, N. Grūzītis, S. Ķauķīte, K. Pokratniece, L. Pretkalniņa, R. Siliņa-Piņķe, E. Skrūzmane, A. Spektors, M. Spektors, I. Štrausa, A. Trumpa, E. Trumpa, P. Vanags
Corpus of Early Written Latvian Texts (Senie)
CLARIN-LV digital library, 2025
http://hdl.handle.net/20.500.12574/90
Corpus size: 2M words (3M tokens)
Data period: 1507–1800
Development period: 2002–..
Developers: Latvian Language Institute, Faculty of Humanities, UL, Institute of Mathematics and Computer Science UL, Faculty of Humanities UL
Funding: State Research Programme "Digital Humanities" (VPP-IZM-DH-2022/1-0002); State Research Programme "Digital Resources for Humanities" (VPP-IZM-DH-2020/1-0001); State Culture Capital Foundation
Homepage: http://senie.korpuss.lv/
CLARIN: http://hdl.handle.net/20.500.12574/90
Other publications
E. Andronova
Short Texts in the Corpus of Early Written Latvian
2020
E. Andronova, R. Silina-Pinke, A. Trumpa, P. Vanags
The Electronic Historical Latvian Dictionary based on the Corpus of Early Written Latvian Texts
Acta Baltico-Slavica, 40, 2016
E. Andronova
The Corpus of Early Written Latvian: Current state and future tasks
University of Birmingham, UK, 2007