LPT_teikas

Folk Legend Corpus of LPT

The corpus includes legends published in volumes 13, 14, and 15 of "Latvian Folk Tales and Legends" (1925–1937), compiled by Pēteris Šmits. The volumes were digitised in the late 1990s; in 2012 a revised version was produced and the German-language texts were prepared. The metadata were refined and a new corpus version was developed in 2024 and 2025.

Corpus size: 503k words (616k tokens)
Data period: 1925–1937
Development period: 1998–2025
Developers: Digital Humanities Center of the University of Latvia; Institute of Literature, Folklore and Art UL; Institute of Mathematics and Computer Science UL
Funding: State Research Programme "Towards Development of Open and FAIR Digital Humanities Ecosystem in Latvia" (VPP-IZM-DH-2022/1-0002)
Homepage: http://valoda.ailab.lv/folklora/pasakas/