LitMāksla  Search Word Frequency List

"Literatūra un Māksla"

Corpus contains texts of the newspaper "Literatūra un Māksla", published from 1945 to 1995. The corpus is comprised of early printed source texts processed through optical character recognition (OCR), therefore, it contains a notable number of character recognition mistakes.

A. Baklāne, V. Saulespurēns, A. Ozols
"Literatūra un Māksla" (LitMāksla)
CLARIN-LV digital library, 2022
Corpus size 52.7M words (65.8M tokens)
Development period 2022
Developers National Library of Latvia
Funding State Research Programme "Digital Resources of the Humanities" (VPP-IZM-DH-2020/1-0001)