VVPP 

Corpus of the Tests of the State Language Proficiency Testing

The corpus comprises 900 Latvian language proficiency tests: 150 tests for each of the six proficiency levels (A1, A2, B1, B2, C1, C2). Error annotation has been performed on all texts.

Citation
Publication
I. Auzina, G. Klava, A. Lazareva, K. Levane-Petrova, B. Murniece-Buleva, S. Pavulena, A. Semjonova
Latviešu valodas prasmes kvalitāte: valsts valodas prasmes pārbaudes kārtotāju rezultāti
Latviešu valodas aģentūra, 2019
Data
I. Auziņa, R. Darģis, K. Levāne-Petrova, K. Pokratniece, D. Vēvere
Corpus of the Tests of the State Language Proficiency Testing (VVPP)
CLARIN-LV digital library, 2018
http://hdl.handle.net/20.500.12574/49
Corpus size: 150k tokens
Data period: 2016–2017
Development period: 2017–2018
Developers: Institute of Mathematics and Computer Science, University of Latvia
Funding: Latvian Language Agency, "Quality of the Latvian language: results of the state language proficiency test"
CLARIN: http://hdl.handle.net/20.500.12574/49
Other publications
R. Darģis, I. Auziņa, K. Levāne-Petrova
The Use of Text Alignment in Semi-Automatic Error Analysis: Use Case in the Development of the Corpus of the Latvian Language Learners
2018