Corpora with tag speech (5)

LAMBA

Annotated Longitudinal Latvian Children's Speech Corpus

2015–2017, 34 hours
Developers: IMCS UL

LaRKo

Latvian Speech Corpus

2014, 8 hours
Developers: IMCS UL

LRK2013

Latvian Speech Recognition Corpus

2013, 100 hours (1.1M tokens)
Developers: IMCS UL, Tilde, LETA

LVMED

Latvian Radiology Speech Corpus

2022, 35 hours (157k tokens)
Developers: IMCS UL, REUH

Subtitri

Latvian Subtitles of Public Broadcasting

2020–2022, 1200 hours (10.8M tokens)
Developers: IMCS UL