FullStack-LV 

Full Stack of Latvian Language Resources

The multilayer corpus is anchored in the following cross-lingual state-of-the-art representations: Universal Dependencies (UD), FrameNet, PropBank and Abstract Meaning Representation (AMR).

Citation
Publication
N. Gruzitis, L. Pretkalnina, B. Saulite, L. Rituma, G. Nespore-Berzkalne, A. Znotins, P. Paikens
Creation of a Balanced State-of-the-Art Multilayer Corpus for NLU
Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC), 2018
PDF
Data
N. Grūzītis, L. Pretkalniņa, B. Saulīte, L. Rituma, G. Nešpore-Bērzkalne, P. Paikens, I. Auziņa, A. Znotiņš, K. Levāne-Petrova, R. Darģis
Full Stack of Latvian Language Resources (FullStack-LV)
CLARIN-LV digital library, 2019
http://hdl.handle.net/20.500.12574/5
Corpus size 13691 sentences
Data period 1991–2018
Development period 2017–2019
Developers Institute of Mathematics and Computer Science UL
Funding European Regional Development Fund, "Full Stack of Language Resources for Natural Language Understanding and Generation in Latvian" (1.1.1.1/16/A/219); PostDoc grant No. 1.1.1.2/VIAA/1/16/118
CLARIN http://hdl.handle.net/20.500.12574/5
Other publications
N. Gruzitis, G. Nespore-Berzkalne, B. Saulite
Creation of Latvian FrameNet based on Universal Dependencies
Proceedings of the International FrameNet Workshop (IFNW), 2018
PDF
G. Nespore-Berzkalne, B. Saulite, N. Gruzitis
Latvian FrameNet: Cross-Lingual Issues
Human Language Technologies - The Baltic Perspective, IOS Press, 2018
N. Gruzitis, R. Dargis, L. Rituma, G. Nespore-Berzkalne, B. Saulite
Deriving a PropBank Corpus from Parallel FrameNet and UD Corpora
Proceedings of the International FrameNet Workshop 2020: Towards a Global, Multilingual FrameNet, 2020
PDF