Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
Use the categories below to limit the search results to those matching the selected value(s).
These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
“Contemporary dictionary of Latvian language” (MLVV), which is developed by the UL Latvian Language Institute, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as on last decade’s e…
Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains the data used to derive the corresponding version of the Latvian UD Treebank (UDLV-LVTB).
Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains the data used to derive the corresponding version of the Latvian UD Treebank (UDLV-LVTB).
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 400,000 entries based on 346 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and it is integrated with the Latvian W…
Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains the data used to derive the corresponding version of the Latvian UD Treebank (UDLV-LVTB).
The corpus consists of PhD theses and abstracts published at the University of Latvia, Riga Technical University, Riga Stradins University and Liepaja University until 2020.
LVBERT is the first publicly available monolingual BERT language model pre-trained for Latvian. For training, we used the original TensorFlow implementation of BERT with the whole-word masking and next sentence prediction objectives. We used the BERT-BASE configuration with 12 layers, 768 hidden units, 12 heads, 128 …
The corpus consists of all information published on Latvian Wikipedia until February 2022.
The corpus contains texts of the magazine "Karogs" from 1940 to 1994.
“Contemporary dictionary of Latvian language” (MLVV), which is developed by the UL Latvian Language Institute, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as on last decade’s e…