Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
Use the categories below to limit the search results to those matching the selected value(s).
These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
Wanca 2016 is a collection of web corpora in small Uralic languages. The collection is composed of 29 sentence corpora in different languages. The corpora have been collected from the Internet using the automated system developed in the Finno-Ugric Languages and the Internet project (SUKI) supported by the Kone foundat…
The VRT version of Wanca 2016 is a collection of web corpora in small Uralic languages. The collection is composed of 29 sentence corpora in different languages. The corpora have been collected from the Internet using the automated system developed in the Finno-Ugric Languages and the Internet project (SUKI) supported …
The Korp version of Wanca 2016 is a collection of web corpora in small Uralic languages. The collection is composed of 29 sentence corpora in different languages. The corpora have been collected from the Internet using the automated system developed in the Finno-Ugric Languages and the Internet project (SUKI) supported…
The Kven N-gram data set is work done by the Giellatekno and Divvun research groups, Department of Linguistics, UiT The Arctic University of Norway, as well as by members of the language community. In particular, Ciprian-Virgil Gerstenberger compiled the data set from the entire SIKOR Kven corpus version 2015-08-30. Th…
The SIKOR Kven free corpus is a monolingual text corpus of Kven that contains administrative, law, religious, non-fiction, fiction, and news texts. It is work done by the Giellatekno and Divvun research groups, Department of Linguistics, UiT The Arctic University of Norway, as well as by members of the language communi…
This resource contains n-grams - i.e. uni-, bi- and trigrams - from all books and newspapers that had been digitized at the National Library of Norway up to July 15 2022. The n-grams have been extracted from material consisting of approximately 610,000 books and 4,000,000 newspapers, amounting to a total of 138.5 bil…
This resource contains n-grams - i.e. unigrams, bigrams and trigrams - from all books and newspapers that had been digitized at the National Library of Norway up to July 2021. The n-grams have been extracted from material consisting of approximately 580,000 books and 3,400,000 newspapers, amounting to a total of 122 …
The Kven lemma frequency list is work done by the Giellatekno and Divvun research groups, Department of Linguistics, UiT The Arctic University of Norway, as well as by members of the language community. In particular, Ciprian-Virgil Gerstenberger compiled the list from the entire SIKOR Kven corpus version 2015-08-30. T…
The Norwegian Bokmål-Kven dictionary is the work done by Giellatekno, UiT The Arctic University of Norway, Kainun institutti, as well as by members of the language communities. In particular, the following colleagues have contributed to the creation of the resource: Terje Aronsen, Verena Schall, Eira Söderholm, Trond …
The Kven-Norwegian Bokmål dictionary is the work done by Giellatekno, UiT The Arctic University of Norway, Kainun institutti, as well as by members of the language communities. In particular, the following colleagues have contributed to the creation of the resource: Terje Aronsen, Verena Schall, Eira Söderholm, Trond …