Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
The resource is available in Kielipankki - The Language Bank of Finland and contains the following texts with paragraphs…
The resource is available in Kielipankki - The Language Bank of Finland and contains the following texts with paragraphs scrambled: suomentanut Kersti Juva: Ylpeys ja ennakkoluulo, Teos, 2013 Alkuteos Jane Austen, Pride and Prejudice suomentanut Kersti Juva: Washingtonin aukio, Otava, 2003 Alkuteos Henry James, Was…
Until November 2020, this corpus version is available via the LAT platform in Kielipankki - the Language Bank of Finland…
Until November 2020, this corpus version is available via the LAT platform in Kielipankki - the Language Bank of Finland (see Access location). IMPORTANT NOTICE: The LAT service of the Language Bank of Finland will be discontinued in November 2020, after which this corpus version can no longer be used. However, a down…
Iijoki-sarjan kuvaus löytyy sivulta http://urn.fi/urn:nbn:fi:lb-2019041401. Sarjan 26 kirjaa on jäsennetty Kielipankiss…
Iijoki-sarjan kuvaus löytyy sivulta http://urn.fi/urn:nbn:fi:lb-2019041401. Sarjan 26 kirjaa on jäsennetty Kielipankissa kahdella eri jäsentimellä. Tämä versio on jäsennetty Turku Neural Parser Pipeline (TNPP) -jäsentimellä. Se on Turun yliopistossa TurkuNLP-hankeessa kehitetty neuroverkkojäsennin, tarkemmat tiedot l…
This resource contains answers to the matriculation exam in Swedish (B syllabus). The corpus will be made available via…
This resource contains answers to the matriculation exam in Swedish (B syllabus). The corpus will be made available via Kielipankki – The Language Bank of Finland. For the time being, the corpus can only be accessed by the Digisvenska project team at the University of Helsinki, but when the preparation of the material…
This version of the The Magazine Corpus of the Institute for the Languages of Finland contains data, where the OCR (opti…
This version of the The Magazine Corpus of the Institute for the Languages of Finland contains data, where the OCR (optical character recognition) hasn't been checked. It contains different volumes of four magazines: Suomen Kuvalehti's volumes: one issue from 1916 'sample', 1917, 1925, 1935, 1945, 1955, 1965, 1972 (app…
The Helsinki Corpus of Scottish Correspondence comprises circa 0.4 million words (0.5 million tokens) of early Scottish …
The Helsinki Corpus of Scottish Correspondence comprises circa 0.4 million words (0.5 million tokens) of early Scottish correspondence by male and female writers dating from the period 1540-1750. Unlike the majority of digital resources available for historical linguistics at present, the corpus consists of transcripts…
The corpus, containing the articles from Svenska YLE https://svenska.yle.fi from 2012 onwards up to 2018 inclusive, is a…
The corpus, containing the articles from Svenska YLE https://svenska.yle.fi from 2012 onwards up to 2018 inclusive, is available at Korp. The licence is available at http://urn.fi/urn:nbn:fi:lb-2019120401
This resource is available for download in Kielipankki – the Language Bank of Finland. This is a parallel corpus create…
This resource is available for download in Kielipankki – the Language Bank of Finland. This is a parallel corpus created of the Yle news articles from 2014-2018 by aligning the standard Finnish versions with the easy-language versions. The dataset, created by Anna Dmitrieva and available in CSV format, is aligned on t…
The corpus, containing the articles from Svenska YLE https://svenska.yle.fi from 2012 onwards up to 2018 inclusive, is a…
The corpus, containing the articles from Svenska YLE https://svenska.yle.fi from 2012 onwards up to 2018 inclusive, is available at korp.csc.fi/download
The corpus is available for download in Kielipankki - the Language Bank of Finland. This dataset consists of the Yle Se…
The corpus is available for download in Kielipankki - the Language Bank of Finland. This dataset consists of the Yle Selkokieliset uutiset in Finnish (Yle Easy-to-read Finnish News). The dataset was created from the contents of the Yle News Archive for the language code "fi" for each month from the year 2011 to the ye…