Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
This Byte-Level Neural Error Correction Model for Icelandic is a fine-tuned byT5-base Transformer model for error correc…
This Byte-Level Neural Error Correction Model for Icelandic is a fine-tuned byT5-base Transformer model for error correction in natural language. It acts as a machine translation model in that it “translates” from deficient Icelandic to correct Icelandic. The model is an improved version of a previous model which is ac…
This Byte-Level Neural Error Correction Model for Icelandic is a fine-tuned byT5-base Transformer model for error correc…
This Byte-Level Neural Error Correction Model for Icelandic is a fine-tuned byT5-base Transformer model for error correction in natural language. It acts as a machine translation model in that it “translates” from deficient Icelandic to correct Icelandic. The model is an improved version of a previous model which is ac…
The item contains a list of manually corrected items in a student thesis from https://skemman.is. The list is a product …
The item contains a list of manually corrected items in a student thesis from https://skemman.is. The list is a product of student theses testing, carried out as a part of developing a spell and grammar checker, GreynirCorrect (http://hdl.handle.net/20.500.12537/174). A thesis from https://skemman.is, which is publishe…
The Pronunciation Dictionary for Icelandic is a part of the Hjal-project and includes between 50 and 60 thousand phoneti…
The Pronunciation Dictionary for Icelandic is a part of the Hjal-project and includes between 50 and 60 thousand phonetically written word forms. The list of word forms is coined from many resources such as the newspaper Morgunblaðið, newly-published novels and the speech corpus of Ístal. [ICELANDIC] Framburðarorða…
Three dev/test sets for MT quality estimation created from subcorpora of ParIce. The dev/test sets contain English-Icela…
Three dev/test sets for MT quality estimation created from subcorpora of ParIce. The dev/test sets contain English-Icelandic segment pairs. One of the three sets is made up of subtitle segments from OpenSubtitles, one of segments from drug descriptions distributed by the European Medical Agency (EMA) and one from EEA d…
The Icelandic L2 Error Corpus (IceL2EC) is a collection of texts in modern Icelandic, written by learners of Icelandic a…
The Icelandic L2 Error Corpus (IceL2EC) is a collection of texts in modern Icelandic, written by learners of Icelandic as a second language. They have been annotated for mistakes related to spelling, grammar, and other issues. Each mistake is marked according to error type using an error code, of which there are 253. T…
A list of automatically prepared word forms containing systematic errors along with their corrections. All words are non…
A list of automatically prepared word forms containing systematic errors along with their corrections. All words are nonwords, i.e. they do not exist in Icelandic. All of the word forms are created by making substitutions commonly made by Icelandic informants, wherein a letter is replaced with another letter which soun…
The general Icelandic pronunciation dictionary for ASR is based on the Pronunciation dictionary. The dictionary contains…
The general Icelandic pronunciation dictionary for ASR is based on the Pronunciation dictionary. The dictionary contains about 136,000 words, transcribed with the IPA. [ICELANDIC] Almenn framburðarorðabók fyrir talgreiningu er unnin upp úr Framburðarorðabókinni. Hún inniheldur sem stendur um 136 þúsund orð hljóðrituð með IPA.
The Icelandic Dyslexia Error Corpus (IceDEC) is a collection of texts in modern Icelandic, written by people who have be…
The Icelandic Dyslexia Error Corpus (IceDEC) is a collection of texts in modern Icelandic, written by people who have been diagnosed with dyslexia. They have been annotated for mistakes related to spelling, grammar, and other issues. Each mistake is marked according to error type using an error code, of which there are…
ENGLISH: IGC-Wiki is part of the IGC-Project (https://igc.arnastofnun.is) that aims to collect as much as possible of I…
ENGLISH: IGC-Wiki is part of the IGC-Project (https://igc.arnastofnun.is) that aims to collect as much as possible of Icelandic texts that can be published under an open or restricted licence. IGC-Wiki contains texts from the Icelandic Wikipedia website. The corpus is published in two versions. IGC-Wiki contains plain…