Welcome to the VLO!
Use the search bar below to start searching through hundreds of thousands of language resources, or continue to browse everything and use facets to narrow down to your area of interest or discover new resources.
See all records Learn more Take a quick tourUse the categories below to limit the search results to those matching the selected value(s).
These levels provide an indication of the degree to which resources and tools are publicly accessible. Please check the specific conditions on any resource or tool that you end up using.
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and cl…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and classify the language of each line as one of the 200 languages it knows and writes the results, one ISO 639-3 code per line, into file <outfile>. It can identify c. 3000 sentences per second using one c…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and cl…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and classify the language of each line as one of the 200 languages it knows and writes the results, one ISO 639-3 code per line, into file <outfile>. It can identify c. 3000 sentences per second using one c…
HeLI off-the-shelf language identifier with language models for 220 languages. # Performance It can identify c. 600-17…
HeLI off-the-shelf language identifier with language models for 220 languages. # Performance It can identify c. 600-1700 sentences (averaging c. 150 characters) per second from a file using one core and around 4,3 gigabytes of memory on a modern laptop. # Requirements Java The software has been created and tested o…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and cl…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and classify the language of each line as one of the 200 languages it knows and writes the results, one ISO 639-3 code per line, into file <outfile>. It can identify c. 3000 sentences per second using one c…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and cl…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and classify the language of each line as one of the 200 languages it knows and writes the results, one ISO 639-3 code per line, into file <outfile>. It can identify c. 3000 sentences per second using one c…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and cl…
HeLI off-the-shelf language identifier with language models for 200 languages. The program will read the <infile> and classify the language of each line as one of the 200 languages it knows and writes the results, one ISO 639-3 code per line, into file <outfile>. It can identify c. 3000 sentences per second using one c…
Wikipedia plain text data obtained from Wikipedia dumps with WikiExtractor in February 2018. The data come from all W…
Wikipedia plain text data obtained from Wikipedia dumps with WikiExtractor in February 2018. The data come from all Wikipedias for which dumps could be downloaded at [https://dumps.wikimedia.org/]. This amounts to 297 Wikipedias, usually corresponding to individual languages and identified by their ISO codes. Severa…
Jimmy Kalarriya; Don Namundja; Isaiah Nagurrgurrba and Reuben Brown discuss (Kunwinjku; English) Colin Simpson and Raymo…
Jimmy Kalarriya; Don Namundja; Isaiah Nagurrgurrba and Reuben Brown discuss (Kunwinjku; English) Colin Simpson and Raymond Giles' recordings at Gunbalanya; Howell Walker film footage and Frank Setzler photographs from 1948 at Injalak Arts and Crafts Centre, Gunbalanya on 2 August 2011. Recorded by Reuben Brown. '1948 P…
Jimmy Kalarriya discusses Colin Simpson 1948 recordings, gives translation of song texts for Reuben Brown, at Injalak Ar…
Jimmy Kalarriya discusses Colin Simpson 1948 recordings, gives translation of song texts for Reuben Brown, at Injalak Arts and Crafts Centre, Gunbalanya, 10 June 2012. Recorded by Reuben Brown. Comprises the full recordings 20120610-RB_01.WAV; 20120610-RB_v01.mp4; 20120610-RB_v02.mp4; 20120610-RB_v03.mp4; 20120610-RB_…