The following language resource repositories are powering the Virtual Language Observatory by kindly providing open and harvestable metadata records:

CLARIN centres

ASV Leipzig

Bayerisches Archiv für Sprachsignale

Berlin-Brandenburg Academy of Sciences and Humanities

CLARIN Centre of Latvian language resources and tools

CLARIN Knowledge Centre for Belarusian text and speech processing



CLARIN-PL Language Technology Centre

CLARIN.SI Language Technology Centre

CLARINO Bergen Center

CLARINO Text Laboratory Centre


Center of Estonian Language Resources

Centre for Language and Speech Technology

Centre for the Digital Foundation of Research in the Humanities, Social, and Educational Sciences

Collections de corpus oraux numeriques

DARIAH-DE Repository

Data Archiving and Networked Services

Eberhard Karls Universität Tübingen

Eurac Research CLARIN Centre

Georg Eckert Institute for International Textbook Research

Hamburger Zentrum für Sprachkorpora

Huygens ING


Institut für Maschinelle Sprachverarbeitung

Instituut voor de Nederlandse Taal


Language Archive Cologne

Leibniz-Institut für Deutsche Sprache

Lund University Humanities Lab

MPI for Psycholinguistics

Mediterranean Research Centre for the Humanities' Phonothèque

Meertens Instituut/HuC

National Library of Norway

Oxford Text Archive

PORTULAN CLARIN Research Infrastructure for the Science and Technology of Language

PolMine Project

South African Centre for Digital Language Resources

Speech & Language Data Repository

Språkbanken, The Swedish language bank

The CLARIN Centre at the University of Copenhagen

The ILC4CLARIN Centre at the Institute for Computational Linguistics

The Language Bank of Finland

Universität des Saarlandes

ZIM Centre for Information Modelling

Other metadata providers


CLLE ERSS Universite de Toulouse Le Mirail

California Language Archive

European Language Resources Association


e-codices - Virtual Manuscript Library of Switzerland

GEI historic German textbooks

Pacific And Regional Archive for Digital Sources in Endangered Cultures (PARADISEC)

The LDC Corpus Catalog

The LINGUIST List Language Resources

The Rosetta Project A Long Now Foundation Library of Human Language

University of the Basque Country

WALS Online & WALS RefDB