Linguistic Data Consortium Corpora

Notes: Registration is required to download any dataset, and additional user agreements may be required. Register and create a new account here. When creating the account, use "University of Arizona, Library System" as the organization. Our corpus administrator (S. Bosch) will authorize your account, and you will receive an email once your UA status is verified.

Contents: Language resources that support language-related education, research, and technology development, including lexicons, speech files, transcripts, and other text files, from 1999 to present.

Some of this data is also available in the Library on DVDs and CD-ROMs, which can be checked out for use on computers outside the Library. Search the Library's catalog using Linguistic Data Consortium as the author, or search by title if you know it.