

What's New! What's Free!


Upcoming LDC Institute ~ Sociolinguistic Archive and Analysis Project
LDC Timeline ~ two decades of milestones
2012 LDC Survey Responses and Benefit Winner ~ congratulations to our benefit recipient
Spring 2012 LDC Data Scholarship Recipients! ~ student scholarship recipients
LDC Celebrates its 20th Anniversary! ~ 2012 marks our 20th year
LDC data on Blu-ray ~ select databases now on Blu-ray Disc
LDC and Social Networks ~ find us on Facebook, LinkedIn, and RSS
LDC Providing Guidelines ~ enhanced guidelines for submitting corpora for publication by LDC
LDC Data Sheets ~ concise descriptions of LDC projects, operations, and technical capabilities
What's New Archive

New Corpora

2005 NIST/USF Evaluation Resources for the VACE Program - Broadcast News ~ 60 hours of English broadcast news video data annotated for 2005 VACE tasks.
2009 CoNLL Shared Task Part 1 ~ Catalan, Czech, German and Spanish data used for 2009 CoNLL.
2009 CoNLL Shared Task Part 2 ~ Chinese and English data used for 2009 CoNLL.
USC-SFI MALACH Interviews and Transcripts English ~ 375 hours of interviews from 784 interviewees along with transcripts.
English Translation Treebank: An-Nahar Newswire ~ 599 newswire stories translated from Arabic to English and annotated for POS and syntactic structure.
New Corpora Archive

Employment at the LDC

ACL Anthology ~ A Digital Archive of Research Papers in Computational Linguistics

OLAC ~ Open Language Archives Community

Linguistic Resources
Linguistic Data Consortium

The Linguistic Data Consortium supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards.


LDC is supported in part by grant IRI-9528587 from the Information and Intelligent Systems division and grant 9982201 from the Human Computer Interaction Program of the National Science Foundation. LDC's corpus creation efforts are powered in part by Academic Equipment Grant 7826-990 237-US from Sun Microsystems.


Contact ldc@ldc.upenn.edu
Last modified: Sunday, 22-Apr-2012 12:08:58 EDT
© 1992-2010 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.