|
What's New! What's Free!
Upcoming LDC Institute ~ Sociolinguistic Archive and Analysis Project
LDC Timeline ~ two decades of milestones
2012 LDC Survey Responses and Benefit Winner ~ congratulations to our benefit recipient
Spring 2012 LDC Data Scholarship Recipients! ~ student scholarship recipients
LDC Celebrates its 20th Anniversary! ~ 2012 marks our 20th year
LDC data on Blu-ray ~ select databases now on Blu-ray Disc
LDC and Social Networks ~ find us on Facebook, LinkedIn, and RSS
LDC Providing Guidelines ~ enhanced guidelines for submitting corpora for publication by LDC
LDC Data Sheets ~ concise descriptions of LDC projects, operations, and technical capabilities
What's New
Archive
New Corpora
2005 NIST/USF Evaluation Resources for the VACE Program - Broadcast News ~ 60 hours of English broadcast news video data annotated for 2005 VACE tasks.
2009 CoNLL Shared Task Part 1 ~ Catalan, Czech, German and Spanish data used for 2009 CoNLL.
2009 CoNLL Shared Task Part 2 ~ Chinese and English data used for 2009 CoNLL.
USC-SFI MALACH Interviews and Transcripts English ~ 375 hours of interviews from 784 interviewees along with transcripts.
English Translation Treebank: An-Nahar Newswire ~ 599 newswire stories translated from Arabic to English and annotated for POS and syntactic structure.
New Corpora Archive
Employment at the LDC
ACL Anthology ~ A Digital Archive of Research
Papers in Computational Linguistics
OLAC ~ Open Language Archives Community
|
|
Linguistic Resources
The Linguistic Data Consortium supports language-related education, research
and technology development by creating and sharing linguistic resources:
data, tools and standards.


LDC is supported in part by grant IRI-9528587 from the Information and Intelligent
Systems division and grant 9982201 from the Human Computer Interaction Program of the
National Science Foundation.
LDC's corpus creation efforts are powered in part by Academic Equipment Grant 7826-990
237-US from Sun Microsystems.
|
|