Search Criteria


Key Word Search by:

Organization Type

State or Jurisdiction

Congressional District


Division or Office

Grants to:

Date Range Start

Date Range End

  • Special Searches

    Product Type

    Media Coverage Type


Search Results

Grant number like: HT-272570-20

Permalink for this Search

Page size:
 1 items in 1 pages
Award Number Grant ProgramAward RecipientProject TitleAward PeriodApproved Award Total
Page size:
 1 items in 1 pages
HT-272570-20Digital Humanities: Institutes for Advanced Topics in the Digital HumanitiesPrinceton UniversityNew Languages for NLP: Building Linguistic Diversity in the Digital Humanities9/1/2020 - 8/31/2024$239,983.00Natalia ErmolaevAndrew JancoPrinceton UniversityPrincetonNJ08540-5228USA2020Computational LinguisticsInstitutes for Advanced Topics in the Digital HumanitiesDigital Humanities23998302370340

an institute to help humanities scholars learn how to create linguistic data and apply statistical models to new languages.

Natural Language Processing (NLP) has revolutionized our ability to interpret texts at scale and is an essential tool for scholars in the digital humanities. However, only a small percentage of the world’s languages are supported by the major NLP libraries. The New Languages for NLP Institute will help scholars with expertise in less-resourced languages to create linguistic data and train NLP models for their languages. In three workshops, held at the Center for Digital Humanities at Princeton University in 2021-2022, participants will create linguistic data and train statistical language models for new languages. They will learn best practices in project and research data management. As an outcome of the project, participants will publish an open dataset in the standard Conference on Computational Natural Language Learning format as well as a trained language model that can be used for computational text analysis.