Search Criteria

 






Key Word Search by:









Organization Type


State or Jurisdiction


Congressional District





help

Division or Office
help

Grants to:


Date Range Start


Date Range End


  • Special Searches




    Product Type


    Media Coverage Type








 


Search Results

Grant number like: HAA-271654-20

Permalink for this Search

1
Page size:
 1 items in 1 pages
Award Number Grant ProgramAward RecipientProject TitleAward PeriodApproved Award Total
1
Page size:
 1 items in 1 pages
HAA-271654-20Digital Humanities: Digital Humanities Advancement GrantsUniversity of California, BerkeleyMultilingual BookNLP: Building a Literary NLP Pipeline Across Languages9/1/2020 - 8/31/2025$324,874.00David Bamman   University of California, BerkeleyBerkeleyCA94704-5940USA2020Interdisciplinary Studies, GeneralDigital Humanities Advancement GrantsDigital Humanities32487402920540

The expansion of the BookNLP platform for studying the linguistic structure of textual materials to allow for the analysis of resources in Spanish, Japanese, Russian and German.

BookNLP (Bamman et al., 2014) is a natural language processing pipeline for reasoning about the linguistic structure of text of books, specifically designed for works of fiction. In addition to its pipeline of part-of-speech tagging, named entity recognition, and coreference resolution, BookNLP identifies the characters in a literary text, and represents them through the actions they participate in, the objects they possess, their attributes, and dialogue. The availability of this tool has driven much work in the computational humanities, especially surrounding character (Underwood et al., 2018; Kraicer and Piper, 2018; Dubnicek et al., 2018). At the same time, however, BookNLP has one major limitation: it currently only supports texts written in English. The goal of this project is to develop a version of BookNLP to support literature in Spanish, Japanese, Russian and German, and create a blueprint for others to develop it for additional languages in the future.