Search Criteria


Key Word Search by:

Organization Type

State or Jurisdiction

Congressional District


Division or Office

Grants to:

Date Range Start

Date Range End

  • Special Searches

    Product Type

    Media Coverage Type


Search Results

Grant number like: PX-50016-08

Permalink for this Search

Page size:
 1 items in 1 pages
Award Number Grant ProgramAward RecipientProject TitleAward PeriodApproved Award Total
Page size:
 1 items in 1 pages
PX-50016-08Preservation and Access: JISC/NEH Transatlantic Digitization Collaboration GrantsInternet ArchiveThe World Wide Web of Humanities4/1/2008 - 3/31/2009$106,395.00KristinCarpenterNegulescu   Internet ArchiveSan FranciscoCA94129-1711USA2008Interdisciplinary Studies, GeneralJISC/NEH Transatlantic Digitization Collaboration GrantsPreservation and Access10639501063950

Development of tools and methodologies for indexing and analyzing the textual parts of larger digital collections, more focused browsing ("crawling") of the Web, and unified access to data resources.

The Internet Archive (IA) houses one of the largest publicly accessible collections of digital artifacts in the world, with more than 110 billion Web captures that include content from more than 65 million Web sites in over 40 languages. In addition, IA maintains a collection of more than one million public domain texts, thousands of still and moving images, and numerous multimedia software titles. IA plays an active role in the development of digital archiving standards and open-source tools and participates in the Open Content Alliance and the International Internet Preservation Consortium. The English partners in the project are the Oxford Internet Institute, which specializes in researching the effects of the Web and technology on scholarship and teaching, and Hanzo, a company focused on advanced Web archiving technologies. The proposed project would use advanced hyperlink analysis and data mining to study how research in the digital humanities has been framed, funded, and implemented internationally over time. The resulting data would be archived in a specialized collection of several million Uniform Resource Identifiers that would be made available for future research and analysis. The collection would include a range of content such as digital humanities project Web sites and tools, portals, datasets, and research reports. In addition to the data archive outlining the development and current state of the humanities on the Web, the applicant would index the collection for full-text search, encode the indexed data, and make available an interface and tools for future research. All these resources would be freely available on the Web.