
DARIAH partners up with Princeton University to deliver NLP Training for Humanists

By Eliza Papaki | News | October 30, 2020

The Center for Digital Humanities at Princeton University has recently received a grant from the US-based National Endowment for the Humanities to train humanities scholars in the development of natural language processing (NLP) tools for lesser-resourced languages.

The “New Languages for NLP” workshop series will be hosted at Princeton in 2021 and 2022 in cooperation with the Digital Research Infrastructure for Arts and Humanities (DARIAH) and the Library of Congress LC Labs.

“Humanities scholars do not only care about patterns and frequencies, we also care about what is unique, unusual and strange,” said DARIAH Director Toma Tasovac. “But discovering either (mediocrity or weirdness) in textual corpora is much more difficult if you don’t have access to NLP tools for the particular language variety you’re working on. Which, from the outset, puts some scholars, and some languages, at a great disadvantage.”

Participants will work over the course of a year, from June 2021 to May 2022, and will meet for three intensive workshops, where they will learn how to annotate linguistic data and train statistical language models using cutting-edge NLP tools.

They will also learn best practices in project and research data management as well as join discussions with leaders in the fields of multilingual NLP and DH. Furthermore, they will advance their own research projects by creating, employing and interrogating text-analysis tools and methods, while increasing much-needed linguistic diversity in the field of NLP.


“In addition to helping a number of researchers create and adopt NLP tools, we will be creating a body of knowledge and learning resources to share with the wider community via our educational portal DARIAH-Campus,” Tasovac said. “Knowledge is brittle and, especially in this age of information overload, it is essential that we capture it rather than let it get buried under an avalanche of digital noise. As a research infrastructure, we’re delighted that we can work together with Princeton on delivering, preserving and sustaining the outputs of this project.”


NLP has revolutionized our ability to analyze texts at scale. However, of the world’s more than 7,500 languages, the major NLP resources support only eighty-five. While large linguistic datasets exist for high-resource languages such as English or German, text mining, topic modeling and other methods of computational text analysis are unavailable for the vast majority of languages, especially those that are minority, regional or endangered.

“Humanities scholars are rightly suspicious of the so-called ‘black box’ tools: the kind of tools which work ‘automagically’ but which conceal their own methodology,” Tasovac said. “I very much hope that the participants will come out of our workshops not only with useful models for their own work, but with a better understanding of how NLP tools work, what their advantages are, as well as their limitations. I also hope that we’ll inspire more cooperation between humanities and NLP scholars in general. We would all have something to gain from that.”

A Call for Proposals will be published in early November. For more info, check out the project website.



Creative Commons Attribution (CC BY) licence