• Contact
  • Helpdesk
DARIAHDARIAHDARIAHDARIAH
  • About
    • DARIAH in a Nutshell
    • Mission & Vision
    • Organisation and Governance
    • Join DARIAH
    • History of DARIAH
    • Documents
    • Publications
  • Network
    • Members and Partners
    • Regional Hubs
    • People
  • Activities
    • Working Groups
    • Training and Education
    • Open Science
      • DARIAH Open
      • OpenMethods
      • Heritage Data Reuse Charter
    • Projects
    • DARIAH Theme
    • Impact Case Studies
  • Tools & Services
    • Tools and Services
    • Contributions
  • News & Events
    • News
    • Events Calendar
    • Annual Events
    • Newsletters

DARIAH-DE Topics Explorer

Home Tools & ServicesTools and Services

Exploring Topics and Contents in Text Collections

Contact Person: Sina Bock

Description

LDA (Latent Dirichlet Allocation) Topic Modeling is a method for analyzing the distribution of semantic word clusters, so-called “topics” in a text collection. It can be used for exploring the content of a corpus as well as generating content-related features for computational text classification. Topic modeling thereby relies on the analyzed texts themselves entirely; it does not use additional sources of information like dictionaries or external training data, making it largely independent from language and orthographic convention. It is based solely on a statistical analysis of symbol co-occurrence (on word level) that is translated into likely semantic relationships. Hence, topic modeling is, in terms of its requirements on text type and text quality, one of the more flexible methods in computational text analysis.

The Topicsexplorer is a beginner-oriented Software allowing interested researchers to experiment with topic modeling on their own computers, with their own text corpora. The entire necessary workflow from plain text to visualized results – some of them even visualized in interactive graphs – is done in a graphical user interface (GUI). The software thus allows users without programming skills to load a collection of plain-text or even xml files and to analyze it by means of the LDA algorithm. It is implemented as a stand-alone software that runs on common Windows, MacOS and Linux operating systems without installation. As a result, users can scan texts for re-occurring, semantically meaningful groups of words; they can explore how much each these semantic groups contribute to each text, and which texts probably share the same common themes and topics. For further analysis with other software results can be exported in the universally readable csv format.

The TopicsExplorer is primarily designed as an educational to for both classroom and self learner use. It allows users without prior knowledge or skill to quickly engage and experiment with topic modeling. Users can thereby without greater effort learn about the possibilities and limitations of the method.

Website: https://de.dariah.eu/en/web/guest/topicsexplorer

RECENT POSTS

  • DARIAH-EU Cooperating Partnership creates opportunities for Digital Hellenic Studies

    This summer, scholars from Princeton University (USA) will have the chance to

    14 March, 2023
  • Connecting Women Writers with Digital Tools: NEWW DARIAH WG Call for Webinar Presentations in 2023

    The NEWW DARIAH Working Group (Network of European Women Writers) focuses on

    8 March, 2023
  • 1st SSH Open Cluster Assembly Announced

    Since the end of the SSHOC project in April 2022, the SSHOC

    7 March, 2023
  • DARIAH Open Access Book Bursary: Meet the winner of the 2022 round

    DARIAH is pleased to announce the winner of its annual book bursary

    1 March, 2023
  • International Conference of the DARIAH-EU Working Group Theatralia: Performing Arts – Transitioning to the Digital Age

    The DARIAH-EU Working Group Theatralia is pleased to invite your submissions to

    13 February, 2023

TWITTER UPDATES

  • RT @DariahHr: Today is the last day of the 8th @HrvatskaIcarus Days programme. @DariahHr is a proud co-organizer of the conference, and we…yesterday
  • Earlier this year, @DARIAHeu joined PALOMERA,a new project seeking to understand why so few #OA funder policies inc… https://t.co/Hi0rLlcFxR3 days ago
  • Our colleagues in the Nordic countries have been running a series of workshops for the "Building and Linking Humani… https://t.co/HQfSSQnh3w4 days ago

Tags

ADHO DARIAH Annual Event DARIAH Annual Event 2017 DARIAH Theme DESIR DH H2020 project SSH Open Marketplace Training
Logo of DARIAH
Follow us on:  twitter   linkedin   youtube   flickr

Contact DARIAH

Email DARIAHinfo@dariah.eu

Privacy and Legal

  • Legal Notice
  • Privacy Notice

Quick Menu

  • DARIAH in a Nutshell
  • Members and Partners
  • Projects
  • Events Calendar
  • Helpdesk

Subscribe to our mailing list and newsletter

* = required field
Creative Commons Attribution (CC BY) licence
  • About
    • DARIAH in a Nutshell
    • Mission & Vision
    • Organisation and Governance
    • Join DARIAH
    • History of DARIAH
    • Documents
    • Publications
  • Network
    • Members and Partners
    • Regional Hubs
    • People
  • Activities
    • Working Groups
    • Training and Education
    • Open Science
      • DARIAH Open
      • OpenMethods
      • Heritage Data Reuse Charter
    • Projects
    • DARIAH Theme
    • Impact Case Studies
  • Tools & Services
    • Tools and Services
    • Contributions
  • News & Events
    • News
    • Events Calendar
    • Annual Events
    • Newsletters
DARIAH