From Àbèsàbèsì to XPath: An Overview of the Lexical Data Masterclass 2018

By Eliza Papaki | News | January 14, 2019

The 2018 edition of the Lexical Data Masterclass took place in Berlin, on December 3-7, at the Berlin Brandenburg Academy of Sciences (BBAW). Co-organized by DARIAH-EU, the BBAW, Inria and the Belgrade Center for Digital Humanities, with the support of CLARIN and the European Lexicographic Infrastructure (ELEXIS), LexMC2018 brought together advanced trainees with experts to share experiences, methods and techniques for the creation, management and use of lexical data.

The masterclass covered a number of topics ranging from general models for lexical content and TEI-based representation of lexical data to working efficiently with XML editors. By hosting different sessions, the aim was to provide participants the opportunity to touch upon multiple different topics, consult with experts on their own dictionary projects and get to know and test TEI Lex-0, a newly proposed baseline encoding for lexicographic data.

The projects presented during LexMC2018 are summarised below:

Specialized dictionaries

- Claudia Bonsi – Encoding an Italian meta-dictionary
- Marija Zarkovic – Spanish Legal Terms Through Time: Digitization
- Martin Wynne (Bodleian library, Oxford) – Enhancing a lexicon of variant word forms in seventeenth-century French
- Joanna Aleksandra Bilińska – Slavic Corpora Terminology Dictionary

From PDF to TEI using GROBID dictionary

- Nikolche Mickoski (Lexicographic Centre at Macedonian Academy of Sciences and Arts) – Using GROBID for OCR-ized multilingual dictionary
- Emrah Özcan (Yildiz Technical University) – Retro-digitizing Turkish dictionaries using GROBID-dictionaries and XSLT
- Biljana Lazić – Using GROBID-Dictionaries to encode the German-Serbian Mining Dictionary
- Marija Gmitrović (Institute of Serbian Language) – Using GROBID-Dictionaries to encode the Dictionary of the Jablanica Dialect

SSK update

- Lionel Tadjou (Inria, team ALMAnaCH) – Current state of the lexical scenario in the SSK

TEI based dictionary projects

- Boris Lehečka (Czech Language Institute) – Electronic Dictionary of Old Czech: TEI Modeling and Transforming

Language documentation projects

- Jonas Lau – Advancements in the Àbèsàbèsì Dictionary

From legacy formats and databases to TEI

- Fraser Dallachy – A TEI XML Version of the “Historical Thesaurus of English” Legacy Database
- Ana de Castro Salgado (FCSH, NOVA, CLUNL, Lisbon, Portugal / Academia das Ciências de Lisboa, Lisbon, Portugal) – Switching the Academy of Sciences Portuguese Dictionary to TEI Lex-0
- Zara Kancheva and Ivaylo Radev (IICT-BAS) – BTB-WordNet. From LMF to TEI with XSLT

If you would like to read in more detail these projects and their results visit the original post at the digilex website.

No tags.

Launch of the DARIAH Tools & Services Catalogue

By Amelia McConville

We are delighted to share the launch of the DARIAH Tools and Services catalogue after a lot of work from our team! You can now browse the DARIAH tools and services, access those from SSHRead more
Helsinki Digital Humanities Hackathon #DHH23

By Amelia McConville

Helsinki Digital Humanities Hackathon #DHH23 gathered students and researchers of humanities, social sciences, and computer science in May and June at the University of Helsinki. During a week and a half of intensive multi-disciplinary work,Read more
DARIAH Working Groups Funding Call 2023: Meet the winning projects

By Eliza Papaki

In late 2023, DARIAH launched the fourth Working Groups (WG) Funding Scheme Call for the years 2023-2025. This scheme is dedicated to – and open only for – the DARIAH Working Groups, and is intendedRead more
OSCARS launches its 1st Open Call for Open Science projects and services

By Eliza Papaki

On March 15th, the Science Clusters launched the first OSCARS cascading-grant call for Open Science projects and services. Over 300 attendees across and beyond Europe joined the online launch event and had the opportunity to interactRead more
DARIAH Signs New Cooperating Partnership Agreement with the University of Leeds

By Eliza Papaki

The Digital Research Infrastructure for the Arts and Humanities (DARIAH-EU) is proud to announce it has signed a Cooperating Partnership agreement with the University of Leeds. DARIAH is a European Research Infrastructure Consortium (ERIC) whoseRead more
DARIAH-CH Workshop “Creating workflows in the SSH Open Marketplace”

By Eliza Papaki

The Social Sciences and Humanities Open Marketplace (SSH Open Marketplace) – marketplace.sshopencloud.eu – is a discovery portal which pools and contextualises resources for Social Sciences and Humanities research communities: tools, services, training materials, datasets, publications andRead more
Job Opportunity: DARIAH’s Board of Directors is looking for a new member (0,5 FTE)

By Eliza Papaki

DARIAH, the Digital Research Infrastructure for the Arts and Humanities, is a European research infrastructure, which aims to enhance and support digitally-enabled research and teaching across the arts and humanities. DARIAH’s mission is to empowerRead more