Skip Main Navigation
Page Content

Save This Event

Event Saved

SADiLaR GROBID-Dictionaries workshop (Pretoria)

South African Centre for Digital Language Resources (SADiLaR)

Friday, October 26, 2018 from 9:00 AM to 4:00 PM (SAST)

SADiLaR GROBID-Dictionaries workshop (Pretoria)

Ticket Information

Type Remaining End Quantity
SADiLaR GROBID-Dictionaries (Pretoria) 1 Ticket Ended Free  

Event Details

SADiLaR GROBID-Dictionaries workshop 

Over the last decade there has been a worldwide effort to make physical lexical resources, such as dictionaries, available in digital format. A large number of digitized lexical resources remain unexploited due to their unstructured content, and given their complexity, manually structuring such resources is a costly task. During this period a large number of natural language processing tools have become available to the lexicography community, but the usability of several e-lexicography tools represents a serious obstacle for researchers with little or no background in computer science.

GROBID-Dictionaries is the first machine learning infrastructure for automatically structuring digitised dictionaries, originally in PDF format, independently from the language or the lexicographic school or style. The system allows for the cascading extraction of lexical information using pre-trained models and the serialisation of each structuring level in a TEI-compliant output. The goal of the workshop is to get familiar with the training process of each model of the infrastructure and learn how to use them to drastically speed up the encoding of lexical samples in TEI.

Recommended reading

https://hal.archives-ouvertes.fr/hal-01508868v2/document
https://hal.archives-ouvertes.fr/hal-01708137v2/document

Participants

Lexicographers, librarians, and scholars involved in the digitisation of dictionaries and lexical resources are invited.

Workshop location

University of Pretoria 
Pretoria
Humanities building

Costs

Participation in the workshop is FREE.

NB Limited space available.

Coffee, tea and a light lunch will be provided.

Registration

Please register before or on 28 September 2018.

If you have any question please liaise with Roald Eiselen via e-mail - roald.eiselen@nwu.ac.za 

Phone (SADiLaR office): 018 285 2046 or Phone Ms Charmaine Jacobs: 076 529 7888


Have questions about SADiLaR GROBID-Dictionaries workshop (Pretoria)? Contact South African Centre for Digital Language Resources (SADiLaR)

Save This Event

Event Saved

When & Where


University of Pretoria
Humanities building
Pretoria, Gauteng 0002
South Africa

Friday, October 26, 2018 from 9:00 AM to 4:00 PM (SAST)


  Add to my calendar

Organizer

South African Centre for Digital Language Resources (SADiLaR)

The South African Centre for Digital Language Resources (SADiLaR) is a research infrastructure set up by the Department of Science and Technology (DST) as part of the South African Research Infrastructure Roadmap (SARIR).

  Contact the Organizer

Interested in hosting your own event?

Join millions of people on Eventbrite.

Please log in or sign up

In order to purchase these tickets in installments, you'll need an Eventbrite account. Log in or sign up for a free account to continue.