endorsed by
SIGUL
a joint
 
Special Interest Group on Under-resourced Languages

 

Programme

09:15-09:30
Welcome and Introduction
09:30-10:30

Keynote Talk
Laurent Besacier, Computational language documentation: some results from the BULB project

10:30-11:00
Coffee break
11:00-12:40
Morning Session
11:00-11:30
Mat Bettinson and Steven Bird, Image-Gesture-Voice: a Web Component for Eliciting Speech
11:30-11:50
David Nathan, Beyond Protocol: Indigenous Knowledge Resource Circulation in the Digital Age
11:50-12:20
Martin Benjamin, Hard Numbers: Language Exclusion in Computational Linguistics and Natural Language Processing
12:20-12:40
Amelie Dorn, Eveline Wandl-Vogt, Yalemisew Abgaz, Alejandro Benito Santos and Roberto Therón, Unlocking Cultural Conceptualisation in Indigenous Language Resources: Collaborative Computing Methodologies
12:40-14:00
Lunch break
14:00-15:00
Poster Session
Aalok Sathe, A Rule-Based System for the Transcription of Sanskrit from the Devanagari Orthography to the International Phonetic Alphabet
Aidan Pine and Mark Turin, Seeing the Heiltsuk Orthography from Font Encoding through to Unicode: a Case Study Using Convertextract
Mathias Coeckelbergs, Classifying and Searching Resource-Poor Languages more Efficiently. Using the FastText Word Embeddings for the Aramaic Language
Dmitri Dmitriev, Digitizing National Cuisines: Cooking Recipes as Conceptual Graphs
Amel Fraisse, Ronald Jenn and Shelley Fisher Fishkin, Building Multilingual Parallel Corpora for Under-Resourced Languages Using Translated Fictional Texts
Shweta Sinha and Shyam S. Agrawal, Sustaining Linguistic Diversity through Human Language Technology: a Case Study for Hindi
15:00-16:00
Afternoon Session 1
15:00-15:30
Dorothee Beermann, Lars Hellan and Tormod Haugland, Convergent Development of Digital Resources for West African Languages
15:30-16:00
Delyth Prys and Dewi Jones, Gathering Data for Speech Technology in the Welsh Language: a Case Study
16:00-16:30
Coffee break
16:30-17:30
Afternoon Session 2
16:30-16:50
László Grad-Gyenge and Linda Andersson, The MediaBubble Dataset: a Crowdsourcing Dataset for Topic Detection Tasks for the Hungarian Language
16:50-17:10
Adrian Doyle, John McCrae and Clodagh Downey, Preservation of Original Orthography in the Construction of an Old Irish Corpus
17:10-17:30
Alice Millour and Karën Fort, Krik: First Steps into Crowdsourcing POS tags for Kréyòl Gwadloupéyen
17:30-18:00
Discussion and Conclusions

 

CSS Valido!

 

Web Master and Content Manager: | Last Update: 27/04/2018
Copyright © 2017-2018 CCURL - All rights reserved | Web Host: Istituto di Linguistica Computazionale del CNR (Pisa, ITALY)