Title: Language and Speech Technologies for Basque in the Times of DNNs and LLMs
Date: Thursday, September 21
Time: 10:30 a.m. (MDT)
Location: In person, CCP 259 (City Center Plaza, Computer Science building downtown, 777 W Main St, Boise, ID 83702)
Bronco Shuttle Route: Orange Route – Heading downtown from main campus? Hop on the Orange Shuttle and avoid downtown parking! Use the Shuttle Tracker app to simplify your trip.
Event details available on our Colloquium webpage here: https://www.boisestate.edu/computing/event/igor-leturia-presents-language-and-speech-technologies-for-basque-in-the-times-of-dnns-and-llms/
Abstract: Basque has always had a very active presence in the field of language and speech technologies, despite being a minority and less-resourced language, The presentation will show the strategies used by a language of these characteristics throughout the different ages of NLP, focusing on current neural MT, ASR and TTS technology developed at Orai – Elhuyar, and also ongoing works and approaches in the newest LLM paradigm.
Speaker Bio: Igor Leturia holds Engineer degrees in Computer Science and Systems Electronics from the University of Mondragon and a degree in Computer Science from the Paul Sabatier University of Toulouse. He also holds a Master’s Degree and a Ph.D. in Language Technologies from the University of the Basque Country, for his thesis The Web as Corpus of Basque. Since 2003, he has been working at Elhuyar Fundazioa in the department of Research and Development in Language Technologies for Basque, of which he was the Director from 2008 to 2014, and since 2014 he has been the director of the Speech Technologies area. For years, his main research area has been text corpora and, in particular, the use of the web as a source of corpora. In recent years, he has been working on speech technologies, both ASR and TTS. In ASR, he has led the development of Aditu, an automatic transcription/subtitling service for the Basque language, which also allows the automatic translation of these transcriptions and subtitles into different languages. In the field of TTS, he has participated in the creation of neural TTS technology for Basque (monospeaker, multispeaker, translingual…). He is also involved in a project that aims to combine ASR, MT and TTS to provide automatic dubbing and speech-to-speech translation between Basque and other languages.
The Computing Colloquium is a research seminar presented by Boise State University’s Computing PhD program. Our Colloquium Committee members recruit speakers from all four Computing PhD emphasis areas–Computer Science, Computational Mathematics, Science and Engineering (CMSE), Cybersecurity, and Data Science. The Computing PhD program wishes to thank our current committee members for their collaboration and planning of the series: Dr. Liljana Babinkostova, Dr. Elena Sherman, Dr. Nasir Eisty and Dr. Rafael Ribas. We are also grateful to Boise State University’s Graduate College, College of Arts and Sciences and College of Engineering for their continued support of the Computing Colloquium seminar.