Marianne Starlander


2023

pdf bib
Improving Standard German Captioning of Spoken Swiss German: Evaluating Multilingual Pre-trained Models
Jonathan David Mutal | Pierrette Bouillon | Johanna Gerlach | Marianne Starlander
Proceedings of Machine Translation Summit XIX, Vol. 2: Users Track

Multilingual pre-trained language models are often the best alternative in low-resource settings. In the context of a cascade architecture for automatic Standard German captioning of spoken Swiss German, we evaluate different models on the task of transforming normalised Swiss German ASR output into Standard German. Instead of training a large model from scratch, we fine-tuned publicly available pre-trained models, which reduces the cost of training high-quality neural machine translation models. Results show that pre-trained multilingual models achieve the highest scores, and that a higher number of languages included in pre-training improves the performance. We also observed that the type of source and target included in fine-tuning data impacts the results.

pdf bib
Evaluating a Multilingual Pre-trained Model for the Automatic Standard German captioning of Swiss German TV
Johanna Gerlach | Pierrette Bouillon | Silvia Rodríguez Vázquez | Jonathan Mutal | Marianne Starlander
Proceedings of the 8th edition of the Swiss Text Analytics Conference

2022

pdf bib
The PASSAGE project : Standard German Subtitling of Swiss German TV content
Pierrette Bouillon | Johanna Gerlach | Jonathan Mutal | Marianne Starlander
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation

We present the PASSAGE project, which aims at automatic Standard German subtitling of Swiss German TV content. This is achieved in a two step process, beginning with ASR to produce a normalised transcription, followed by translation into Standard German. We focus on the second step, for which we explore different approaches and contribute aligned corpora for future research.

2015

pdf bib
Let the EAGLES fly into new standards: Adapting our CAT tool evaluation methodology to the ISO 25000 series
Marianne Starlander
Proceedings of Translating and the Computer 37

2013

pdf bib
Training translation students to evaluate CAT tools using EAGLES: a case study
Marianne Starlander
Proceedings of Translating and the Computer 35

2009

pdf bib
Technology in Translator Training and tools for translators
Pierrette Bouillon | Marianne Starlander
Proceedings of Machine Translation Summit XII: Plenaries

pdf bib
Relating recognition, translation and usability of two different versions of MedSLT
Marianne Starlander | Paula Estrella
Proceedings of Machine Translation Summit XII: Posters

2008

pdf bib
Comparing two different bidirectional versions of the limited-domain medical spoken language translator MedSLT
Marianne Starlander | Pierrette Bouillon | Glenn Flores | Manny Rayner | Nikos Tsourakis
Proceedings of the 12th Annual Conference of the European Association for Machine Translation

pdf bib
The 2008 MedSLT System
Manny Rayner | Pierrette Bouillon | Jane Brotanek | Glenn Flores | Sonia Halimi | Beth Ann Hockey | Hitoshi Isahara | Kyoko Kanzaki | Elisabeth Kron | Yukie Nakao | Marianne Santaholma | Marianne Starlander | Nikos Tsourakis
Coling 2008: Proceedings of the workshop on Speech Processing for Safety Critical Translation and Pervasive Applications

pdf bib
A Small-Vocabulary Shared Task for Medical Speech Translation
Manny Rayner | Pierrette Bouillon | Glenn Flores | Farzad Ehsani | Marianne Starlander | Beth Ann Hockey | Jane Brotanek | Lukas Biewald
Coling 2008: Proceedings of the workshop on Speech Processing for Safety Critical Translation and Pervasive Applications

pdf bib
Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
Pierrette Bouillon | Sonia Halimi | Yukie Nakao | Kyoko Kanzaki | Hitoshi Isahara | Nikos Tsourakis | Marianne Starlander | Beth Ann Hockey | Manny Rayner
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

We describe recent work on MedSLT, a medium-vocabulary interlingua-based medical speech translation system, focussing on issues that arise when handling languages of which the grammar engineer has little or no knowledge. We show how we can systematically create and maintain multiple forms of grammars, lexica and interlingual representations, with some versions being used by language informants, and some by grammar engineers. In particular, we describe the advantages of structuring the interlingua definition as a simple semantic grammar, which includes a human-readable surface form. We show how this allows us to rationalise the process of evaluating translations between languages lacking common speakers, and also makes it possible to create a simple generic tool for debugging to-interlingua translation rules. Examples presented focus on the concrete case of translation between Japanese and Arabic in both directions.

pdf bib
Many-to-Many Multilingual Medical Speech Translation on a PDA
Kyoko Kanzaki | Yukie Nakao | Manny Rayner | Marianne Santaholma | Marianne Starlander | Nikos Tsourakis
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Government and Commercial Uses of MT

Particularly considering the requirement of high reliability, we argue that the most appropriate architecture for a medical speech translator that can be realised using today’s technology combines unidirectional (doctor to patient) translation, medium-vocabulary controlled language coverage, interlingua-based translation, an embedded help component, and deployability on a hand-held hardware platform. We present an overview of the Open Source MedSLT prototype, which has been developed in accordance with these design principles. The system is implemented on top of the Regulus and Nuance 8.5 platforms, translates patient examination questions for all language pairs in the set {English, French, Japanese, Arabic, Catalan}, using vocabularies of about 400 to 1 100 words, and can be run in a distributed client/server environment, where the client application is hosted on a Nokia Internet Tablet device.

2007

pdf bib
Les ellipses dans un système de traduction automatique de la parole
Pierrette Bouillon | Manny Rayner | Marianne Starlander | Marianne Santaholma
Actes de la 14ème conférence sur le Traitement Automatique des Langues Naturelles. Posters

Dans tout dialogue, les phrases elliptiques sont très nombreuses. Dans cet article, nous évaluons leur impact sur la reconnaissance et la traduction dans le système de traduction automatique de la parole MedSLT. La résolution des ellipses y est effectuée par une méthode robuste et portable, empruntée aux systèmes de dialogue homme-machine. Cette dernière exploite une représentation sémantique plate et combine des techniques linguistiques (pour construire la représentation) et basées sur les exemples (pour apprendre sur la base d’un corpus ce qu’est une ellipse bien formée dans un sous-domaine donné et comment la résoudre).

pdf bib
A Bidirectional Grammar-Based Medical Speech Translator
Pierrette Bouillon | Glenn Flores | Marianne Starlander | Nikos Chatzichrisafis | Marianne Santaholma | Nikos Tsourakis | Manny Rayner | Beth Ann Hockey
Proceedings of the Workshop on Grammar-Based Approaches to Spoken Language Processing

pdf bib
Using a Wizard of Oz as a Baseline to Determine which System Architecture is the Best for a Spoken Language Translation System
Marianne Starlander
Proceedings of the 16th Nordic Conference of Computational Linguistics (NODALIDA 2007)

2006

pdf bib
Evaluating Task Performance for a Unidirectional Controlled Language Medical Speech Translation System
Nikos Chatzichrisafis | Pierrette Bouillon | Manny Rayner | Marianne Santaholma | Marianne Starlander | Beth Ann Hockey
Proceedings of the First International Workshop on Medical Speech Translation

pdf bib
MedSLT: A Limited-Domain Unidirectional Grammar-Based Medical Speech Translator
Manny Rayner | Pierrette Bouillon | Nikos Chatzichrisafis | Marianne Santaholma | Marianne Starlander | Beth Ann Hockey | Yukie Nakao | Hitoshi Isahara | Kyoko Kanzaki
Proceedings of the First International Workshop on Medical Speech Translation

pdf bib
Une grammaire partagée multitâche pour le traitement de la parole : application aux langues romanes [A multitask shared grammar for speech processing: application to romance languages]
Pierrette Bouillon | Manny Rayner | Bruna Novellas | Marianne Starlander | Marianne Santaholma | Yukie Nakao | Nikos Chatzichrisafis
Traitement Automatique des Langues, Volume 47, Numéro 3 : Varia [Varia]

pdf bib
Une grammaire multilingue partagée pour la traduction automatique de la parole
Pierrette Bouillon | Manny Rayner | Bruna Novellas | Yukie Nakao | Marianne Santaholma | Marianne Starlander | Nikos Chatzichrisafis
Actes de la 13ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs

Aujourd’hui, l’approche la plus courante en traitement de la parole consiste à combiner un reconnaisseur statistique avec un analyseur robuste. Pour beaucoup d’applications cependant, les reconnaisseurs linguistiques basés sur les grammaires offrent de nombreux avantages. Dans cet article, nous présentons une méthodologie et un ensemble de logiciels libres (appelé Regulus) pour dériver rapidement des reconnaisseurs linguistiquement motivés à partir d’une grammaire générale partagée pour le catalan et le français.

2005

pdf bib
Japanese Speech Understanding using Grammar Specialization
Manny Rayner | Nikos Chatzichrisafis | Pierrette Bouillon | Yukie Nakao | Hitoshi Isahara | Kyoko Kanzaki | Beth Ann Hockey | Marianne Santaholma | Marianne Starlander
Proceedings of HLT/EMNLP 2005 Interactive Demonstrations

pdf bib
Practicing Controlled Language through a Help System integrated into the Medical Speech Translation System (MedSLT)
Marianne Starlander | Pierrette Bouillon | Nikos Chatzichrisafis | Marianne Santaholma | Manny Rayner | Beth Ann Hockey | Hitoshi Isahara | Kyoko Kanzaki | Yukie Nakao
Proceedings of Machine Translation Summit X: Papers

In this paper, we present evidence that providing users of a speech to speech translation system for emergency diagnosis (MedSLT) with a tool that helps them to learn the coverage greatly improves their success in using the system. In MedSLT, the system uses a grammar-based recogniser that provides more predictable results to the translation component. The help module aims at addressing the lack of robustness inherent in this type of approach. It takes as input the result of a robust statistical recogniser that performs better for out-of-coverage data and produces a list of in-coverage example sentences. These examples are selected from a defined list using a heuristic that prioritises sentences maximising the number of N-grams shared with those extracted from the recognition result.

pdf bib
A generic multi-lingual open source platform for limited-domain medical speech translation
Pierrette Bouillon | Manny Rayner | Nikos Chatzichrisafis | Beth Ann Hockey | Marianne Santaholma | Marianne Starlander | Yukie Nakao | Kyoko Kanzaki | Hitoshi Isahara
Proceedings of the 10th EAMT Conference: Practical applications of machine translation

2004

pdf bib
Comparing rule-based and statistical approaches to speech understanding in a limited domain speech translation system
Manny Rayner | Pierrette Bouillon | Beth Ann Hockey | Nikos Chatzichrisafis | Marianne Starlander
Proceedings of the 10th Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages

2002

pdf bib
Corpus-based Evaluation of a French Spelling and Grammar Checker
Marianne Starlander | Andrei Popescu-Belis
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)