An Open-Source Web-Based Tool for Resource-Agnostic Interactive Translation Prediction

Open access

Abstract

We present a web-based open-source tool for interactive translation prediction (ITP) and describe its underlying architecture. ITP systems assist human translators by offering context-based, computer-generated suggestions as they type. Most ITP systems in the literature are tightly coupled with a statistical machine translation system that has been adapted to provide the suggestions. Our system, by contrast, follows a resource-agnostic approach: suggestions are obtained from any unmodified black-box bilingual resource. This paper reviews our ITP method and describes the architecture of Forecat, a web tool, partly based on the recent technology of web components, that eases the use of our ITP approach in any web application requiring this kind of translation assistance. We also evaluate the performance of our method when an unmodified Moses-based statistical machine translation system is used as the bilingual resource.
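To make the black-box idea concrete, the following is a minimal sketch and not Forecat's actual implementation. It assumes, purely for illustration, that suggestions are produced by translating short source sub-segments with the bilingual resource and then offering those translations that match what the translator has started to type. The names `sub_segments`, `build_suggestions`, `suggest` and the toy dictionary `TOY_RESOURCE` are hypothetical; the only thing the resource is asked to do is map a source string to a target string.

```python
from typing import Callable, List

# Hypothetical toy bilingual resource (English -> Spanish). Any black box
# exposing "source string in, target string out" could take its place.
TOY_RESOURCE = {
    "my": "mi",
    "house": "casa",
    "my house": "mi casa",
    "is": "es",
    "big": "grande",
    "is big": "es grande",
}


def toy_translate(segment: str) -> str:
    """Black-box translation: returns "" when the resource has no answer."""
    return TOY_RESOURCE.get(segment.lower(), "")


def sub_segments(words: List[str], max_len: int) -> List[str]:
    """Return every contiguous word sequence of length 1..max_len."""
    return [
        " ".join(words[i:i + n])
        for n in range(1, max_len + 1)
        for i in range(len(words) - n + 1)
    ]


def build_suggestions(
    source: str, translate: Callable[[str], str], max_len: int = 2
) -> List[str]:
    """Translate each source sub-segment once and keep distinct results."""
    seen = set()
    candidates = []
    for segment in sub_segments(source.split(), max_len):
        target = translate(segment)
        if target and target not in seen:
            seen.add(target)
            candidates.append(target)
    return candidates


def suggest(candidates: List[str], typed_prefix: str, limit: int = 4) -> List[str]:
    """Offer candidates that start with the word prefix typed so far."""
    if not typed_prefix:
        return []
    prefix = typed_prefix.lower()
    matches = [c for c in candidates if c.lower().startswith(prefix)]
    # Assumed ranking: longer suggestions first, since they save more keystrokes.
    matches.sort(key=lambda c: -len(c))
    return matches[:limit]


candidates = build_suggestions("My house is big", toy_translate)
print(suggest(candidates, "m"))  # ['mi casa', 'mi']
print(suggest(candidates, "e"))  # ['es grande', 'es']
```

Because `build_suggestions` only calls `translate`, swapping the toy dictionary for a machine translation system or a translation memory would require no change to the suggestion logic, which is the property the resource-agnostic approach relies on.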


