Description
Book SynopsisAutomatic language translation systems like those used by Google, have been revolutionized by recent advances in the methods used in statistical machine translation. This first textbook on the topic explains these innovations carefully and shows the reader, whether a student or a developer, how to build their own translation system.
Trade Review'Philipp Koehn has provided the first comprehensive text for the rapidly growing field of statistical machine translation. This book is an invaluable resource for students, researchers, and software developers, providing a lucid and detailed presentation of all the important ideas needed to understand or create a state-of-the-art statistical machine translation system.' Robert C. Moore, Principal Researcher, Microsoft Research
'The book primarily represents an ideal introduction to the field of statistical machine translation, but also tackles many of the recent results in this area. It is the product of the many years of both active research and extensive teaching of the author … Each chapter is additionally endowed with a summary, further reading and exercises, achieving thus completely the proposed goal of an accessible introduction to the statistical machine translation field. Apart from its formative role for beginners, the book also stands as a complete guide for researchers in a domain of high interest and rapid expansion … For all these reasons, this book should be welcomed as a highly valuable publication.' Zentralblatt MATH
'… Statistical Machine Translation provides an excellent synthesis of a vast amount of literature (the bibliography section takes up 45 double-column pages) and presents it in a well-structured and articulate way. Moreover, the book has been class-tested and contains a set of exercises at the end of each chapter, as well as numerous references to open source tools and resources which enable the diligent reader to build MT systems for any language pair.' Target: International Journal of Translation Studies
Table of ContentsPreface; Part I. Foundations: 1. Introduction; 2. Words, sentences, corpora; 3. Probability theory; Part II. Core Methods: 4. Word-based models; 5. Phrase-based models; 6. Decoding; 7. Language models; 8. Evaluation; Part III. Advanced Topics: 9. Discriminative training; 10. Integrating linguistic information; 11. Tree-based models; Bibliography; Author index; Index.