Empirical Methods for Exploiting Parallel Texts
Title | Empirical Methods for Exploiting Parallel Texts PDF eBook |
Author | I. Dan Melamed |
Publisher | MIT Press |
Pages | 224 |
Release | 2001 |
Genre | Computers |
ISBN | 9780262133807 |
This book lays out the theory and the practical techniques for discovering and applying translational equivalence at the lexical level. Parallel texts (bitexts) are a goldmine of linguistic knowledge, because the translation of a text into another language can be viewed as a detailed annotation of what that text means. Knowledge about translational equivalence, which can be gleaned from bitexts, is of central importance for applications such as manual and machine translation, cross-language information retrieval, and corpus linguistics. The availability of bitexts has increased dramatically since the advent of the Web, making their study an exciting new area of research in natural language processing. This book lays out the theory and the practical techniques for discovering and applying translational equivalence at the lexical level. It is a start-to-finish guide to designing and evaluating many translingual applications.
Computational Linguistics and Intelligent Text Processing
Title | Computational Linguistics and Intelligent Text Processing PDF eBook |
Author | Alexander Gelbukh |
Publisher | Springer |
Pages | 778 |
Release | 2010-03-17 |
Genre | Computers |
ISBN | 3642121160 |
th CICLing 2010 was the 11 Annual Conference on Intelligent Text Processing and Computational Linguistics. The CICLing conferences provide a wide-scope forum for discussion of the art and craft of natural language processing research as well as the best practices in its applications. This volume contains three invited papers and the regular papers accepted for oral presentation at the conference. The papers accepted for poster pres- tation were published in a special issue of another journal (see information on thewebsite).Since 2001,theproceedingsofCICLingconferenceshavebeen p- lished in Springer’s Lecture Notes in Computer Science series, as volumes 2004, 2276, 2588, 2945, 3406, 3878, 4394, 4919, and 5449. The volume is structured into 12 sections: – Lexical Resources – Syntax and Parsing – Word Sense Disambiguation and Named Entity Recognition – Semantics and Dialog – Humor and Emotions – Machine Translation and Multilingualism – Information Extraction – Information Retrieval – Text Categorization and Classi?cation – Plagiarism Detection – Text Summarization – Speech Generation The 2010 event received a record high number of submissions in the - year history of the CICLing series. A total of 271 papers by 565 authors from 47 countriesweresubmittedforevaluationbytheInternationalProgramCommittee (see Tables 1 and 2). This volume contains revised versions of 61 papers, by 152 authors, selected for oral presentation; the acceptance rate was 23%.
Bibliography of Translation Studies: 2001
Title | Bibliography of Translation Studies: 2001 PDF eBook |
Author | Lynne Bowker |
Publisher | Routledge |
Pages | 93 |
Release | 2017-07-05 |
Genre | Language Arts & Disciplines |
ISBN | 1351573853 |
A volume of selected, annotated references arranged under specific headings to provide a non-partisan guide to teachers involved in designing courses in translation and/or interpreting.
Routledge Encyclopedia of Translation Technology
Title | Routledge Encyclopedia of Translation Technology PDF eBook |
Author | Chan Sin-wai |
Publisher | Routledge |
Pages | 958 |
Release | 2014-11-13 |
Genre | Foreign Language Study |
ISBN | 1317608143 |
The Routledge Encyclopedia of Translation Technology provides a state-of-the art survey of the field of computer-assisted translation. It is the first definitive reference to provide a comprehensive overview of the general, regional and topical aspects of this increasingly significant area of study. The Encyclopedia is divided into three parts: Part One presents general issues in translation technology, such as its history and development, translator training and various aspects of machine translation, including a valuable case study of its teaching at a major university; Part Two discusses national and regional developments in translation technology, offering contributions covering the crucial territories of China, Canada, France, Hong Kong, Japan, South Africa, Taiwan, the Netherlands and Belgium, the United Kingdom and the United States Part Three evaluates specific matters in translation technology, with entries focused on subjects such as alignment, bitext, computational lexicography, corpus, editing, online translation, subtitling and technology and translation management systems. The Routledge Encyclopedia of Translation Technology draws on the expertise of over fifty contributors from around the world and an international panel of consultant editors to provide a selection of articles on the most pertinent topics in the discipline. All the articles are self-contained, extensively cross-referenced, and include useful and up-to-date references and information for further reading. It will be an invaluable reference work for anyone with a professional or academic interest in the subject.
Bitext Alignment
Title | Bitext Alignment PDF eBook |
Author | Jörg Tiedemann |
Publisher | Morgan & Claypool Publishers |
Pages | 168 |
Release | 2011 |
Genre | Computers |
ISBN | 1608455106 |
This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques. Table of Contents: Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks
Machine Learning: ECML 2003
Title | Machine Learning: ECML 2003 PDF eBook |
Author | Nada Lavrač |
Publisher | Springer |
Pages | 521 |
Release | 2003-11-18 |
Genre | Computers |
ISBN | 3540398570 |
The proceedings of ECML/PKDD2003 are published in two volumes: the P- ceedings of the 14th European Conference on Machine Learning (LNAI 2837) and the Proceedings of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases (LNAI 2838). The two conferences were held on September 22–26, 2003 in Cavtat, a small tourist town in the vicinity of Dubrovnik, Croatia. As machine learning and knowledge discovery are two highly related ?elds, theco-locationofbothconferencesisbene?cialforbothresearchcommunities.In Cavtat, ECML and PKDD were co-located for the third time in a row, following the successful co-location of the two European conferences in Freiburg (2001) and Helsinki (2002). The co-location of ECML2003 and PKDD2003 resulted in a joint program for the two conferences, including paper presentations, invited talks, tutorials, and workshops. Out of 332 submitted papers, 40 were accepted for publication in the ECML2003proceedings,and40wereacceptedforpublicationinthePKDD2003 proceedings. All the submitted papers were reviewed by three referees. In ad- tion to submitted papers, the conference program consisted of four invited talks, four tutorials, seven workshops, two tutorials combined with a workshop, and a discovery challenge.
Semantic Domains in Computational Linguistics
Title | Semantic Domains in Computational Linguistics PDF eBook |
Author | Alfio Gliozzo |
Publisher | Springer Science & Business Media |
Pages | 138 |
Release | 2009-07-31 |
Genre | Language Arts & Disciplines |
ISBN | 3540681582 |
Semantic fields are lexically coherent – the words they contain co-occur in texts. In this book the authors introduce and define semantic domains, a computational model for lexical semantics inspired by the theory of semantic fields. Semantic domains allow us to exploit domain features for texts, terms and concepts, and they can significantly boost the performance of natural-language processing systems. Semantic domains can be derived from existing lexical resources or can be acquired from corpora in an unsupervised manner. They also have the property of interlinguality, and they can be used to relate terms in different languages in multilingual application scenarios. The authors give a comprehensive explanation of the computational model, with detailed chapters on semantic domains, domain models, and applications of the technique in text categorization, word sense disambiguation, and cross-language text categorization. This book is suitable for researchers and graduate students in computational linguistics.