Finite-State Text Processing
Title | Finite-State Text Processing PDF eBook |
Author | Kyle Gorman |
Publisher | Morgan & Claypool Publishers |
Pages | 160 |
Release | 2021-05-26 |
Genre | Computers |
ISBN | 1636391141 |
Weighted finite-state transducers (WFSTs) are commonly used by engineers and computational linguists for processing and generating speech and text. This book first provides a detailed introduction to this formalism. It then introduces Pynini, a Python library for compiling finite-state grammars and for combining, optimizing, applying, and searching finite-state transducers. This book illustrates this library's conventions and use with a series of case studies. These include the compilation and application of context-dependent rewrite rules, the construction of morphological analyzers and generators, and text generation and processing applications.
Finite-State Text Processing
Title | Finite-State Text Processing PDF eBook |
Author | Kyle Gorman |
Publisher | Springer Nature |
Pages | 140 |
Release | 2022-06-01 |
Genre | Computers |
ISBN | 3031021797 |
Weighted finite-state transducers (WFSTs) are commonly used by engineers and computational linguists for processing and generating speech and text. This book first provides a detailed introduction to this formalism. It then introduces Pynini, a Python library for compiling finite-state grammars and for combining, optimizing, applying, and searching finite-state transducers. This book illustrates this library's conventions and use with a series of case studies. These include the compilation and application of context-dependent rewrite rules, the construction of morphological analyzers and generators, and text generation and processing applications.
Finite-state Language Processing
Title | Finite-state Language Processing PDF eBook |
Author | Emmanuel Roche |
Publisher | MIT Press |
Pages | 494 |
Release | 1997 |
Genre | Computers |
ISBN | 9780262181822 |
Finite-state devices, such as finite-state automata, graphs, and finite-state transducers, have been present since the emergence of computer science and are extensively used in areas as various as program compilation, hardware modeling, and database management. Although finite-state devices have been known for some time in computational linguistics, more powerful formalisms such as context-free grammars or unification grammars have typically been preferred. Recent mathematical and algorithmic results in the field of finite-state technology have had a great impact on the representation of electronic dictionaries and on natural language processing, resulting in a new technology for language emerging out of both industrial and academic research. This book presents a discussion of fundamental finite-state algorithms, and constitutes an approach from the perspective of natural language processing.
Finite-State Techniques
Title | Finite-State Techniques PDF eBook |
Author | Stoyan Mihov |
Publisher | Cambridge University Press |
Pages | 316 |
Release | 2019-08-01 |
Genre | Computers |
ISBN | 1108621139 |
Finite-state methods are the most efficient mechanisms for analysing textual and symbolic data, providing elegant solutions for an immense number of practical problems in computational linguistics and computer science. This book for graduate students and researchers gives complete coverage of the field, starting from a conceptual introduction and building to advanced topics and applications. The central finite-state technologies are introduced with mathematical rigour, ranging from simple finite-state automata to transducers and bimachines as 'input-output' devices. Special attention is given to the rich possibilities of simplifying, transforming and combining finite-state devices. All algorithms presented are accompanied by full correctness proofs and executable source code in a new programming language, C(M), which focuses on transparency of steps and simplicity of code. Thus, by enabling readers to obtain a deep formal understanding of the subject and to put finite-state methods to real use, this book closes the gap between theory and practice.
Natural Language Processing and Text Mining
Title | Natural Language Processing and Text Mining PDF eBook |
Author | Anne Kao |
Publisher | Springer Science & Business Media |
Pages | 272 |
Release | 2007-03-06 |
Genre | Computers |
ISBN | 1846287545 |
Natural Language Processing and Text Mining not only discusses applications of Natural Language Processing techniques to certain Text Mining tasks, but also the converse, the use of Text Mining to assist NLP. It assembles diverse views from internationally recognized researchers and emphasizes caveats in the attempt to apply Natural Language Processing to text mining. This state-of-the-art survey is a must-have for advanced students, professionals, and researchers.
Finite-state Methods and Natural Language Processing
Title | Finite-state Methods and Natural Language Processing PDF eBook |
Author | Jakub Piskorski |
Publisher | IOS Press |
Pages | 248 |
Release | 2009 |
Genre | Computers |
ISBN | 158603975X |
Contains papers that cover a range of Natural Language Processing (NLP) applications, including machine learning and translation, logic, computational phonology, morphology and semantics, data mining, information extraction and disambiguation, as well as programming, optimization and compression of finite-state networks.
Computational Linguistics and Intelligent Text Processing
Title | Computational Linguistics and Intelligent Text Processing PDF eBook |
Author | Alexander Gelbukh |
Publisher | Springer |
Pages | 619 |
Release | 2009-02-17 |
Genre | Computers |
ISBN | 3642003826 |
CICLing 2009 marked the 10th anniversary of the Annual Conference on Intelligent Text Processing and Computational Linguistics. The CICLing conferences provide a wide-scope forum for the discussion of the art and craft of natural language processing research as well as the best practices in its applications. This volume contains five invited papers and the regular papers accepted for oral presentation at the conference. The papers accepted for poster presentation were published in a special issue of another journal (see the website for more information). Since 2001, the proceedings of CICLing conferences have been published in Springer’s Lecture Notes in Computer Science series, as volumes 2004, 2276, 2588, 2945, 3406, 3878, 4394, and 4919. This volume has been structured into 12 sections: Trends and Opportunities; Linguistic Knowledge Representation Formalisms; Corpus Analysis and Lexical Resources; Extraction of Lexical Knowledge; Morphology and Parsing; Semantics; Word Sense Disambiguation; Machine Translation and Multilinguism; Information Extraction and Text Mining; Information Retrieval and Text Comparison; Text Summarization; Applications to the Humanities. A total of 167 papers by 392 authors from 40 countries were submitted for evaluation by the International Program Committee (see Tables 1 and 2). This volume contains revised versions of 44 papers, by 120 authors, selected for oral presentation; the acceptance rate was 26.3%.