The Making and Breaking of Classification Models in Linguistics
Title | The Making and Breaking of Classification Models in Linguistics PDF eBook |
Author | Jane Klavan |
Publisher | Walter de Gruyter GmbH & Co KG |
Pages | 262 |
Release | 2024-06-04 |
Genre | Language Arts & Disciplines |
ISBN | 3110665182 |
The book provides a methodological blueprint for the study of constructional alternations – using corpus-linguistic methods in combination with different types of experimental data. The book looks at a case study from Estonian. This morphologically rich language is typologically different from Indo-European languages such as English. Corpus-based studies allow us to detect patterns in the data and determine what is typical in the language. Experiments are needed to determine the upper and lower limits of human classification behaviour. They give us an idea of what is possible in a language and show how human classification behaviour is susceptible to more variation than corpus-based models lead us to believe. Corpora and forced choice data tell us that when we produce language, we prefer one construction. Acceptability judgement data tell us that when we comprehend language, we judge both constructions as acceptable. The book makes a theoretical contribution to the what, why, and how of constructional alternations.
The Making and Breaking of Classification Models in Linguistics
Title | The Making and Breaking of Classification Models in Linguistics PDF eBook |
Author | Jane Klavan |
Publisher | Walter de Gruyter GmbH & Co KG |
Pages | 250 |
Release | 2024-06-04 |
Genre | Language Arts & Disciplines |
ISBN | 3110668467 |
The book provides a methodological blueprint for the study of constructional alternations – using corpus-linguistic methods in combination with different types of experimental data. The book looks at a case study from Estonian. This morphologically rich language is typologically different from Indo-European languages such as English. Corpus-based studies allow us to detect patterns in the data and determine what is typical in the language. Experiments are needed to determine the upper and lower limits of human classification behaviour. They give us an idea of what is possible in a language and show how human classification behaviour is susceptible to more variation than corpus-based models lead us to believe. Corpora and forced choice data tell us that when we produce language, we prefer one construction. Acceptability judgement data tell us that when we comprehend language, we judge both constructions as acceptable. The book makes a theoretical contribution to the what, why, and how of constructional alternations.
Breaking the Language Barrier: Demystifying Language Models with OpenAI
Title | Breaking the Language Barrier: Demystifying Language Models with OpenAI PDF eBook |
Author | Rayan Wali |
Publisher | Rayan Wali |
Pages | 301 |
Release | 2023-03-08 |
Genre | Computers |
ISBN |
Breaking the Language Barrier: Demystifying Language Models with OpenAI is an informative guide that covers practical NLP use cases, from machine translation to vector search, in a clear and accessible manner. In addition to providing insights into the latest technology that powers ChatGPT and other OpenAI language models, including GPT-3 and DALL-E, this book also showcases how to use OpenAI on the cloud, specifically on Microsoft Azure, to create scalable and efficient solutions.
Hands-On Large Language Models
Title | Hands-On Large Language Models PDF eBook |
Author | Jay Alammar |
Publisher | "O'Reilly Media, Inc." |
Pages | 449 |
Release | 2024-09-11 |
Genre | Computers |
ISBN | 1098150929 |
AI has acquired startling new language capabilities in just the past few years. Driven by the rapid advances in deep learning, language AI systems are able to write and understand text better than ever before. This trend enables the rise of new features, products, and entire industries. With this book, Python developers will learn the practical tools and concepts they need to use these capabilities today. You'll learn how to use the power of pre-trained large language models for use cases like copywriting and summarization; create semantic search systems that go beyond keyword matching; build systems that classify and cluster text to enable scalable understanding of large amounts of text documents; and use existing libraries and pre-trained models for text classification, search, and clusterings. This book also shows you how to: Build advanced LLM pipelines to cluster text documents and explore the topics they belong to Build semantic search engines that go beyond keyword search with methods like dense retrieval and rerankers Learn various use cases where these models can provide value Understand the architecture of underlying Transformer models like BERT and GPT Get a deeper understanding of how LLMs are trained Understanding how different methods of fine-tuning optimize LLMs for specific applications (generative model fine-tuning, contrastive fine-tuning, in-context learning, etc.)
Natural Language Processing
Title | Natural Language Processing PDF eBook |
Author | Yue Zhang |
Publisher | Cambridge University Press |
Pages | 487 |
Release | 2021-01-07 |
Genre | Computers |
ISBN | 1108349773 |
With a machine learning approach and less focus on linguistic details, this gentle introduction to natural language processing develops fundamental mathematical and deep learning models for NLP under a unified framework. NLP problems are systematically organised by their machine learning nature, including classification, sequence labelling, and sequence-to-sequence problems. Topics covered include statistical machine learning and deep learning models, text classification and structured prediction models, generative and discriminative models, supervised and unsupervised learning with latent variables, neural networks, and transition-based methods. Rich connections are drawn between concepts throughout the book, equipping students with the tools needed to establish a deep understanding of NLP solutions, adapt existing models, and confidently develop innovative models of their own. Featuring a host of examples, intuition, and end of chapter exercises, plus sample code available as an online resource, this textbook is an invaluable tool for the upper undergraduate and graduate student.
Handbook of Language Analysis in Psychology
Title | Handbook of Language Analysis in Psychology PDF eBook |
Author | Morteza Dehghani |
Publisher | Guilford Publications |
Pages | 650 |
Release | 2022-03-02 |
Genre | Social Science |
ISBN | 1462548431 |
Recent years have seen an explosion of interest in the use of computerized text analysis methods to address basic psychological questions. This comprehensive handbook brings together leading language analysis scholars to present foundational concepts and methods for investigating human thought, feeling, and behavior using language. Contributors work toward integrating psychological science and theory with natural language processing (NLP) and machine learning. Ethical issues in working with natural language data sets are discussed in depth. The volume showcases NLP-driven techniques and applications in areas including interpersonal relationships, personality, morality, deception, social biases, political psychology, psychopathology, and public health.
Computational Linguistics
Title | Computational Linguistics PDF eBook |
Author | Le-Minh Nguyen |
Publisher | Springer Nature |
Pages | 525 |
Release | 2020-07-01 |
Genre | Computers |
ISBN | 9811561680 |
This book constitutes the refereed proceedings of the 16th International Conference of the Pacific Association for Computational Linguistics, PACLING 2019, held in Hanoi, Vietnam, in October 2019. The 28 full papers and 14 short papers presented were carefully reviewed and selected from 70 submissions. The papers are organized in topical sections on text summarization; relation and word embedding; machine translation; text classification; web analyzing; question and answering, dialog analyzing; speech and emotion analyzing; parsing and segmentation; information extraction; and grammar error and plagiarism detection.