Corpus linguistics
Title | Corpus linguistics PDF eBook |
Author | Stefanowitsch, Anatol |
Publisher | Language Science Press |
Pages | 510 |
Release | 2020 |
Genre | Language Arts & Disciplines |
ISBN | 3961102244 |
Corpora are used widely in linguistics, but not always wisely. This book attempts to frame corpus linguistics systematically as a variant of the observational method. The first part introduces the reader to the general methodological discussions surrounding corpus data as well as the practice of doing corpus linguistics, including issues such as the scientific research cycle, research design, extraction of corpus data and statistical evaluation. The second part consists of a number of case studies from the main areas of corpus linguistics (lexical associations, morphology, grammar, text and metaphor), surveying the range of issues studied in corpus linguistics while at the same time showing how they fit into the methodology outlined in the first part.
Corpus Linguistics and Statistics with R
Title | Corpus Linguistics and Statistics with R PDF eBook |
Author | Guillaume Desagulier |
Publisher | Springer |
Pages | 359 |
Release | 2017-11-17 |
Genre | Computers |
ISBN | 3319645722 |
This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and text-based sensorimetrics. Chapters cover topics including corpus processing, frequencing data, and clustering methods. Case studies illustrate each chapter with accompanying data sets, R code, and exercises for use by readers. This book may be used in advanced undergraduate courses, graduate courses, and self-study.
Corpus Linguistics
Title | Corpus Linguistics PDF eBook |
Author | Tony McEnery |
Publisher | Cambridge University Press |
Pages | 311 |
Release | 2011-10-06 |
Genre | Language Arts & Disciplines |
ISBN | 1139502441 |
Corpus linguistics is the study of language data on a large scale - the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read and an extensive glossary provides easy access to definitions of technical terms used in the text.
Computational Methods for Corpus Annotation and Analysis
Title | Computational Methods for Corpus Annotation and Analysis PDF eBook |
Author | Xiaofei Lu |
Publisher | Springer |
Pages | 192 |
Release | 2014-07-08 |
Genre | Language Arts & Disciplines |
ISBN | 9401786453 |
In the past few decades the use of increasingly large text corpora has grown rapidly in language and linguistics research. This was enabled by remarkable strides in natural language processing (NLP) technology, technology that enables computers to automatically and efficiently process, annotate and analyze large amounts of spoken and written text in linguistically and/or pragmatically meaningful ways. It has become more desirable than ever before for language and linguistics researchers who use corpora in their research to gain an adequate understanding of the relevant NLP technology to take full advantage of its capabilities. This volume provides language and linguistics researchers with an accessible introduction to the state-of-the-art NLP technology that facilitates automatic annotation and analysis of large text corpora at both shallow and deep linguistic levels. The book covers a wide range of computational tools for lexical, syntactic, semantic, pragmatic and discourse analysis, together with detailed instructions on how to obtain, install and use each tool in different operating systems and platforms. The book illustrates how NLP technology has been applied in recent corpus-based language studies and suggests effective ways to better integrate such technology in future corpus linguistics research. This book provides language and linguistics researchers with a valuable reference for corpus annotation and analysis.
Statistics in Corpus Linguistics
Title | Statistics in Corpus Linguistics PDF eBook |
Author | Vaclav Brezina |
Publisher | Cambridge University Press |
Pages | 317 |
Release | 2018-09-20 |
Genre | Foreign Language Study |
ISBN | 1107125707 |
A comprehensive and accessible introduction to statistics in corpus linguistics, covering multiple techniques of quantitative language analysis and data visualisation.
A Practical Handbook of Corpus Linguistics
Title | A Practical Handbook of Corpus Linguistics PDF eBook |
Author | Magali Paquot |
Publisher | Springer Nature |
Pages | 686 |
Release | 2021-05-04 |
Genre | Philosophy |
ISBN | 3030462161 |
This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.
Quantitative Corpus Linguistics with R
Title | Quantitative Corpus Linguistics with R PDF eBook |
Author | Stefan Th. Gries |
Publisher | Routledge |
Pages | 257 |
Release | 2009-03-04 |
Genre | Education |
ISBN | 1135895600 |
The first textbook of its kind, Quantitative Corpus Linguistics with R demonstrates how to use the open source programming language R for corpus linguistic analyses. Computational and corpus linguists doing corpus work will find that R provides an enormous range of functions that currently require several programs to achieve – searching and processing corpora, arranging and outputting the results of corpus searches, statistical evaluation, and graphing.