Spoken Language Corpus and Linguistic Informatics
Title | Spoken Language Corpus and Linguistic Informatics PDF eBook |
Author | Yuji Kawaguchi |
Publisher | John Benjamins Publishing |
Pages | 441 |
Release | 2006-01-01 |
Genre | Language Arts & Disciplines |
ISBN | 9027233179 |
Print restrictions: 10 pages can be printed at a time, with a maximum of 40 pages per session
Spoken Corpora and Linguistic Studies
Title | Spoken Corpora and Linguistic Studies PDF eBook |
Author | Tommaso Raso |
Publisher | John Benjamins Publishing Company |
Pages | 508 |
Release | 2014-11-14 |
Genre | Language Arts & Disciplines |
ISBN | 9027270031 |
The authors of this book share an interest in the importance of corpus compilation for the empirical study of human language and in the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory, together with a passionate belief in the central role of prosody for the analysis of speech. The book is organized into four sections (spoken corpora compilation; spoken corpora annotation; prosody; and syntax and information structure). Within them, the authors present innovative methodologies focused on the compilation of third-generation spoken corpora and on multilevel spoken corpora annotation and its functions, and they open a debate about the reference unit in the study of spoken language via information structure. The book is accompanied by a web site with a rich array of audio/video files, available at DOI: 10.1075/scl.61.media
Developing Linguistic Corpora
Title | Developing Linguistic Corpora PDF eBook |
Author | Martin Wynne |
Publisher | Oxbow Books Limited |
Pages | 100 |
Release | 2005 |
Genre | Language Arts & Disciplines |
ISBN |
A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.
Linguistic Informatics and Spoken Language Corpora
Title | Linguistic Informatics and Spoken Language Corpora PDF eBook |
Author | Yuji Kawaguchi |
Publisher | |
Pages | 446 |
Release | 2006 |
Genre | |
ISBN |
Corpus Linguistics: An Introduction
Title | Corpus Linguistics: An Introduction PDF eBook |
Author | Dash, Niladri Sekhar |
Publisher | Pearson Education India |
Pages | 208 |
Release | 2008 |
Genre | |
ISBN | 8131752623 |
Corpus Linguistics: An Introduction will appeal to a wide spectrum of scholars, researchers, and particularly to students of linguistics. It offers guidelines for the creation and usage of corpora in the form of empirical language databases with direct functional and theoretical interpretation of a natural language. Drawn from original research and written in an accessible language and style, this book will create avenues for further advancements in mainstream and applied linguistics and language technology.
Corpus-based Perspectives in Linguistics
Title | Corpus-based Perspectives in Linguistics PDF eBook |
Author | Yuji Kawaguchi |
Publisher | John Benjamins Publishing |
Pages | 464 |
Release | 2007 |
Genre | Language Arts & Disciplines |
ISBN | 9789027233189 |
UBLI has conducted field surveys since 2002 and built spoken language corpora for French, Spanish, Italian (Salentino dialect), Russian, Malaysian, Turkish, Japanese, and Canadian multilinguals. This volume features new research presented at the UBLI second workshop on Corpus Linguistics Research Domain, which was held on September 14, 2006. The first part, consisting of eleven presentations given at this workshop, covers a wide range of subjects within corpus-based research, such as dictionaries, linguistic atlases, dialects, translation, ancient texts, non-standard texts, sociolinguistics, second language acquisition, and natural language processing. The second part comprises ten additional contributions on both written and spoken corpora by the members and research assistants of UBLI.
History, Features, and Typology of Language Corpora
Title | History, Features, and Typology of Language Corpora PDF eBook |
Author | Niladri Sekhar Dash |
Publisher | Springer |
Pages | 311 |
Release | 2018-02-01 |
Genre | Language Arts & Disciplines |
ISBN | 9811074585 |
This book discusses key issues of corpus linguistics such as the definition of the corpus, the primary features of a corpus, and the utilization and limitations of corpora. It presents a unique classification scheme of language corpora to show how they can be studied from the perspective of genre, nature, text type, purpose, and application. A reference to parallel translation corpora is mandatory in the discussion of corpus generation, which the authors thoroughly address here, with a focus on Indian language corpora and English. The web-text corpus, a new development in corpus linguistics, is also discussed with elaborate reference to Indian web text corpora. The book also presents a short history of corpus generation and provides scenarios before and after the advent of computer-generated digital corpora. This book has several important features: it discusses many technical issues of the field in a lucid manner; contains extensive new diagrams and charts for easy comprehension; and presents discussions in simplified English to cater to the needs of non-native English readers. This is an important resource authored by academics who have many years of experience teaching and researching corpus linguistics. Its focus on Indian languages and on English corpora makes it applicable to students of graduate and postgraduate courses in applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.