Human Language Technology. Challenges for Computer Science and Linguistics

Title Human Language Technology. Challenges for Computer Science and Linguistics
Author Zygmunt Vetulani
Publisher Springer
Pages 449
Release 2018-06-15
Genre Computers
ISBN 3319937820

This book constitutes the refereed proceedings of the 7th Language and Technology Conference: Challenges for Computer Science and Linguistics, LTC 2015, held in Poznan, Poland, in November 2015. The 31 revised papers presented in this volume were carefully reviewed and selected from 108 submissions. The papers cover the following fields: Speech Processing; Multiword Expressions; Parsing; Language Resources and Tools; Ontologies and Wordnets; Machine Translation; Information and Data Extraction; Text Engineering and Processing; Applications in Language Learning; Emotions, Decisions and Opinions; Less-Resourced Languages.

Web Corpus Construction

Title Web Corpus Construction
Author Roland Schäfer
Publisher Morgan & Claypool Publishers
Pages 197
Release 2013-07-01
Genre Computers
ISBN 1627053123

The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several advantages to this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitude the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and removal of duplicated content. Linguistic processing, and the problems that the different kinds of noise in web corpora cause for it, are also covered. Finally, the authors show how web corpora can be evaluated and compared to other corpora (such as traditionally compiled corpora).

Human Language Technologies – The Baltic Perspective

Title Human Language Technologies – The Baltic Perspective
Author K. Muischnek
Publisher IOS Press
Pages 208
Release 2018-09-28
Genre Computers
ISBN 1614999120

Computational linguistics, speech processing, natural language processing and language technologies in general have all become increasingly important in an era of all-pervading technological development. This book, Human Language Technologies – The Baltic Perspective, presents the proceedings of the 8th International Baltic Human Language Technologies Conference (Baltic HLT 2018), held in Tartu, Estonia, on 27-29 September 2018. The main aim of Baltic HLT is to provide a forum for sharing new ideas and recent advances in computational linguistics and related disciplines, and to promote cooperation between the research communities of the Baltic States and beyond. The 24 articles in this volume cover a wide range of subjects, including machine translation, automatic morphology, text classification, various language resources, and NLP pipelines, as well as speech technology; the latter being the most popular topic with 8 papers. Delivering an overview of the state-of-the-art language technologies from a Baltic perspective, the book will be of interest to all those whose work involves language processing in whatever form.

Human Language Technologies – The Baltic Perspective

Title Human Language Technologies – The Baltic Perspective
Author I. Skadiņa
Publisher IOS Press
Pages 188
Release 2016-10-14
Genre Computers
ISBN 1614997012

Throughout the last decade, the Baltic states have played an active role in regional and international language technology activities, supporting less-resourced languages in the digital age. This book presents the proceedings of the 7th International Conference: Human Language Technologies – The Baltic Perspective (Baltic HLT 2016), held in Riga, Latvia, in October 2016. Baltic HLT 2016 provided a forum for sharing ideas and recent advances in human language processing with a special focus on less-resourced languages. Papers selected for the conference cover a wide range of topics, including a general overview of language technology progress in the Baltic states, current research topics in written and spoken language processing, the creation of language resources and their applications, and proposals for a European language platform. The book is divided into five sections: overview; speech technologies and corpora; machine translation; written language resources; and methods and tools for language processing. The book will be a useful resource, not only for Baltic language researchers, but also for those working with other less-resourced languages in Europe and beyond.

Human Language Technologies

Title Human Language Technologies
Author Inguna Skadina
Publisher IOS Press
Pages 264
Release 2010
Genre Computers
ISBN 1607506408

This book contains papers from the Fourth International Conference on Human Language Technologies - the Baltic Perspective (Baltic HLT 2010), held in Riga in October 2010. This conference is the latest in a series which provides a forum for sharing recent advances in human language processing, and promotes cooperation between the computer science and linguistics communities of the Baltic countries and the rest of the world. Bringing together scientists, developers, providers and users, the conference is an opportunity to exchange information, discuss problems, find new synergies, and promote i.

Natural Language Processing for Social Media

Title Natural Language Processing for Social Media
Author Atefeh Farzindar
Publisher Morgan & Claypool Publishers
Pages 242
Release 2017-12-15
Genre Computers
ISBN 1681733277

In recent years, online social networking has revolutionized interpersonal communication. The newer research on language analysis in social media has been increasingly focusing on the latter's impact on our daily lives, both on a personal and a professional level. Natural language processing (NLP) is one of the most promising avenues for social media data processing. It is a scientific challenge to develop powerful methods and algorithms which extract relevant information from a large volume of data coming from multiple sources and languages in various formats or in free form. We discuss the challenges in analyzing social media texts in contrast with traditional documents. Research methods in information extraction, automatic categorization and clustering, automatic summarization and indexing, and statistical machine translation need to be adapted to a new kind of data. This book reviews the current research on NLP tools and methods for processing the non-traditional information from social media data that is available in large amounts (big data), and shows how innovative NLP approaches can integrate appropriate linguistic information in various fields such as social media monitoring, healthcare, business intelligence, industry, marketing, and security and defence. We review the existing evaluation metrics for NLP and social media applications, and the new efforts in evaluation campaigns or shared tasks on new datasets collected from social media. Such tasks are organized by the Association for Computational Linguistics (such as SemEval tasks) or by the National Institute of Standards and Technology via the Text REtrieval Conference (TREC) and the Text Analysis Conference (TAC). In the concluding chapter, we discuss the importance of this dynamic discipline and its great potential for NLP in the coming decade, in the context of changes in mobile technology, cloud computing, virtual reality, and social networking. 
In this second edition, we have added information about recent progress in the tasks and applications presented in the first edition. We discuss new methods and their results. The number of research projects and publications that use social media data is constantly increasing due to continuously growing amounts of social media data and the need to automatically process them. We have added 85 new references to the more than 300 references from the first edition. Besides updating each section, we have added a new application (digital marketing) to the section on media monitoring and we have augmented the section on healthcare applications with an extended discussion of recent research on detecting signs of mental illness from social media.

Human Language Technologies – The Baltic Perspective

Title Human Language Technologies – The Baltic Perspective
Author A. Utka
Publisher IOS Press
Pages 280
Release 2020-09-30
Genre Computers
ISBN 1643681176

Human language technology is the study of the methods by which computer programs or electronic devices can analyze, produce, modify or respond to human texts and speech. It consists of natural language processing and computational linguistics on the one hand, and speech technology on the other. This book presents the proceedings of the 9th International Conference, Human Language Technologies – The Baltic Perspective (Baltic HLT 2020), organised in Kaunas, Lithuania on 22 and 23 September 2020. This biennial conference offers researchers a platform to share knowledge on recent advances in human language processing for the Baltic languages, as well as promoting interdisciplinary and international cooperation in human language-technology research within and beyond the Baltic States. In addition to the traditional topics of natural language processing and language technologies, this year’s conference featured a special session on resource and tool development for teaching and learning the less resourced Baltic languages. This year, 42 submissions were received, each of which was evaluated by two reviewers, resulting in a total of 34 papers being accepted for presentation and publication. The book is divided into four sections: speech and text analysis (9 papers); machine translation and natural understanding (6 papers); tools and resources (14 papers); and language learning resources (5 papers). Providing a fascinating overview of current research in the field from a primarily Baltic perspective, the book will be of interest to all those whose work involves human language technology.