Linguistic Corpora and Big Data in Spanish and Portuguese

Title Linguistic Corpora and Big Data in Spanish and Portuguese
Author Miguel Calderón Campos, Gael Vaamonde
Publisher Walter de Gruyter GmbH & Co KG
Pages 260
Release 2024-06-29
ISBN 3110781522

Linguistic Corpora and Big Data in Spanish and Portuguese

Title Linguistic Corpora and Big Data in Spanish and Portuguese
Author Miguel Calderón Campos
Publisher Walter de Gruyter GmbH & Co KG
Pages 238
Release 2024-10-21
Genre Language Arts & Disciplines
ISBN 3110781468

In recent decades, corpus linguistics has developed tremendously in the Hispanic world along two opposite but complementary lines: an increase in corpus size (corpus linguistics as Big Data) and an improvement in document selection and data annotation (corpus linguistics as High Quality Data). The first approach has led to the creation of massive corpora such as EsTenTen and, at the same time, has promoted the use of the web and social networks as corpora. The second has given rise to specialized corpora such as Post Scriptum and Oralia Diacrónica del español (ODE). The contributions gathered in this volume combine both methods in order to exploit their advantages and overcome their possible limitations. On the one hand, the volume addresses the creation and design of small corpora focused on data quality; on the other hand, it offers case studies that draw on both specialized corpora and massive data extracted from the web. The central idea of the book is that the two methods are complementary.

Linguistic Corpora and Big Data in Spanish and Portuguese

Title Linguistic Corpora and Big Data in Spanish and Portuguese
Author Miguel Calderón Campos
Release 2024
ISBN 9783110781458

Exploring Linguistic Science

Title Exploring Linguistic Science
Author Allison Burkette
Pages 253
Release 2018-03-15
Genre Language Arts & Disciplines
ISBN 1108424805

Introduces students to the scientific study of language, using the basic principles of complexity theory.

Information Management and Big Data

Title Information Management and Big Data
Author Juan Antonio Lossio-Ventura
Publisher Springer
Pages 400
Release 2019-02-07
Genre Computers
ISBN 3030116808

This book constitutes the refereed proceedings of the 5th International Conference on Information Management and Big Data, SIMBig 2018, held in Lima, Peru, in September 2018. The 34 papers presented were carefully reviewed and selected from 101 submissions. The papers address topics such as data mining, artificial intelligence, natural language processing, information retrieval, machine learning, and web mining.

Advances in Big Data and Cloud Computing

Title Advances in Big Data and Cloud Computing
Author J. Dinesh Peter
Publisher Springer
Pages 575
Release 2018-12-12
Genre Technology & Engineering
ISBN 9811318824

This book is a compendium of the proceedings of the International Conference on Big Data and Cloud Computing. It covers recent advances in big data analytics, cloud computing, the internet of nano things, cloud security, data analytics in the cloud, and smart cities and grids, among other areas. The volume focuses primarily on applying this knowledge to solve societal problems through cutting-edge technologies. The articles in these proceedings offer novel ideas that contribute to the growth of world-class research and development. The contents will be of interest to researchers and professionals alike.

Computational Processing of the Portuguese Language

Title Computational Processing of the Portuguese Language
Author A. Joaquim da Silva Teixeira
Publisher Springer
Pages 290
Release 2008-09-08
Genre Language Arts & Disciplines
ISBN 3540859802

This book constitutes the thoroughly refereed proceedings of the 8th International Workshop on Computational Processing of the Portuguese Language, PROPOR 2008, held in Aveiro, Portugal, in September 2008. The 21 revised full papers and 16 revised short papers presented were carefully reviewed and selected from 63 submissions. The papers are organized in topical sections on speech analysis; ontologies, semantics and anaphora resolution; speech synthesis; machine learning applied to natural language processing; speech recognition and applications; natural language processing tools and applications; posters.