Programming Elastic MapReduce
Title | Programming Elastic MapReduce PDF eBook |
Author | Kevin Schmidt |
Publisher | "O'Reilly Media, Inc." |
Pages | 264 |
Release | 2013-12-10 |
Genre | Computers |
ISBN | 1449364047 |
Although you don’t need a large computing infrastructure to process massive amounts of data with Apache Hadoop, it can still be difficult to get started. This practical guide shows you how to quickly launch data analysis projects in the cloud by using Amazon Elastic MapReduce (EMR), the hosted Hadoop framework in Amazon Web Services (AWS). Authors Kevin Schmidt and Christopher Phillips demonstrate best practices for using EMR and various AWS and Apache technologies by walking you through the construction of a sample MapReduce log analysis application. Using code samples and example configurations, you’ll learn how to assemble the building blocks necessary to solve your biggest data analysis problems. Get an overview of the AWS and Apache software tools used in large-scale data analysis Go through the process of executing a Job Flow with a simple log analyzer Discover useful MapReduce patterns for filtering and analyzing data sets Use Apache Hive and Pig instead of Java to build a MapReduce Job Flow Learn the basics for using Amazon EMR to run machine learning algorithms Develop a project cost model for using Amazon EMR and other AWS tools
Programming Elastic MapReduce
Title | Programming Elastic MapReduce PDF eBook |
Author | Kevin Schmidt. Christopher Phillips |
Publisher | |
Pages | |
Release | 2013 |
Genre | |
ISBN | 9781449364038 |
Learning Big Data with Amazon Elastic MapReduce
Title | Learning Big Data with Amazon Elastic MapReduce PDF eBook |
Author | Amarkant Singh |
Publisher | |
Pages | 242 |
Release | 2014-10-10 |
Genre | Computers |
ISBN | 9781782173434 |
This book is aimed at developers and system administrators who want to learn about Big Data analysis using Amazon Elastic MapReduce. Basic Java programming knowledge is required. You should be comfortable with using command-line tools. Prior knowledge of AWS, API, and CLI tools is not assumed. Also, no exposure to Hadoop and MapReduce is expected.
Functional Programming in C#
Title | Functional Programming in C# PDF eBook |
Author | Oliver Sturm |
Publisher | John Wiley and Sons |
Pages | 288 |
Release | 2011-04-11 |
Genre | Computers |
ISBN | 0470744588 |
Presents a guide to the features of C♯, covering such topics as functions, generics, iterators, currying, caching, order functions, sequences, monads, and MapReduce.
Programming MapReduce with Scalding
Title | Programming MapReduce with Scalding PDF eBook |
Author | Antonios Chalkiopoulos |
Publisher | Packt Publishing Ltd |
Pages | 225 |
Release | 2014-06-25 |
Genre | Computers |
ISBN | 1783287020 |
This book is an easy-to-understand, practical guide to designing, testing, and implementing complex MapReduce applications in Scala using the Scalding framework. It is packed with examples featuring log-processing, ad-targeting, and machine learning. This book is for developers who are willing to discover how to effectively develop MapReduce applications. Prior knowledge of Hadoop or Scala is not required; however, investing some time on those topics would certainly be beneficial.
Programming Hive
Title | Programming Hive PDF eBook |
Author | Edward Capriolo |
Publisher | "O'Reilly Media, Inc." |
Pages | 351 |
Release | 2012-09-26 |
Genre | Computers |
ISBN | 1449319335 |
Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect—HiveQL—to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem. This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data. Use Hive to create, alter, and drop databases, tables, views, functions, and indexes Customize data formats and storage options, from files to external databases Load and extract data from tables—and use queries, grouping, filtering, joining, and other conventional query methods Gain best practices for creating user defined functions (UDFs) Learn Hive patterns you should use and anti-patterns you should avoid Integrate Hive with other data processing programs Use storage handlers for NoSQL databases and other datastores Learn the pros and cons of running Hive on Amazon’s Elastic MapReduce
Web-Scale Data Management for the Cloud
Title | Web-Scale Data Management for the Cloud PDF eBook |
Author | Wolfgang Lehner |
Publisher | Springer Science & Business Media |
Pages | 209 |
Release | 2013-04-06 |
Genre | Computers |
ISBN | 1461468566 |
The efficient management of a consistent and integrated database is a central task in modern IT and highly relevant for science and industry. Hardly any critical enterprise solution comes without any functionality for managing data in its different forms. Web-Scale Data Management for the Cloud addresses fundamental challenges posed by the need and desire to provide database functionality in the context of the Database as a Service (DBaaS) paradigm for database outsourcing. This book also discusses the motivation of the new paradigm of cloud computing, and its impact to data outsourcing and service-oriented computing in data-intensive applications. Techniques with respect to the support in the current cloud environments, major challenges, and future trends are covered in the last section of this book. A survey addressing the techniques and special requirements for building database services are provided in this book as well.