Data Engineering with Python
Title | Data Engineering with Python PDF eBook |
Author | Paul Crickard |
Publisher | Packt Publishing Ltd |
Pages | 357 |
Release | 2020-10-23 |
Genre | Computers |
ISBN | 1839212306 |
Build, monitor, and manage real-time data pipelines to create data engineering infrastructure efficiently using open-source Apache projects Key Features Become well-versed in data architectures, data preparation, and data optimization skills with the help of practical examples Design data models and learn how to extract, transform, and load (ETL) data using Python Schedule, automate, and monitor complex data pipelines in production Book DescriptionData engineering provides the foundation for data science and analytics, and forms an important part of all businesses. This book will help you to explore various tools and methods that are used for understanding the data engineering process using Python. The book will show you how to tackle challenges commonly faced in different aspects of data engineering. You’ll start with an introduction to the basics of data engineering, along with the technologies and frameworks required to build data pipelines to work with large datasets. You’ll learn how to transform and clean data and perform analytics to get the most out of your data. As you advance, you'll discover how to work with big data of varying complexity and production databases, and build data pipelines. Using real-world examples, you’ll build architectures on which you’ll learn how to deploy data pipelines. By the end of this Python book, you’ll have gained a clear understanding of data modeling techniques, and will be able to confidently build data engineering pipelines for tracking data, running quality checks, and making necessary changes in production.What you will learn Understand how data engineering supports data science workflows Discover how to extract data from files and databases and then clean, transform, and enrich it Configure processors for handling different file formats as well as both relational and NoSQL databases Find out how to implement a data pipeline and dashboard to visualize results Use staging and validation to check data before landing in the warehouse Build real-time pipelines with staging areas that perform validation and handle failures Get to grips with deploying pipelines in the production environment Who this book is for This book is for data analysts, ETL developers, and anyone looking to get started with or transition to the field of data engineering or refresh their knowledge of data engineering using Python. This book will also be useful for students planning to build a career in data engineering or IT professionals preparing for a transition. No previous knowledge of data engineering is required.
Data Pipelines with Apache Airflow
Title | Data Pipelines with Apache Airflow PDF eBook |
Author | Bas P. Harenslak |
Publisher | Simon and Schuster |
Pages | 478 |
Release | 2021-04-27 |
Genre | Computers |
ISBN | 1617296902 |
This book teaches you how to build and maintain effective data pipelines. Youll explore the most common usage patterns, including aggregating multiple data sources, connecting to and from data lakes, and cloud deployment. --
Data Governance
Title | Data Governance PDF eBook |
Author | Dimitrios Sargiotis |
Publisher | Springer Nature |
Pages | 553 |
Release | |
Genre | |
ISBN | 3031672682 |
Data Science and Security
Title | Data Science and Security PDF eBook |
Author | Samiksha Shukla |
Publisher | Springer Nature |
Pages | 505 |
Release | 2022-07-01 |
Genre | Technology & Engineering |
ISBN | 981192211X |
This book presents best selected papers presented at the International Conference on Data Science for Computational Security (IDSCS 2022), organized by the Department of Data Science, CHRIST (Deemed to be University), Pune Lavasa Campus, India, during 11 – 12 February 2022. The book proposes new technologies and discusses future solutions and applications of data science, data analytics and security. The book targets current research works in the areas of data science, data security, data analytics, artificial intelligence, machine learning, computer vision, algorithms design, computer networking, data mining, big data, text mining, knowledge representation, soft computing and cloud computing.
QlikView Your Business
Title | QlikView Your Business PDF eBook |
Author | Oleg Troyansky |
Publisher | John Wiley & Sons |
Pages | 801 |
Release | 2015-08-10 |
Genre | Computers |
ISBN | 1118949552 |
Unlock the meaning of your data with QlikView The Qlik platform was designed to provide a fast and easy data analytics tool, and QlikView Your Business is your detailed, full-color, step-by-step guide to understanding Qlikview's powerful features and techniques so you can quickly start unlocking your data’s potential. This expert author team brings real-world insight together with practical business analytics, so you can approach, explore, and solve business intelligence problems using the robust Qlik toolset and clearly communicate your results to stakeholders using powerful visualization features in QlikView and Qlik Sense. This book starts at the basic level and dives deep into the most advanced QlikView techniques, delivering tangible value and knowledge to new users and experienced developers alike. As an added benefit, every topic presented is enhanced with tips, tricks, and insightful recommendations that the authors accumulated through years of developing QlikView analytics. This is the book for you: If you are a developer whose job is to load transactional data into Qlik BI environment, and who needs to understand both the basics and the most advanced techniques of Qlik data modelling and scripting If you are a data analyst whose job is to develop actionable and insightful QlikView visualizations to share within your organization If you are a project manager or business person, who wants to get a better understanding of the Qlik Business Intelligence platform and its capabilities What You Will Learn: The book covers three common business scenarios - Sales, Profitability, and Inventory Analysis. Each scenario contains four chapters, covering the four main disciplines of business analytics: Business Case, Data Modeling, Scripting, and Visualizations. The material is organized by increasing levels of complexity. Following our comprehensive tutorial, you will learn simple and advanced QlikView and Qlik Sense concepts, including the following: Data Modeling: Transforming Transactional data into Dimensional models Building a Star Schema Linking multiple fact tables using Link Tables Combing multiple tables into a single fact able using Concatenated Fact models Managing slowly changing dimensions Advanced date handling, using the As of Date table Calculating running balances Basic and Advanced Scripting: How to use the Data Load Script language for implementing data modeling techniques How to build and use the QVD data layer Building a multi-tier data architectures Using variables, loops, subroutines, and other script control statements Advanced scripting techniques for a variety of ETL solutions Building Insightful Visualizations in QlikView: Introduction into QlikView sheet objects — List Boxes, Text Objects, Charts, and more Designing insightful Dashboards in QlikView Using advanced calculation techniques, such as Set Analysis and Advanced Aggregation Using variables for What-If Analysis, as well as using variables for storing calculations, colors, and selection filters Advanced visualization techniques - normalized and non-normalized Mekko charts, Waterfall charts, Whale Tail charts, and more Building Insightful Visualizations in Qlik Sense: Introducing Qlik Sense - how it is different from QlikView and what is similar? Creating Sense sheet objects Building and using the Library of Master Items Exploring Qlik Sense unique features — Storytelling, Geo Mapping, and using Extensions Whether you are just starting out with QlikView or are ready to dive deeper, QlikView Your Business is your comprehensive guide to sharpening your QlikView skills and unleashing the power of QlikView in your organization.
PostgreSQL Query Optimization
Title | PostgreSQL Query Optimization PDF eBook |
Author | Henrietta Dombrovskaya |
Publisher | Apress |
Pages | 280 |
Release | 2021-05-27 |
Genre | Computers |
ISBN | 9781484268841 |
Write optimized queries. This book helps you write queries that perform fast and deliver results on time. You will learn that query optimization is not a dark art practiced by a small, secretive cabal of sorcerers. Any motivated professional can learn to write efficient queries from the get-go and capably optimize existing queries. You will learn to look at the process of writing a query from the database engine’s point of view, and know how to think like the database optimizer. The book begins with a discussion of what a performant system is and progresses to measuring performance and setting performance goals. It introduces different classes of queries and optimization techniques suitable to each, such as the use of indexes and specific join algorithms. You will learn to read and understand query execution plans along with techniques for influencing those plans for better performance. The book also covers advanced topics such as the use of functions and procedures, dynamic SQL, and generated queries. All of these techniques are then used together to produce performant applications, avoiding the pitfalls of object-relational mappers. What You Will Learn Identify optimization goals in OLTP and OLAP systems Read and understand PostgreSQL execution plans Distinguish between short queries and long queries Choose the right optimization technique for each query type Identify indexes that will improve query performance Optimize full table scans Avoid the pitfalls of object-relational mapping systems Optimize the entire application rather than just database queries Who This Book Is For IT professionals working in PostgreSQL who want to develop performant and scalable applications, anyone whose job title contains the words “database developer” or “database administrator" or who is a backend developer charged with programming database calls, and system architects involved in the overall design of application systems running against a PostgreSQL database
Gold and Iron
Title | Gold and Iron PDF eBook |
Author | Fritz Stern |
Publisher | Vintage |
Pages | 671 |
Release | 1979-09-12 |
Genre | History |
ISBN | 0394740343 |
Winner of the Lionel Trilling Award Nominated for the National Book Award “A major contribution to our understanding of some of the great themes of modern European history—the relations between Jews and Germans, between economics and politics, between banking and diplomacy.” —James Joll, The New York Times Book Review “I cannot praise this book too highly. It is a work of original scholarship, both exact and profound. It restores a buried chapter of history and penetrates, with insight and understanding, one of the most disturbing historical problems of modern times.” —Hugh J. Trevor-Roper, London Sunday Times “[An] extraordinary book, an invaluable contribution to our understanding of Germany in the second half of the nineteenth century.” —Stanley Hoffman, Washington Post Book World “One of the most important historical works of the past few decades.” —Golo Mann “In many ways this book resembles the great nineteenth-century novels.” —The Economist