Quality Measures in Data Mining
Title | Quality Measures in Data Mining PDF eBook |
Author | Fabrice Guillet |
Publisher | Springer Science & Business Media |
Pages | 319 |
Release | 2007-01-08 |
Genre | Mathematics |
ISBN | 3540449116 |
This book presents recent advances in quality measures in data mining.
Measuring Data Quality for Ongoing Improvement
Title | Measuring Data Quality for Ongoing Improvement PDF eBook |
Author | Laura Sebastian-Coleman |
Publisher | Newnes |
Pages | 404 |
Release | 2012-12-31 |
Genre | Computers |
ISBN | 0123977541 |
The Data Quality Assessment Framework shows you how to measure and monitor data quality, ensuring quality over time. You'll start with general concepts of measurement and work your way through a detailed framework of more than three dozen measurement types related to five objective dimensions of quality: completeness, timeliness, consistency, validity, and integrity. Ongoing measurement, rather than one time activities will help your organization reach a new level of data quality. This plain-language approach to measuring data can be understood by both business and IT and provides practical guidance on how to apply the DQAF within any organization enabling you to prioritize measurements and effectively report on results. Strategies for using data measurement to govern and improve the quality of data and guidelines for applying the framework within a data asset are included. You'll come away able to prioritize which measurement types to implement, knowing where to place them in a data flow and how frequently to measure. Common conceptual models for defining and storing of data quality results for purposes of trend analysis are also included as well as generic business requirements for ongoing measuring and monitoring including calculations and comparisons that make the measurements meaningful and help understand trends and detect anomalies. - Demonstrates how to leverage a technology independent data quality measurement framework for your specific business priorities and data quality challenges - Enables discussions between business and IT with a non-technical vocabulary for data quality measurement - Describes how to measure data quality on an ongoing basis with generic measurement types that can be applied to any situation
The Practitioner's Guide to Data Quality Improvement
Title | The Practitioner's Guide to Data Quality Improvement PDF eBook |
Author | David Loshin |
Publisher | Elsevier |
Pages | 423 |
Release | 2010-11-22 |
Genre | Computers |
ISBN | 0080920349 |
The Practitioner's Guide to Data Quality Improvement offers a comprehensive look at data quality for business and IT, encompassing people, process, and technology. It shares the fundamentals for understanding the impacts of poor data quality, and guides practitioners and managers alike in socializing, gaining sponsorship for, planning, and establishing a data quality program. It demonstrates how to institute and run a data quality program, from first thoughts and justifications to maintenance and ongoing metrics. It includes an in-depth look at the use of data quality tools, including business case templates, and tools for analysis, reporting, and strategic planning. This book is recommended for data management practitioners, including database analysts, information analysts, data administrators, data architects, enterprise architects, data warehouse engineers, and systems analysts, and their managers. - Offers a comprehensive look at data quality for business and IT, encompassing people, process, and technology. - Shows how to institute and run a data quality program, from first thoughts and justifications to maintenance and ongoing metrics. - Includes an in-depth look at the use of data quality tools, including business case templates, and tools for analysis, reporting, and strategic planning.
Principles of Data Mining
Title | Principles of Data Mining PDF eBook |
Author | David J. Hand |
Publisher | MIT Press |
Pages | 594 |
Release | 2001-08-17 |
Genre | Computers |
ISBN | 9780262082907 |
The first truly interdisciplinary text on data mining, blending the contributions of information science, computer science, and statistics. The growing interest in data mining is motivated by a common problem across disciplines: how does one store, access, model, and ultimately describe and understand very large data sets? Historically, different aspects of data mining have been addressed independently by different disciplines. This is the first truly interdisciplinary text on data mining, blending the contributions of information science, computer science, and statistics. The book consists of three sections. The first, foundations, provides a tutorial overview of the principles underlying data mining algorithms and their application. The presentation emphasizes intuition rather than rigor. The second section, data mining algorithms, shows how algorithms are constructed to solve specific problems in a principled manner. The algorithms covered include trees and rules for classification and regression, association rules, belief networks, classical statistical models, nonlinear models such as neural networks, and local "memory-based" models. The third section shows how all of the preceding analysis fits together when applied to real-world data mining problems. Topics include the role of metadata, how to handle missing data, and data preprocessing.
Discovery Science
Title | Discovery Science PDF eBook |
Author | João Gama |
Publisher | Springer |
Pages | 487 |
Release | 2009-10-07 |
Genre | Computers |
ISBN | 3642047475 |
This book constitutes the refereed proceedings of the twelfth International Conference, on Discovery Science, DS 2009, held in Porto, Portugal, in October 2009. The 35 revised full papers presented were carefully selected from 92 papers. The scope of the conference includes the development and analysis of methods for automatic scientific knowledge discovery, machine learning, intelligent data analysis, theory of learning, as well as their applications.
Machine Learning and Data Mining
Title | Machine Learning and Data Mining PDF eBook |
Author | Igor Kononenko |
Publisher | Horwood Publishing |
Pages | 484 |
Release | 2007-04-30 |
Genre | Computers |
ISBN | 9781904275213 |
Good data mining practice for business intelligence (the art of turning raw software into meaningful information) is demonstrated by the many new techniques and developments in the conversion of fresh scientific discovery into widely accessible software solutions. Written as an introduction to the main issues associated with the basics of machine learning and the algorithms used in data mining, this text is suitable foradvanced undergraduates, postgraduates and tutors in a wide area of computer science and technology, as well as researchers looking to adapt various algorithms for particular data mining tasks. A valuable addition to libraries and bookshelves of the many companies who are using the principles of data mining to effectively deliver solid business and industry solutions.
Data Preparation for Data Mining
Title | Data Preparation for Data Mining PDF eBook |
Author | Dorian Pyle |
Publisher | Morgan Kaufmann |
Pages | 566 |
Release | 1999-03-22 |
Genre | Computers |
ISBN | 9781558605299 |
This book focuses on the importance of clean, well-structured data as the first step to successful data mining. It shows how data should be prepared prior to mining in order to maximize mining performance.