Data Quality and Record Linkage Techniques
Title | Data Quality and Record Linkage Techniques |
Author | Thomas N. Herzog |
Publisher | Springer Science & Business Media |
Pages | 225 |
Release | 2007-05-23 |
Genre | Computers |
ISBN | 0387695052 |
This book offers a practical understanding of the issues involved in improving data quality through editing, imputation, and record linkage. The first part of the book deals with methods and models, focusing on the Fellegi-Holt edit-imputation model, the Little-Rubin multiple-imputation scheme, and the Fellegi-Sunter record linkage model. The second part presents case studies in which these techniques are applied in a variety of areas, including mortgage guarantee insurance, medical and biomedical applications, highway safety, and social insurance, as well as the construction of list frames and administrative lists. The book combines practical advice, mathematical rigor, management insight and philosophy.
Methodological Developments in Data Linkage
Title | Methodological Developments in Data Linkage |
Author | Katie Harron |
Publisher | John Wiley & Sons |
Pages | 286 |
Release | 2015-12-14 |
Genre | Medical |
ISBN | 1118745876 |
A comprehensive compilation of new developments in data linkage methodology. The increasing availability of large administrative databases has led to a dramatic rise in the use of data linkage, yet the standard texts on linkage are still those that describe the seminal work of the 1950s and 1960s, with some updates. Linkage and analysis of data across sources remain problematic due to a lack of discriminatory and accurate identifiers, missing data and regulatory issues. Recent developments in data linkage methodology have concentrated on bias and analysis of linked data, novel approaches to organising relationships between databases and privacy-preserving linkage. Methodological Developments in Data Linkage brings together a collection of contributions from members of the international data linkage community, covering cutting-edge methodology in this field. It presents the opportunities and challenges provided by linkage of large and often complex datasets, including analysis problems, legal and security aspects, models for data access and the development of novel research areas. New methods for handling uncertainty in analysis of linked data, solutions for anonymised linkage and alternative models for data collection are also discussed. Key features: presents cutting-edge methods for a topic of increasing importance to a wide range of research areas, with applications to data linkage systems internationally; covers the essential issues associated with data linkage today; includes examples based on real data linkage systems, highlighting the opportunities, successes and challenges that the increasing availability of linkage data provides; and takes a novel approach that incorporates technical aspects of linkage, management and analysis of linked data. This book will be of core interest to academics, government employees, data holders, data managers, analysts and statisticians who use administrative data.
It will also appeal to researchers in a variety of areas, including epidemiology, biostatistics, social statistics, informatics, policy and public health.
Data Matching
Title | Data Matching |
Author | Peter Christen |
Publisher | Springer Science & Business Media |
Pages | 279 |
Release | 2012-07-04 |
Genre | Computers |
ISBN | 3642311644 |
Data matching (also known as record or data linkage, entity resolution, object identification, or field matching) is the task of identifying, matching and merging records that correspond to the same entities from several databases or even within one database. Based on research in various domains including applied statistics, health informatics, data mining, machine learning, artificial intelligence, database management, and digital libraries, significant advances have been achieved over the last decade in all aspects of the data matching process, especially in improving the accuracy of data matching and its scalability to large databases. Peter Christen’s book is divided into three parts. Part I, “Overview”, introduces the subject by presenting several sample applications and their special challenges, as well as a general overview of a generic data matching process. Part II, “Steps of the Data Matching Process”, then details its main steps, such as pre-processing, indexing, field and record comparison, classification, and quality evaluation. Lastly, Part III, “Further Topics”, deals with specific aspects such as privacy, real-time matching, and matching unstructured data, and briefly describes the main features of many research and open source systems available today. By providing the reader with a broad range of data matching concepts and techniques and touching on all aspects of the data matching process, this book helps researchers as well as students specializing in data quality or data matching to familiarize themselves with recent research advances and to identify open research challenges in the area of data matching. To this end, each chapter of the book includes a final section that provides pointers to further background and research material. Practitioners will better understand the current state of the art in data matching as well as the internal workings and limitations of current systems.
In particular, they will learn that it is often not feasible to simply implement an existing off-the-shelf data matching system without substantial adaptation and customization. Such practical considerations are discussed for each of the major steps in the data matching process.
Data-Driven Policy Impact Evaluation
Title | Data-Driven Policy Impact Evaluation |
Author | Nuno Crato |
Publisher | Springer |
Pages | 344 |
Release | 2018-10-02 |
Genre | Political Science |
ISBN | 3319784617 |
In the light of better and more detailed administrative databases, this open access book provides statistical tools for evaluating the effects of public policies advocated by governments and public institutions. Experts from academia, national statistics offices and various research centers present modern econometric methods for efficient data-driven policy evaluation and monitoring, assess the causal effects of policy measures, and report on best practices of successful data management and usage. Topics include data confidentiality, data linkage, and national practices in policy areas such as public health, education and employment. The book offers scholars as well as practitioners from public administrations, consultancy firms and nongovernmental organizations insights into counterfactual impact evaluation methods and the potential of data-based policy and program evaluation.
Record Linkage and Privacy
Title | Record Linkage and Privacy |
Author | United States. General Accounting Office |
Publisher | DIANE Publishing |
Pages | 172 |
Release | 2001 |
Genre | Electronic records |
ISBN | 1428949291 |
Record Linkage and Privacy
Title | Record Linkage and Privacy |
Author | |
Publisher | |
Pages | 174 |
Release | 2001 |
Genre | Electronic records |
ISBN |
Record Linkage Techniques, 1985
Title | Record Linkage Techniques, 1985 |
Author | Beth Kilss |
Publisher | |
Pages | 412 |
Release | 1986 |
Genre | Dual record systems |
ISBN |