Introduction to Clustering Large and High-Dimensional Data
Title | Introduction to Clustering Large and High-Dimensional Data PDF eBook |
Author | Jacob Kogan |
Publisher | Cambridge University Press |
Pages | 228 |
Release | 2007 |
Genre | Computers |
ISBN | 9780521617932 |
Focuses on a few of the important clustering algorithms in the context of information retrieval.
New Directions in Statistical Physics
Title | New Directions in Statistical Physics PDF eBook |
Author | Luc T. Wille |
Publisher | Springer Science & Business Media |
Pages | 369 |
Release | 2013-03-09 |
Genre | Science |
ISBN | 3662089688 |
This book provides a unique insight into the latest breakthroughs in a consistent manner, at a level accessible to undergraduates, yet with enough attention to the theory and computation to satisfy the professional researcher Statistical physics addresses the study and understanding of systems with many degrees of freedom. As such it has a rich and varied history, with applications to thermodynamics, magnetic phase transitions, and order/disorder transformations, to name just a few. However, the tools of statistical physics can be profitably used to investigate any system with a large number of components. Thus, recent years have seen these methods applied in many unexpected directions, three of which are the main focus of this volume. These applications have been remarkably successful and have enriched the financial, biological, and engineering literature. Although reported in the physics literature, the results tend to be scattered and the underlying unity of the field overlooked.
High-Dimensional Probability
Title | High-Dimensional Probability PDF eBook |
Author | Roman Vershynin |
Publisher | Cambridge University Press |
Pages | 299 |
Release | 2018-09-27 |
Genre | Business & Economics |
ISBN | 1108415199 |
An integrated package of powerful probabilistic tools and key applications in modern mathematical data science.
Introduction to High-Dimensional Statistics
Title | Introduction to High-Dimensional Statistics PDF eBook |
Author | Christophe Giraud |
Publisher | CRC Press |
Pages | 410 |
Release | 2021-08-25 |
Genre | Computers |
ISBN | 1000408353 |
Praise for the first edition: "[This book] succeeds singularly at providing a structured introduction to this active field of research. ... it is arguably the most accessible overview yet published of the mathematical ideas and principles that one needs to master to enter the field of high-dimensional statistics. ... recommended to anyone interested in the main results of current research in high-dimensional statistics as well as anyone interested in acquiring the core mathematical skills to enter this area of research." —Journal of the American Statistical Association Introduction to High-Dimensional Statistics, Second Edition preserves the philosophy of the first edition: to be a concise guide for students and researchers discovering the area and interested in the mathematics involved. The main concepts and ideas are presented in simple settings, avoiding thereby unessential technicalities. High-dimensional statistics is a fast-evolving field, and much progress has been made on a large variety of topics, providing new insights and methods. Offering a succinct presentation of the mathematical foundations of high-dimensional statistics, this new edition: Offers revised chapters from the previous edition, with the inclusion of many additional materials on some important topics, including compress sensing, estimation with convex constraints, the slope estimator, simultaneously low-rank and row-sparse linear regression, or aggregation of a continuous set of estimators. Introduces three new chapters on iterative algorithms, clustering, and minimax lower bounds. Provides enhanced appendices, minimax lower-bounds mainly with the addition of the Davis-Kahan perturbation bound and of two simple versions of the Hanson-Wright concentration inequality. Covers cutting-edge statistical methods including model selection, sparsity and the Lasso, iterative hard thresholding, aggregation, support vector machines, and learning theory. Provides detailed exercises at the end of every chapter with collaborative solutions on a wiki site. Illustrates concepts with simple but clear practical examples.
Clustering
Title | Clustering PDF eBook |
Author | Rui Xu |
Publisher | John Wiley & Sons |
Pages | 400 |
Release | 2008-11-03 |
Genre | Mathematics |
ISBN | 0470382783 |
This is the first book to take a truly comprehensive look at clustering. It begins with an introduction to cluster analysis and goes on to explore: proximity measures; hierarchical clustering; partition clustering; neural network-based clustering; kernel-based clustering; sequential data clustering; large-scale data clustering; data visualization and high-dimensional data clustering; and cluster validation. The authors assume no previous background in clustering and their generous inclusion of examples and references help make the subject matter comprehensible for readers of varying levels and backgrounds.
Grouping Multidimensional Data
Title | Grouping Multidimensional Data PDF eBook |
Author | Jacob Kogan |
Publisher | Taylor & Francis |
Pages | 296 |
Release | 2006-02-10 |
Genre | Computers |
ISBN | 9783540283485 |
Publisher description
Grouping Multidimensional Data
Title | Grouping Multidimensional Data PDF eBook |
Author | Jacob Kogan |
Publisher | Springer Science & Business Media |
Pages | 273 |
Release | 2006-02-08 |
Genre | Computers |
ISBN | 3540283498 |
Clustering is one of the most fundamental and essential data analysis techniques. Clustering can be used as an independent data mining task to discern intrinsic characteristics of data, or as a preprocessing step with the clustering results then used for classification, correlation analysis, or anomaly detection. Kogan and his co-editors have put together recent advances in clustering large and high-dimension data. Their volume addresses new topics and methods which are central to modern data analysis, with particular emphasis on linear algebra tools, opimization methods and statistical techniques. The contributions, written by leading researchers from both academia and industry, cover theoretical basics as well as application and evaluation of algorithms, and thus provide an excellent state-of-the-art overview. The level of detail, the breadth of coverage, and the comprehensive bibliography make this book a perfect fit for researchers and graduate students in data mining and in many other important related application areas.