Statistics for High-Dimensional Data

Statistics for High-Dimensional Data
Title Statistics for High-Dimensional Data PDF eBook
Author Peter Bühlmann
Publisher Springer Science & Business Media
Pages 568
Release 2011-06-08
Genre Mathematics
ISBN 364220192X

Download Statistics for High-Dimensional Data Book in PDF, Epub and Kindle

Modern statistics deals with large and complex data sets, and consequently with models containing a large number of parameters. This book presents a detailed account of recently developed approaches, including the Lasso and versions of it for various models, boosting methods, undirected graphical modeling, and procedures controlling false positive selections. A special characteristic of the book is that it contains comprehensive mathematical theory on high-dimensional statistics combined with methodology, algorithms and illustrations with real data examples. This in-depth approach highlights the methods’ great potential and practical applicability in a variety of settings. As such, it is a valuable resource for researchers, graduate students and experts in statistics, applied mathematics and computer science.

Statistical Methods for Complex And/or High Dimensional Data

Statistical Methods for Complex And/or High Dimensional Data
Title Statistical Methods for Complex And/or High Dimensional Data PDF eBook
Author Shanshan Qin
Publisher
Pages 0
Release 2020
Genre
ISBN

Download Statistical Methods for Complex And/or High Dimensional Data Book in PDF, Epub and Kindle

This dissertation focuses on the development and implementation of statistical methods for high-dimensional and/or complex data, with an emphasis on $p$, the number of explanatory variables, larger than $n$, the number of observations, the ratio of $p/n$ tending to a finite number, and data with outlier observations. First, we propose a non-negative feature selection and/or feature grouping (nnFSG) method. It deals with a general series of sign-constrained high-dimensional regression problems, which allows the regression coefficients to carry a structure of disjoint homogeneity, including sparsity as a special case. To solve the resulting non-convex optimization problem, we provide an algorithm that incorporates the difference of convex programming, augmented Lagrange and coordinate descent methods. Furthermore, we show that the aforementioned nnFSG method recovers the oracle estimate consistently, and yields a bound on the mean squared errors (MSE).} Besides, we examine the performance of our method by using finite sample simulations and a real protein mass spectrum dataset. Next, we consider a High-dimensional multivariate ridge regression model under the regime where both $p$ and $n$ are large enough with $p/n \rightarrow \kappa (0

Complex Models and Computational Methods in Statistics

Complex Models and Computational Methods in Statistics
Title Complex Models and Computational Methods in Statistics PDF eBook
Author Matteo Grigoletto
Publisher Springer Science & Business Media
Pages 228
Release 2013-01-26
Genre Mathematics
ISBN 884702871X

Download Complex Models and Computational Methods in Statistics Book in PDF, Epub and Kindle

The use of computational methods in statistics to face complex problems and highly dimensional data, as well as the widespread availability of computer technology, is no news. The range of applications, instead, is unprecedented. As often occurs, new and complex data types require new strategies, demanding for the development of novel statistical methods and suggesting stimulating mathematical problems. This book is addressed to researchers working at the forefront of the statistical analysis of complex systems and using computationally intensive statistical methods.

Topological and Statistical Methods for Complex Data

Topological and Statistical Methods for Complex Data
Title Topological and Statistical Methods for Complex Data PDF eBook
Author Janine Bennett
Publisher Springer
Pages 297
Release 2014-11-19
Genre Mathematics
ISBN 3662449005

Download Topological and Statistical Methods for Complex Data Book in PDF, Epub and Kindle

This book contains papers presented at the Workshop on the Analysis of Large-scale, High-Dimensional, and Multi-Variate Data Using Topology and Statistics, held in Le Barp, France, June 2013. It features the work of some of the most prominent and recognized leaders in the field who examine challenges as well as detail solutions to the analysis of extreme scale data. The book presents new methods that leverage the mutual strengths of both topological and statistical techniques to support the management, analysis, and visualization of complex data. It covers both theory and application and provides readers with an overview of important key concepts and the latest research trends. Coverage in the book includes multi-variate and/or high-dimensional analysis techniques, feature-based statistical methods, combinatorial algorithms, scalable statistics algorithms, scalar and vector field topology, and multi-scale representations. In addition, the book details algorithms that are broadly applicable and can be used by application scientists to glean insight from a wide range of complex data sets.

High-dimensional Data Analysis

High-dimensional Data Analysis
Title High-dimensional Data Analysis PDF eBook
Author Tony Cai;Xiaotong Shen
Publisher
Pages 318
Release
Genre
ISBN 9787894236326

Download High-dimensional Data Analysis Book in PDF, Epub and Kindle

Over the last few years, significant developments have been taking place in highdimensional data analysis, driven primarily by a wide range of applications in many fields such as genomics and signal processing. In particular, substantial advances have been made in the areas of feature selection, covariance estimation, classification and regression. This book intends to examine important issues arising from highdimensional data analysis to explore key ideas for statistical inference and prediction. It is structured around topics on multiple hypothesis testing, feature selection, regression, cla.

Advances in Complex Data Modeling and Computational Methods in Statistics

Advances in Complex Data Modeling and Computational Methods in Statistics
Title Advances in Complex Data Modeling and Computational Methods in Statistics PDF eBook
Author Anna Maria Paganoni
Publisher Springer
Pages 210
Release 2014-11-04
Genre Mathematics
ISBN 3319111493

Download Advances in Complex Data Modeling and Computational Methods in Statistics Book in PDF, Epub and Kindle

The book is addressed to statisticians working at the forefront of the statistical analysis of complex and high dimensional data and offers a wide variety of statistical models, computer intensive methods and applications: network inference from the analysis of high dimensional data; new developments for bootstrapping complex data; regression analysis for measuring the downsize reputational risk; statistical methods for research on the human genome dynamics; inference in non-euclidean settings and for shape data; Bayesian methods for reliability and the analysis of complex data; methodological issues in using administrative data for clinical and epidemiological research; regression models with differential regularization; geostatistical methods for mobility analysis through mobile phone data exploration. This volume is the result of a careful selection among the contributions presented at the conference "S.Co.2013: Complex data modeling and computationally intensive methods for estimation and prediction" held at the Politecnico di Milano, 2013. All the papers published here have been rigorously peer-reviewed.

Statistical Foundations of Data Science

Statistical Foundations of Data Science
Title Statistical Foundations of Data Science PDF eBook
Author Jianqing Fan
Publisher CRC Press
Pages 942
Release 2020-09-21
Genre Mathematics
ISBN 0429527616

Download Statistical Foundations of Data Science Book in PDF, Epub and Kindle

Statistical Foundations of Data Science gives a thorough introduction to commonly used statistical models, contemporary statistical machine learning techniques and algorithms, along with their mathematical insights and statistical theories. It aims to serve as a graduate-level textbook and a research monograph on high-dimensional statistics, sparsity and covariance learning, machine learning, and statistical inference. It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis. It then introduces multiple linear regression and expands the techniques of model building via nonparametric regression and kernel tricks. It provides a comprehensive account on sparsity explorations and model selections for multiple regression, generalized linear models, quantile regression, robust regression, hazards regression, among others. High-dimensional inference is also thoroughly addressed and so is feature screening. The book also provides a comprehensive account on high-dimensional covariance estimation, learning latent factors and hidden structures, as well as their applications to statistical estimation, inference, prediction and machine learning problems. It also introduces thoroughly statistical machine learning theory and methods for classification, clustering, and prediction. These include CART, random forests, boosting, support vector machines, clustering algorithms, sparse PCA, and deep learning.