Statistics for High-Dimensional Data

Statistics for High-Dimensional Data
Title Statistics for High-Dimensional Data PDF eBook
Author Peter Bühlmann
Publisher Springer Science & Business Media
Pages 568
Release 2011-06-08
Genre Mathematics
ISBN 364220192X

Download Statistics for High-Dimensional Data Book in PDF, Epub and Kindle

Modern statistics deals with large and complex data sets, and consequently with models containing a large number of parameters. This book presents a detailed account of recently developed approaches, including the Lasso and versions of it for various models, boosting methods, undirected graphical modeling, and procedures controlling false positive selections. A special characteristic of the book is that it contains comprehensive mathematical theory on high-dimensional statistics combined with methodology, algorithms and illustrations with real data examples. This in-depth approach highlights the methods’ great potential and practical applicability in a variety of settings. As such, it is a valuable resource for researchers, graduate students and experts in statistics, applied mathematics and computer science.

Statistical Methods for Complex And/or High Dimensional Data

Statistical Methods for Complex And/or High Dimensional Data
Title Statistical Methods for Complex And/or High Dimensional Data PDF eBook
Author Shanshan Qin
Publisher
Pages 0
Release 2020
Genre
ISBN

Download Statistical Methods for Complex And/or High Dimensional Data Book in PDF, Epub and Kindle

This dissertation focuses on the development and implementation of statistical methods for high-dimensional and/or complex data, with an emphasis on $p$, the number of explanatory variables, larger than $n$, the number of observations, the ratio of $p/n$ tending to a finite number, and data with outlier observations. First, we propose a non-negative feature selection and/or feature grouping (nnFSG) method. It deals with a general series of sign-constrained high-dimensional regression problems, which allows the regression coefficients to carry a structure of disjoint homogeneity, including sparsity as a special case. To solve the resulting non-convex optimization problem, we provide an algorithm that incorporates the difference of convex programming, augmented Lagrange and coordinate descent methods. Furthermore, we show that the aforementioned nnFSG method recovers the oracle estimate consistently, and yields a bound on the mean squared errors (MSE).} Besides, we examine the performance of our method by using finite sample simulations and a real protein mass spectrum dataset. Next, we consider a High-dimensional multivariate ridge regression model under the regime where both $p$ and $n$ are large enough with $p/n \rightarrow \kappa (0

Complex Models and Computational Methods in Statistics

Complex Models and Computational Methods in Statistics
Title Complex Models and Computational Methods in Statistics PDF eBook
Author Matteo Grigoletto
Publisher Springer Science & Business Media
Pages 228
Release 2013-01-26
Genre Mathematics
ISBN 884702871X

Download Complex Models and Computational Methods in Statistics Book in PDF, Epub and Kindle

The use of computational methods in statistics to face complex problems and highly dimensional data, as well as the widespread availability of computer technology, is no news. The range of applications, instead, is unprecedented. As often occurs, new and complex data types require new strategies, demanding for the development of novel statistical methods and suggesting stimulating mathematical problems. This book is addressed to researchers working at the forefront of the statistical analysis of complex systems and using computationally intensive statistical methods.

Topological and Statistical Methods for Complex Data

Topological and Statistical Methods for Complex Data
Title Topological and Statistical Methods for Complex Data PDF eBook
Author Janine Bennett
Publisher Springer
Pages 297
Release 2014-11-19
Genre Mathematics
ISBN 3662449005

Download Topological and Statistical Methods for Complex Data Book in PDF, Epub and Kindle

This book contains papers presented at the Workshop on the Analysis of Large-scale, High-Dimensional, and Multi-Variate Data Using Topology and Statistics, held in Le Barp, France, June 2013. It features the work of some of the most prominent and recognized leaders in the field who examine challenges as well as detail solutions to the analysis of extreme scale data. The book presents new methods that leverage the mutual strengths of both topological and statistical techniques to support the management, analysis, and visualization of complex data. It covers both theory and application and provides readers with an overview of important key concepts and the latest research trends. Coverage in the book includes multi-variate and/or high-dimensional analysis techniques, feature-based statistical methods, combinatorial algorithms, scalable statistics algorithms, scalar and vector field topology, and multi-scale representations. In addition, the book details algorithms that are broadly applicable and can be used by application scientists to glean insight from a wide range of complex data sets.

Advances in Complex Data Modeling and Computational Methods in Statistics

Advances in Complex Data Modeling and Computational Methods in Statistics
Title Advances in Complex Data Modeling and Computational Methods in Statistics PDF eBook
Author Anna Maria Paganoni
Publisher Springer
Pages 210
Release 2014-11-04
Genre Mathematics
ISBN 3319111493

Download Advances in Complex Data Modeling and Computational Methods in Statistics Book in PDF, Epub and Kindle

The book is addressed to statisticians working at the forefront of the statistical analysis of complex and high dimensional data and offers a wide variety of statistical models, computer intensive methods and applications: network inference from the analysis of high dimensional data; new developments for bootstrapping complex data; regression analysis for measuring the downsize reputational risk; statistical methods for research on the human genome dynamics; inference in non-euclidean settings and for shape data; Bayesian methods for reliability and the analysis of complex data; methodological issues in using administrative data for clinical and epidemiological research; regression models with differential regularization; geostatistical methods for mobility analysis through mobile phone data exploration. This volume is the result of a careful selection among the contributions presented at the conference "S.Co.2013: Complex data modeling and computationally intensive methods for estimation and prediction" held at the Politecnico di Milano, 2013. All the papers published here have been rigorously peer-reviewed.

Statistical Inference from High Dimensional Data

Statistical Inference from High Dimensional Data
Title Statistical Inference from High Dimensional Data PDF eBook
Author Carlos Fernandez-Lozano
Publisher MDPI
Pages 314
Release 2021-04-28
Genre Science
ISBN 3036509445

Download Statistical Inference from High Dimensional Data Book in PDF, Epub and Kindle

• Real-world problems can be high-dimensional, complex, and noisy • More data does not imply more information • Different approaches deal with the so-called curse of dimensionality to reduce irrelevant information • A process with multidimensional information is not necessarily easy to interpret nor process • In some real-world applications, the number of elements of a class is clearly lower than the other. The models tend to assume that the importance of the analysis belongs to the majority class and this is not usually the truth • The analysis of complex diseases such as cancer are focused on more-than-one dimensional omic data • The increasing amount of data thanks to the reduction of cost of the high-throughput experiments opens up a new era for integrative data-driven approaches • Entropy-based approaches are of interest to reduce the dimensionality of high-dimensional data

Statistical Foundations of Data Science

Statistical Foundations of Data Science
Title Statistical Foundations of Data Science PDF eBook
Author Jianqing Fan
Publisher CRC Press
Pages 942
Release 2020-09-21
Genre Mathematics
ISBN 0429527616

Download Statistical Foundations of Data Science Book in PDF, Epub and Kindle

Statistical Foundations of Data Science gives a thorough introduction to commonly used statistical models, contemporary statistical machine learning techniques and algorithms, along with their mathematical insights and statistical theories. It aims to serve as a graduate-level textbook and a research monograph on high-dimensional statistics, sparsity and covariance learning, machine learning, and statistical inference. It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis. It then introduces multiple linear regression and expands the techniques of model building via nonparametric regression and kernel tricks. It provides a comprehensive account on sparsity explorations and model selections for multiple regression, generalized linear models, quantile regression, robust regression, hazards regression, among others. High-dimensional inference is also thoroughly addressed and so is feature screening. The book also provides a comprehensive account on high-dimensional covariance estimation, learning latent factors and hidden structures, as well as their applications to statistical estimation, inference, prediction and machine learning problems. It also introduces thoroughly statistical machine learning theory and methods for classification, clustering, and prediction. These include CART, random forests, boosting, support vector machines, clustering algorithms, sparse PCA, and deep learning.