Genetic Algorithms as Tool for Statistical Analysis of High-Dimensional Data Structures

Genetic Algorithms as Tool for Statistical Analysis of High-Dimensional Data Structures
Title Genetic Algorithms as Tool for Statistical Analysis of High-Dimensional Data Structures PDF eBook
Author Rüdiger Krause
Publisher
Pages 0
Release 2004
Genre
ISBN 9783832506612

Download Genetic Algorithms as Tool for Statistical Analysis of High-Dimensional Data Structures Book in PDF, Epub and Kindle

In regression the objective is to determine an appropriate function which reflects reality as accurate as possible but also eliminates irregularities from data noise and is therefore easy to interpret. A popular and flexible approach for estimating the true underlying function is the additive model. One possible approach for fitting additive models is the expansion in B-splines which allows direct calculation of the estimators. If the number of B-splines is too large the estimated functions become wiggly and tend to be very close to the observed data. To avoid this problem of overfitting we use a penalization approach characterized by smoothing parameters. In this thesis we propose the use of genetic algorithms for smoothing parameter optimization. Genetic algorithms are rarely applied in the field of statistics and refer to the principle that better adapted individuals win against their competitors under equal conditions. Apart from smoothing parameter optimization the user often faces datasets containing large numbers of relevant and irrelevant explanatory variables. Appropriate variable selection approaches allow to reduce the number of variables to subsets of relevant variables. We propose to consider the problems of variable selection and choice of smoothing parameters simultaneously by using genetic algorithms. Our approach bases on an appropriate combination of the genetic algorithms for smoothing parameter optimization and variable selection.

Statistical Inference from High Dimensional Data

Statistical Inference from High Dimensional Data
Title Statistical Inference from High Dimensional Data PDF eBook
Author Carlos Fernandez-Lozano
Publisher MDPI
Pages 314
Release 2021-04-28
Genre Science
ISBN 3036509445

Download Statistical Inference from High Dimensional Data Book in PDF, Epub and Kindle

• Real-world problems can be high-dimensional, complex, and noisy • More data does not imply more information • Different approaches deal with the so-called curse of dimensionality to reduce irrelevant information • A process with multidimensional information is not necessarily easy to interpret nor process • In some real-world applications, the number of elements of a class is clearly lower than the other. The models tend to assume that the importance of the analysis belongs to the majority class and this is not usually the truth • The analysis of complex diseases such as cancer are focused on more-than-one dimensional omic data • The increasing amount of data thanks to the reduction of cost of the high-throughput experiments opens up a new era for integrative data-driven approaches • Entropy-based approaches are of interest to reduce the dimensionality of high-dimensional data

Genetic Algorithm Essentials

Genetic Algorithm Essentials
Title Genetic Algorithm Essentials PDF eBook
Author Oliver Kramer
Publisher Springer
Pages 94
Release 2017-01-07
Genre Technology & Engineering
ISBN 331952156X

Download Genetic Algorithm Essentials Book in PDF, Epub and Kindle

This book introduces readers to genetic algorithms (GAs) with an emphasis on making the concepts, algorithms, and applications discussed as easy to understand as possible. Further, it avoids a great deal of formalisms and thus opens the subject to a broader audience in comparison to manuscripts overloaded by notations and equations. The book is divided into three parts, the first of which provides an introduction to GAs, starting with basic concepts like evolutionary operators and continuing with an overview of strategies for tuning and controlling parameters. In turn, the second part focuses on solution space variants like multimodal, constrained, and multi-objective solution spaces. Lastly, the third part briefly introduces theoretical tools for GAs, the intersections and hybridizations with machine learning, and highlights selected promising applications.

Structural, Syntactic, and Statistical Pattern Recognition

Structural, Syntactic, and Statistical Pattern Recognition
Title Structural, Syntactic, and Statistical Pattern Recognition PDF eBook
Author International Association for Pattern Recognition
Publisher Springer Science & Business Media
Pages 884
Release 2002-07-24
Genre Computers
ISBN 3540440119

Download Structural, Syntactic, and Statistical Pattern Recognition Book in PDF, Epub and Kindle

This book constitutes the refereed proceedings of the 9th International Workshop on Structural and Syntctic Pattern Recognition, SSPR 2002 and the 4th International Workshop on Statistical Techniques in Pattern Recognition, SPR 2002 held jointly in Windsor, Ontario, Canada in August 2002. The 45 revised full papers and 35 poster papers presented together with three invited papers were carefully reviewed and selected from 116 submissions. The papers are organized in topical sections on graphs, grammars, and languages; graphs, strings, and grammars; documents and OCR; image shape analysis and application; density estimation and distribution models; multi classifiers and fusion; feature extraction and selection; general methodology; and image shape analysis and application.

Evolutionary Statistical Procedures

Evolutionary Statistical Procedures
Title Evolutionary Statistical Procedures PDF eBook
Author Roberto Baragona
Publisher Springer Science & Business Media
Pages 283
Release 2011-01-03
Genre Computers
ISBN 3642162185

Download Evolutionary Statistical Procedures Book in PDF, Epub and Kindle

This proposed text appears to be a good introduction to evolutionary computation for use in applied statistics research. The authors draw from a vast base of knowledge about the current literature in both the design of evolutionary algorithms and statistical techniques. Modern statistical research is on the threshold of solving increasingly complex problems in high dimensions, and the generalization of its methodology to parameters whose estimators do not follow mathematically simple distributions is underway. Many of these challenges involve optimizing functions for which analytic solutions are infeasible. Evolutionary algorithms represent a powerful and easily understood means of approximating the optimum value in a variety of settings. The proposed text seeks to guide readers through the crucial issues of optimization problems in statistical settings and the implementation of tailored methods (including both stand-alone evolutionary algorithms and hybrid crosses of these procedures with standard statistical algorithms like Metropolis-Hastings) in a variety of applications. This book would serve as an excellent reference work for statistical researchers at an advanced graduate level or beyond, particularly those with a strong background in computer science.

International Conference on Mechanism Science and Control Engineering (MSCE 2014)

International Conference on Mechanism Science and Control Engineering (MSCE 2014)
Title International Conference on Mechanism Science and Control Engineering (MSCE 2014) PDF eBook
Author
Publisher DEStech Publications, Inc
Pages 740
Release 2014-09-02
Genre Technology & Engineering
ISBN 1605951838

Download International Conference on Mechanism Science and Control Engineering (MSCE 2014) Book in PDF, Epub and Kindle

The aim of MSCE 2014 is to provide a platform for researchers, engineers, and academicians, as well as industrial professionals, to present their research results and development activities in mechanism science and control engineering. It provides opportunities for the delegates to exchange new ideas and application experiences, to establish business or research relations and to find global partners for future collaboration. MSCE2014 is conducted to all the researchers, engineers, industrial professionals and academicians, who are broadly welcomed to present their latest research results, academic developments or theory practice. Topics of interest include but are not limited to Mechanism theory and Application, Mechanical control and Automation Engineering, Mechanical Dynamics, Materials Processing and Control, Instruments and Vibration Control. It is of great pleasure to see the delegates exchanging ideas and establishing sound relationships on the conference.

Statistical Exploratory Analysis of Genetic Algorithms

Statistical Exploratory Analysis of Genetic Algorithms
Title Statistical Exploratory Analysis of Genetic Algorithms PDF eBook
Author Andrew Simon Timothy Czarn
Publisher
Pages 278
Release 2008
Genre Genetic algorithms
ISBN

Download Statistical Exploratory Analysis of Genetic Algorithms Book in PDF, Epub and Kindle

[Truncated abstract] Genetic algorithms (GAs) have been extensively used and studied in computer science, yet there is no generally accepted methodology for exploring which parameters significantly affect performance, whether there is any interaction between parameters and how performance varies with respect to changes in parameters. This thesis presents a rigorous yet practical statistical methodology for the exploratory study of GAs. This methodology addresses the issues of experimental design, blocking, power and response curve analysis. It details how statistical analysis may assist the investigator along the exploratory pathway. The statistical methodology is demonstrated in this thesis using a number of case studies with a classical genetic algorithm with one-point crossover and bit-replacement mutation. In doing so we answer a number of questions about the relationship between the performance of the GA and the operators and encoding used. The methodology is suitable, however, to be applied to other adaptive optimization algorithms not treated in this thesis. In the first instance, as an initial demonstration of our methodology, we describe case studies using four standard test functions. It is found that the effect upon performance of crossover is predominantly linear while the effect of mutation is predominantly quadratic. Higher order effects are noted but contribute less to overall behaviour. In the case of crossover both positive and negative gradients are found which suggests using rates as high as possible for some problems while possibly excluding it for others. .... This is illustrated by showing how the use of Gray codes impedes the performance on a lower modality test function compared with a higher modality test function. Computer animation is then used to illustrate the actual mechanism by which this occurs. Fourthly, the traditional concept of a GA is that of selection, crossover and mutation. However, a limited amount of data from the literature has suggested that the niche for the beneficial effect of crossover upon GA performance may be smaller than has traditionally been held. Based upon previous results on not-linear-separable problems an exploration is made by comparing two test problem suites, one comprising non-rotated functions and the other comprising the same functions rotated by 45 degrees in the solution space rendering them not-linear-separable. It is shown that for the difficult rotated functions the crossover operator is detrimental to the performance of the GA. It is conjectured that what makes a problem difficult for the GA is complex and involves factors such as the degree of optimization at local minima due to crossover, the bias associated with the mutation operator and the Hamming Distances present in the individual problems due to the encoding. Furthermore, the GA was tested on a real world landscape minimization problem to see if the results obtained would match those from the difficult rotated functions. It is demonstrated that they match and that the features which make certain of the test functions difficult are also present in the real world problem. Overall, the proposed methodology is found to be an effective tool for revealing relationships between a randomized optimization algorithm and its encoding and parameters that are difficult to establish from more ad-hoc experimental studies alone.