The First Discriminant Theory of Linearly Separable Data

The First Discriminant Theory of Linearly Separable Data
Title The First Discriminant Theory of Linearly Separable Data PDF eBook
Author Shuichi Shinmura
Publisher Springer Nature
Pages 373
Release
Genre
ISBN 9819994209

Download The First Discriminant Theory of Linearly Separable Data Book in PDF, Epub and Kindle

New Theory of Discriminant Analysis After R. Fisher

New Theory of Discriminant Analysis After R. Fisher
Title New Theory of Discriminant Analysis After R. Fisher PDF eBook
Author Shuichi Shinmura
Publisher Springer
Pages 221
Release 2016-12-27
Genre Mathematics
ISBN 9811021643

Download New Theory of Discriminant Analysis After R. Fisher Book in PDF, Epub and Kindle

This is the first book to compare eight LDFs by different types of datasets, such as Fisher’s iris data, medical data with collinearities, Swiss banknote data that is a linearly separable data (LSD), student pass/fail determination using student attributes, 18 pass/fail determinations using exam scores, Japanese automobile data, and six microarray datasets (the datasets) that are LSD. We developed the 100-fold cross-validation for the small sample method (Method 1) instead of the LOO method. We proposed a simple model selection procedure to choose the best model having minimum M2 and Revised IP-OLDF based on MNM criterion was found to be better than other M2s in the above datasets. We compared two statistical LDFs and six MP-based LDFs. Those were Fisher’s LDF, logistic regression, three SVMs, Revised IP-OLDF, and another two OLDFs. Only a hard-margin SVM (H-SVM) and Revised IP-OLDF could discriminate LSD theoretically (Problem 2). We solved the defect of the generalized inverse matrices (Problem 3). For more than 10 years, many researchers have struggled to analyze the microarray dataset that is LSD (Problem 5). If we call the linearly separable model "Matroska," the dataset consists of numerous smaller Matroskas in it. We develop the Matroska feature selection method (Method 2). It finds the surprising structure of the dataset that is the disjoint union of several small Matroskas. Our theory and methods reveal new facts of gene analysis.

Backpropagation

Backpropagation
Title Backpropagation PDF eBook
Author Yves Chauvin
Publisher Psychology Press
Pages 576
Release 2013-02-01
Genre Psychology
ISBN 1134775814

Download Backpropagation Book in PDF, Epub and Kindle

Composed of three sections, this book presents the most popular training algorithm for neural networks: backpropagation. The first section presents the theory and principles behind backpropagation as seen from different perspectives such as statistics, machine learning, and dynamical systems. The second presents a number of network architectures that may be designed to match the general concepts of Parallel Distributed Processing with backpropagation learning. Finally, the third section shows how these principles can be applied to a number of different fields related to the cognitive sciences, including control, speech recognition, robotics, image processing, and cognitive psychology. The volume is designed to provide both a solid theoretical foundation and a set of examples that show the versatility of the concepts. Useful to experts in the field, it should also be most helpful to students seeking to understand the basic principles of connectionist learning and to engineers wanting to add neural networks in general -- and backpropagation in particular -- to their set of problem-solving methods.

Big Data, Cloud Computing, and Data Science Engineering

Big Data, Cloud Computing, and Data Science Engineering
Title Big Data, Cloud Computing, and Data Science Engineering PDF eBook
Author Roger Lee
Publisher Springer
Pages 222
Release 2019-07-30
Genre Computers
ISBN 3030244059

Download Big Data, Cloud Computing, and Data Science Engineering Book in PDF, Epub and Kindle

This edited book presents the scientific outcomes of the 4th IEEE/ACIS International Conference on Big Data, Cloud Computing, Data Science & Engineering (BCD 2019) which was held on May 29–31, 2019 in Honolulu, Hawaii. The aim of the conference was to bring together researchers and scientists, businessmen and entrepreneurs, teachers, engineers, computer users and students to discuss the numerous fields of computer science and to share their experiences and exchange new ideas and information in a meaningful way. Presenting 15 of the conference’s most promising papers, the book discusses all aspects (theory, applications and tools) of computer and information science, the practical challenges encountered along the way, and the solutions adopted to solve them.

High-dimensional Microarray Data Analysis

High-dimensional Microarray Data Analysis
Title High-dimensional Microarray Data Analysis PDF eBook
Author Shuichi Shinmura
Publisher Springer
Pages 437
Release 2019-05-14
Genre Medical
ISBN 9811359989

Download High-dimensional Microarray Data Analysis Book in PDF, Epub and Kindle

This book shows how to decompose high-dimensional microarrays into small subspaces (Small Matryoshkas, SMs), statistically analyze them, and perform cancer gene diagnosis. The information is useful for genetic experts, anyone who analyzes genetic data, and students to use as practical textbooks. Discriminant analysis is the best approach for microarray consisting of normal and cancer classes. Microarrays are linearly separable data (LSD, Fact 3). However, because most linear discriminant function (LDF) cannot discriminate LSD theoretically and error rates are high, no one had discovered Fact 3 until now. Hard-margin SVM (H-SVM) and Revised IP-OLDF (RIP) can find Fact3 easily. LSD has the Matryoshka structure and is easily decomposed into many SMs (Fact 4). Because all SMs are small samples and LSD, statistical methods analyze SMs easily. However, useful results cannot be obtained. On the other hand, H-SVM and RIP can discriminate two classes in SM entirely. RatioSV is the ratio of SV distance and discriminant range. The maximum RatioSVs of six microarrays is over 11.67%. This fact shows that SV separates two classes by window width (11.67%). Such easy discrimination has been unresolved since 1970. The reason is revealed by facts presented here, so this book can be read and enjoyed like a mystery novel. Many studies point out that it is difficult to separate signal and noise in a high-dimensional gene space. However, the definition of the signal is not clear. Convincing evidence is presented that LSD is a signal. Statistical analysis of the genes contained in the SM cannot provide useful information, but it shows that the discriminant score (DS) discriminated by RIP or H-SVM is easily LSD. For example, the Alon microarray has 2,000 genes which can be divided into 66 SMs. If 66 DSs are used as variables, the result is a 66-dimensional data. These signal data can be analyzed to find malignancy indicators by principal component analysis and cluster analysis.

Operations Research and Enterprise Systems

Operations Research and Enterprise Systems
Title Operations Research and Enterprise Systems PDF eBook
Author Eric Pinson
Publisher Springer
Pages 308
Release 2015-04-16
Genre Computers
ISBN 3319175092

Download Operations Research and Enterprise Systems Book in PDF, Epub and Kindle

This book constitutes the refereed proceedings of the Third International Conference on Operations Research and Enterprise Systems, ICORES 2014, held in Angers, France, in March 2014. The 18 revised full papers presented together with an invited paper were carefully reviewed and selected from 96 submissions. The papers are organized in topical sections on Methodologies and Technologies and Applications.

Data Complexity in Pattern Recognition

Data Complexity in Pattern Recognition
Title Data Complexity in Pattern Recognition PDF eBook
Author Mitra Basu
Publisher Springer Science & Business Media
Pages 309
Release 2006-12-22
Genre Computers
ISBN 1846281725

Download Data Complexity in Pattern Recognition Book in PDF, Epub and Kindle

Automatic pattern recognition has uses in science and engineering, social sciences and finance. This book examines data complexity and its role in shaping theory and techniques across many disciplines, probing strengths and deficiencies of current classification techniques, and the algorithms that drive them. The book offers guidance on choosing pattern recognition classification techniques, and helps the reader set expectations for classification performance.