Active Learning and Submodular Functions

Active Learning and Submodular Functions
Title Active Learning and Submodular Functions PDF eBook
Author Andrew Guillory
Publisher
Pages 128
Release 2012
Genre Submodular functions
ISBN

Download Active Learning and Submodular Functions Book in PDF, Epub and Kindle

Active learning is a machine learning setting where the learning algorithm decides what data is labeled. Submodular functions are a class of set functions for which many optimization problems have efficient exact or approximate algorithms. We examine their connections. 1. We propose a new class of interactive submodular optimization problems which connect and generalize submodular optimization and active learning over a finite query set. We derive greedy algorithms with approximately optimal worst-case cost. These analyses apply to exact learning, approximate learning, learning in the presence of adversarial noise, and applications that mix learning and covering. 2. We consider active learning in a batch, transductive setting where the learning algorithm selects a set of examples to be labeled at once. In this setting we derive new error bounds which use symmetric submodular functions for regularization, and we give algorithms which approximately minimize these bounds. 3. We consider a repeated active learning setting where the learning algorithm solves a sequence of related learning problems. We propose an approach to this problem based on a new online prediction version of submodular set cover. A common theme in these results is the use of tools from submodular optimization to extend the breadth and depth of learning theory with an emphasis on non-stochastic settings.

A Submodular Optimization Framework for Never-ending Learning

A Submodular Optimization Framework for Never-ending Learning
Title A Submodular Optimization Framework for Never-ending Learning PDF eBook
Author Wael Emara
Publisher
Pages 0
Release 2012
Genre Data mining
ISBN

Download A Submodular Optimization Framework for Never-ending Learning Book in PDF, Epub and Kindle

The revolution in information technology and the explosion in the use of computing devices in people's everyday activities has forever changed the perspective of the data mining and machine learning fields. The enormous amounts of easily accessible, information rich data is pushing the data analysis community in general towards a shift of paradigm. In the new paradigm, data comes in the form a stream of billions of records received everyday. The dynamic nature of the data and its sheer size makes it impossible to use the traditional notion of offline learning where the whole data is accessible at any time point. Moreover, no amount of human resources is enough to get expert feedback on the data. In this work we have developed a unified optimization based learning framework that approaches many of the challenges mentioned earlier. Specifically, we developed a Never-Ending Learning framework which combines incremental/online, semi-supervised, and active learning under a unified optimization framework. The established framework is based on the class of submodular optimization methods. At the core of this work we provide a novel formulation of the Semi-Supervised Support Vector Machines (S3VM) in terms of submodular set functions. The new formulation overcomes the non-convexity issues of the S3VM and provides a state of the art solution that is orders of magnitude faster than the cutting edge algorithms in the literature. Next, we provide a stream summarization technique via exemplar selection. This technique makes it possible to keep a fixed size exemplar representation of a data stream that can be used by any label propagation based semi-supervised learning technique. The compact data steam representation allows a wide range of algorithms to be extended to incremental/online learning scenario. Under the same optimization framework, we provide an active learning algorithm that constitute the feedback between the learning machine and an oracle. Finally, the developed Never-Ending Learning framework is essentially transductive in nature. Therefore, our last contribution is an inductive incremental learning technique for incremental training of SVM using the properties of local kernels. We demonstrated through this work the importance and wide applicability of the proposed methodologies.

Active Learning

Active Learning
Title Active Learning PDF eBook
Author Burr Chen
Publisher Springer Nature
Pages 100
Release 2022-05-31
Genre Computers
ISBN 3031015606

Download Active Learning Book in PDF, Epub and Kindle

The key idea behind active learning is that a machine learning algorithm can perform better with less training if it is allowed to choose the data from which it learns. An active learner may pose "queries," usually in the form of unlabeled data instances to be labeled by an "oracle" (e.g., a human annotator) that already understands the nature of the problem. This sort of approach is well-motivated in many modern machine learning and data mining applications, where unlabeled data may be abundant or easy to come by, but training labels are difficult, time-consuming, or expensive to obtain. This book is a general introduction to active learning. It outlines several scenarios in which queries might be formulated, and details many query selection algorithms which have been organized into four broad categories, or "query selection frameworks." We also touch on some of the theoretical foundations of active learning, and conclude with an overview of the strengths and weaknesses of these approaches in practice, including a summary of ongoing work to address these open challenges and opportunities. Table of Contents: Automating Inquiry / Uncertainty Sampling / Searching Through the Hypothesis Space / Minimizing Expected Error and Variance / Exploiting Structure in Data / Theory / Practical Considerations

Learning with Submodular Functions

Learning with Submodular Functions
Title Learning with Submodular Functions PDF eBook
Author Francis Bach
Publisher
Pages 228
Release 2013
Genre Convex functions
ISBN 9781601987570

Download Learning with Submodular Functions Book in PDF, Epub and Kindle

Submodular functions are relevant to machine learning for at least two reasons: (1) some problems may be expressed directly as the optimization of submodular functions and (2) the Lovász extension of submodular functions provides a useful set of regularization functions for supervised and unsupervised learning. In this monograph, we present the theory of submodular functions from a convex analysis perspective, presenting tight links between certain polyhedra, combinatorial optimization and convex optimization problems. In particular, we show how submodular function minimization is equivalent to solving a wide variety of convex optimization problems. This allows the derivation of new efficient algorithms for approximate and exact submodular function minimization with theoretical guarantees and good practical performance. By listing many examples of submodular functions, we review various applications to machine learning, such as clustering, experimental design, sensor placement, graphical model structure learning or subset selection, as well as a family of structured sparsity-inducing norms that can be derived and used from submodular functions.

Document Analysis and Recognition - ICDAR 2023

Document Analysis and Recognition - ICDAR 2023
Title Document Analysis and Recognition - ICDAR 2023 PDF eBook
Author Gernot A. Fink
Publisher Springer Nature
Pages 568
Release 2023-08-18
Genre Computers
ISBN 3031417348

Download Document Analysis and Recognition - ICDAR 2023 Book in PDF, Epub and Kindle

This six-volume set of LNCS 14187, 14188, 14189, 14190, 14191 and 14192 constitutes the refereed proceedings of the 17th International Conference on Document Analysis and Recognition, ICDAR 2021, held in San José, CA, USA, in August 2023. The 53 full papers were carefully reviewed and selected from 316 submissions, and are presented with 101 poster presentations. The papers are organized into the following topical sections: Graphics Recognition, Frontiers in Handwriting Recognition, Document Analysis and Recognition.

Medical Image Computing and Computer-Assisted Intervention - MICCAI 2014

Medical Image Computing and Computer-Assisted Intervention - MICCAI 2014
Title Medical Image Computing and Computer-Assisted Intervention - MICCAI 2014 PDF eBook
Author Polina Golland
Publisher Springer
Pages 460
Release 2014-08-31
Genre Computers
ISBN 3319104438

Download Medical Image Computing and Computer-Assisted Intervention - MICCAI 2014 Book in PDF, Epub and Kindle

The three-volume set LNCS 8673, 8674, and 8675 constitutes the refereed proceedings of the 17th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2014, held in Boston, MA, USA, in September 2014. Based on rigorous peer reviews, the program committee carefully selected 253 revised papers from 862 submissions for presentation in three volumes. The 53 papers included in the third volume have been organized in the following topical sections: shape and population analysis; brain; diffusion MRI; and machine learning.

Tractability

Tractability
Title Tractability PDF eBook
Author Lucas Bordeaux
Publisher Cambridge University Press
Pages 401
Release 2014-02-06
Genre Computers
ISBN 1107025192

Download Tractability Book in PDF, Epub and Kindle

An overview of the techniques developed to circumvent computational intractability, a key challenge in many areas of computer science.