Multimodal Interaction in Image and Video Applications
Title | Multimodal Interaction in Image and Video Applications PDF eBook |
Author | Angel D. Sappa |
Publisher | Springer Science & Business Media |
Pages | 209 |
Release | 2013-01-11 |
Genre | Technology & Engineering |
ISBN | 3642359329 |
Traditional Pattern Recognition (PR) and Computer Vision (CV) technologies have mainly focused on full automation, even though full automation often proves elusive or unnatural in many applications, where the technology is expected to assist rather than replace the human agents. Moreover, not all problems can be solved automatically; for some applications, human interaction is the only way to tackle them. Recently, multimodal human interaction has become a field of increasing interest in the research community. Advanced man-machine interfaces with high cognitive capabilities are a hot research topic that aims at solving challenging problems in image and video applications. In fact, the idea of interactive computer systems was already proposed in the early stages of computer science. Nowadays, the ubiquity of image sensors, together with ever-increasing computing performance, has opened new and challenging opportunities for research in multimodal human interaction. This book aims to show how existing PR and CV technologies can naturally evolve using this new paradigm. The chapters of this book present successful case studies of multimodal interactive technologies for both image and video applications. They cover a wide spectrum of applications, ranging from interactive handwriting transcription to human-robot interaction in real environments.
Multimodal Processing and Interaction
Title | Multimodal Processing and Interaction PDF eBook |
Author | Petros Maragos |
Publisher | Springer Science & Business Media |
Pages | 380 |
Release | 2008-12-16 |
Genre | Computers |
ISBN | 0387763163 |
This volume presents high-quality, state-of-the-art research ideas and results from theoretic, algorithmic and application viewpoints. It contains contributions by leading experts in the ubiquitous scientific and technological field of multimedia. The book focuses on interaction with multimedia content, with special emphasis on multimodal interfaces for accessing multimedia information. The book is designed for a professional audience composed of practitioners and researchers in industry. It is also suitable for advanced-level students in computer science.
Multimodal Signal Processing
Title | Multimodal Signal Processing PDF eBook |
Author | Jean-Philippe Thiran |
Publisher | Academic Press |
Pages | 343 |
Release | 2009-11-11 |
Genre | Computers |
ISBN | 0080888690 |
Multimodal signal processing is an important research and development field that processes signals and combines information from a variety of modalities (speech, vision, language, text), significantly enhancing the understanding, modelling, and performance of human-computer interaction devices or systems enhancing human-human communication. The overarching theme of this book is the application of signal processing and statistical machine learning techniques to problems arising in this multi-disciplinary field. It describes the capabilities and limitations of current technologies, and discusses the technical challenges that must be overcome to develop efficient and user-friendly multimodal interactive systems. With contributions from the leading experts in the field, the present book should serve as a reference in multimodal signal processing for signal processing researchers, graduate students, R&D engineers, and computer engineers who are interested in this emerging field.
- Presents state-of-the-art methods for multimodal signal processing, analysis, and modeling
- Contains numerous examples of systems with different modalities combined
- Describes advanced applications in multimodal Human-Computer Interaction (HCI) as well as in computer-based analysis and modelling of multimodal human-human communication scenes
Multimodal Scene Understanding
Title | Multimodal Scene Understanding PDF eBook |
Author | Michael Ying Yang |
Publisher | Academic Press |
Pages | 424 |
Release | 2019-07-16 |
Genre | Technology & Engineering |
ISBN | 0128173599 |
Multimodal Scene Understanding: Algorithms, Applications and Deep Learning presents recent advances in multi-modal computing, with a focus on computer vision and photogrammetry. It provides the latest algorithms and applications that involve combining multiple sources of information and describes the role and approaches of multi-sensory data and multi-modal deep learning. The book is ideal for researchers from the fields of computer vision, remote sensing, robotics, and photogrammetry, thus helping foster interdisciplinary interaction and collaboration between these realms. Researchers collecting and analyzing multi-sensory data collections (for example, the KITTI benchmark, which combines stereo and laser data) from different platforms, such as autonomous vehicles, surveillance cameras, UAVs, planes and satellites, will find this book very useful.
- Contains state-of-the-art developments on multi-modal computing
- Focuses on algorithms and applications
- Presents novel deep learning topics on multi-sensor fusion and multi-modal deep learning
Machine Learning for Multimodal Interaction
Title | Machine Learning for Multimodal Interaction PDF eBook |
Author | Andrei Popescu-Belis |
Publisher | Springer |
Pages | 318 |
Release | 2008-02-22 |
Genre | Computers |
ISBN | 3540781552 |
This book constitutes the thoroughly refereed post-proceedings of the 4th International Workshop on Machine Learning for Multimodal Interaction, MLMI 2007, held in Brno, Czech Republic, in June 2007. The 25 revised full papers presented together with 1 invited paper were carefully selected during two rounds of reviewing and revision from 60 workshop presentations. The papers are organized in topical sections on multimodal processing, HCI, user studies and applications, image and video processing, discourse and dialogue processing, speech and audio processing, as well as the PASCAL speech separation challenge.
Multimodal Human Computer Interaction and Pervasive Services
Title | Multimodal Human Computer Interaction and Pervasive Services PDF eBook |
Author | Patrizia Grifoni |
Publisher | IGI Global |
Pages | 537 |
Release | 2009-05-31 |
Genre | Computers |
ISBN | 1605663875 |
"This book provides concepts, methodologies, and applications used to design and develop multimodal systems"--Provided by publisher.
Multimedia Image and Video Processing
Title | Multimedia Image and Video Processing PDF eBook |
Author | Ling Guan |
Publisher | CRC Press |
Pages | 1064 |
Release | 2017-12-19 |
Genre | Technology & Engineering |
ISBN | 1351833650 |
As multimedia applications have become part of contemporary daily life, numerous paradigm-shifting technologies in multimedia processing have emerged over the last decade. Substantially updated with 21 new chapters, Multimedia Image and Video Processing, Second Edition explores the most recent advances in multimedia research and applications. This edition presents a comprehensive treatment of multimedia information mining, security, systems, coding, search, hardware, and communications as well as multimodal information fusion and interaction. Clearly divided into seven parts, the book begins with a section on standards, fundamental methods, design issues, and typical architectures. It then focuses on the coding of video and multimedia content before covering multimedia search, retrieval, and management. After examining multimedia security, the book describes multimedia communications and networking and explains the architecture design and implementation for multimedia image and video processing. It concludes with a section on multimedia systems and applications. Written by some of the most prominent experts in the field, this updated edition provides readers with the latest research in multimedia processing and equips them with advanced techniques for the design of multimedia systems.