Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU)
Title | Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU) PDF eBook |
Author | Hyesoon Kim |
Publisher | Springer Nature |
Pages | 88 |
Release | 2022-05-31 |
Genre | Technology & Engineering |
ISBN | 3031017374 |
General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems of today, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread contexts vs. tens), a return to wide vector units (several tens vs. 1-10), memory architectures that deliver higher peak memory bandwidth (hundreds of gigabytes per second vs. tens), and smaller caches/scratchpad memories (less than 1 megabyte vs. 1-10 megabytes). In this book, we provide a high-level overview of current GPGPU architectures and programming models. We review the principles that are used in previous shared memory parallel platforms, focusing on recent results in both the theory and practice of parallel algorithms, and suggest a connection to GPGPU platforms. We aim to provide hints to architects about understanding algorithm aspect to GPGPU. We also provide detailed performance analysis and guide optimizations from high-level algorithms to low-level instruction level optimizations. As a case study, we use n-body particle simulations known as the fast multipole method (FMM) as an example. We also briefly survey the state-of-the-art in GPU performance analysis tools and techniques. Table of Contents: GPU Design, Programming, and Trends / Performance Principles / From Principles to Practice: Analysis and Tuning / Using Detailed Performance Analysis to Guide Optimization
Handbook of Parallel Constraint Reasoning
Title | Handbook of Parallel Constraint Reasoning PDF eBook |
Author | Youssef Hamadi |
Publisher | Springer |
Pages | 687 |
Release | 2018-04-05 |
Genre | Computers |
ISBN | 3319635166 |
This is the first book presenting a broad overview of parallelism in constraint-based reasoning formalisms. In recent years, an increasing number of contributions have been made on scaling constraint reasoning thanks to parallel architectures. The goal in this book is to overview these achievements in a concise way, assuming the reader is familiar with the classical, sequential background. It presents work demonstrating the use of multiple resources from single machine multi-core and GPU-based computations to very large scale distributed execution platforms up to 80,000 processing units. The contributions in the book cover the most important and recent contributions in parallel propositional satisfiability (SAT), maximum satisfiability (MaxSAT), quantified Boolean formulas (QBF), satisfiability modulo theory (SMT), theorem proving (TP), answer set programming (ASP), mixed integer linear programming (MILP), constraint programming (CP), stochastic local search (SLS), optimal path finding with A*, model checking for linear-time temporal logic (MC/LTL), binary decision diagrams (BDD), and model-based diagnosis (MBD). The book is suitable for researchers, graduate students, advanced undergraduates, and practitioners who wish to learn about the state of the art in parallel constraint reasoning.
Concurrent Programming: Algorithms, Principles, and Foundations
Title | Concurrent Programming: Algorithms, Principles, and Foundations PDF eBook |
Author | Michel Raynal |
Publisher | Springer Science & Business Media |
Pages | 530 |
Release | 2012-12-30 |
Genre | Computers |
ISBN | 3642320279 |
This book is devoted to the most difficult part of concurrent programming, namely synchronization concepts, techniques and principles when the cooperating entities are asynchronous, communicate through a shared memory, and may experience failures. Synchronization is no longer a set of tricks but, due to research results in recent decades, it relies today on sane scientific foundations as explained in this book. In this book the author explains synchronization and the implementation of concurrent objects, presenting in a uniform and comprehensive way the major theoretical and practical results of the past 30 years. Among the key features of the book are a new look at lock-based synchronization (mutual exclusion, semaphores, monitors, path expressions); an introduction to the atomicity consistency criterion and its properties and a specific chapter on transactional memory; an introduction to mutex-freedom and associated progress conditions such as obstruction-freedom and wait-freedom; a presentation of Lamport's hierarchy of safe, regular and atomic registers and associated wait-free constructions; a description of numerous wait-free constructions of concurrent objects (queues, stacks, weak counters, snapshot objects, renaming objects, etc.); a presentation of the computability power of concurrent objects including the notions of universal construction, consensus number and the associated Herlihy's hierarchy; and a survey of failure detector-based constructions of consensus objects. The book is suitable for advanced undergraduate students and graduate students in computer science or computer engineering, graduate students in mathematics interested in the foundations of process synchronization, and practitioners and engineers who need to produce correct concurrent software. The reader should have a basic knowledge of algorithms and operating systems.
Advances in GPU Research and Practice
Title | Advances in GPU Research and Practice PDF eBook |
Author | Hamid Sarbazi-Azad |
Publisher | Morgan Kaufmann |
Pages | 776 |
Release | 2016-09-15 |
Genre | Computers |
ISBN | 0128037881 |
Advances in GPU Research and Practice focuses on research and practices in GPU based systems. The topics treated cover a range of issues, ranging from hardware and architectural issues, to high level issues, such as application systems, parallel programming, middleware, and power and energy issues. Divided into six parts, this edited volume provides the latest research on GPU computing. Part I: Architectural Solutions focuses on the architectural topics that improve on performance of GPUs, Part II: System Software discusses OS, compilers, libraries, programming environment, languages, and paradigms that are proposed and analyzed to help and support GPU programmers. Part III: Power and Reliability Issues covers different aspects of energy, power, and reliability concerns in GPUs. Part IV: Performance Analysis illustrates mathematical and analytical techniques to predict different performance metrics in GPUs. Part V: Algorithms presents how to design efficient algorithms and analyze their complexity for GPUs. Part VI: Applications and Related Topics provides use cases and examples of how GPUs are used across many sectors. - Discusses how to maximize power and obtain peak reliability when designing, building, and using GPUs - Covers system software (OS, compilers), programming environments, languages, and paradigms proposed to help and support GPU programmers - Explains how to use mathematical and analytical techniques to predict different performance metrics in GPUs - Illustrates the design of efficient GPU algorithms in areas such as bioinformatics, complex systems, social networks, and cryptography - Provides applications and use case scenarios in several different verticals, including medicine, social sciences, image processing, and telecommunications
Languages and Compilers for Parallel Computing
Title | Languages and Compilers for Parallel Computing PDF eBook |
Author | Călin Cașcaval |
Publisher | Springer |
Pages | 364 |
Release | 2014-09-30 |
Genre | Computers |
ISBN | 3319099671 |
This book constitutes the thoroughly refereed post-conference proceedings of the 26th International Workshop on Languages and Compilers for Parallel Computing, LCPC 2013, held in Tokyo, Japan, in September 2012. The 20 revised full papers and two keynote papers presented were carefully reviewed and selected from 44 submissions. The focus of the papers is on following topics: parallel programming models, compiler analysis techniques, parallel data structures and parallel execution models, to GPGPU and other heterogeneous execution models, code generation for power efficiency on mobile platforms, and debugging and fault tolerance for parallel systems.
Compiler Construction
Title | Compiler Construction PDF eBook |
Author | Björn Franke |
Publisher | Springer |
Pages | 258 |
Release | 2015-03-31 |
Genre | Computers |
ISBN | 3662466635 |
This book constitutes the proceedings of the 24th International Conference on Compiler Construction, CC 2015, held as part of the European Joint Conferences on Theory and Practice of Software, ETAPS 2015, in London, UK, in April 2015. The 11 papers presented in this volume were carefully reviewed and selected from 34 submissions. They deal with compiler engineering and compiling techniques; compiler analysis and optimisation and formal techniques in compilers. The book also contains one invited talk in full-paper length.
Distributed Computing and Networking
Title | Distributed Computing and Networking PDF eBook |
Author | Marcos K. Aguilera |
Publisher | Springer Science & Business Media |
Pages | 434 |
Release | 2011-03-09 |
Genre | Computers |
ISBN | 364217678X |
This book constitutes the refereed proceedings of the 12th International Conference on Distributed Computing and Networking, ICDCN 2011, held in Bangalore, India, during January 2-5, 2011. The 31 revised full papers and 3 revised short papers presented together with 3 invited lectures were carefully reviewed and selected from 140 submissions. The papers address all current issues in the field of distributed computing and networking. Being a leading forum for researchers and practitioners to exchange ideas and share best practices, ICDCN also serves as a forum for PhD students to share their research ideas and get quality feedback from the well-renowned experts in the field.