Delta-4: A Generic Architecture for Dependable Distributed Computing
Title | Delta-4: A Generic Architecture for Dependable Distributed Computing PDF eBook |
Author | David Powell |
Publisher | Springer Science & Business Media |
Pages | 477 |
Release | 2012-12-06 |
Genre | Computers |
ISBN | 3642846963 |
Delta-4 is a 5-nation, 13-partner project that has been investigating the achievement of dependability in open distributed systems, including real-time systems. This book describes the design and validation of the distributed fault-tolerant architecture developed within this project. The key features of the Delta-4 architecture are: (a) a distributed object-oriented application support environment; (b) built-in support for user-transparent fault tolerance; (c) use of multicast or group communication protocols; and (d) use of standard off the-shelf processors and standard local area network technology with minimum specialized hardware. The book is organized as follows: The first 3 chapters give an overview of the architecture's objectives and of the architecture itself, and compare the proposed solutions with other approaches. Chapters 4 to 12 give a more detailed insight into the Delta-4 architectural concepts. Chapters 4 and 5 are devoted to providing a firm set of general concepts and terminology regarding dependable and real-time computing. Chapter 6 is centred on fault-tolerance techniques based on distribution. The description of the architecture itself commences with a description of the Delta-4 application support environment (Deltase) in chapter 7. Two variants of the architecture - the Delta-4 Open System Architecture (OSA) and the Delta-4 Extra Performance Architecture (XPA) - are described respectively in chapters 8 and 9. Both variants of the architecture have a common underlying basis for dependable multicasting, i. e.
Hardware and Software Architectures for Fault Tolerance
Title | Hardware and Software Architectures for Fault Tolerance PDF eBook |
Author | Michel Banatre |
Publisher | Springer Science & Business Media |
Pages | 332 |
Release | 1994-02-28 |
Genre | Computers |
ISBN | 9783540577676 |
Fault tolerance has been an active research area for many years. This volume presents papers from a workshop held in 1993 where a small number of key researchers and practitioners in the area met to discuss the experiences of industrial practitioners, to provide a perspective on the state of the art of fault tolerance research, to determine whether the subject is becoming mature, and to learn from the experiences so far in order to identify what might be important research topics for the coming years. The workshop provided a more intimate environment for discussions and presentations than usual at conferences. The papers in the volume were presented at the workshop, then updated and revised to reflect what was learned at the workshop.
Stabilization, Safety, and Security of Distributed Systems
Title | Stabilization, Safety, and Security of Distributed Systems PDF eBook |
Author | Ajoy K. Datta |
Publisher | Springer Science & Business Media |
Pages | 605 |
Release | 2007-06-27 |
Genre | Computers |
ISBN | 3540498230 |
This book constitutes the refereed proceedings of the 8th International Symposium on Stabilization, Safety, and Security of Distributed Systems, SSS 2006, held in Dallas, TX, USA in November 2006. The 36 revised full papers and 12 revised short papers presented together with the extended abstracts of 2 invited lectures address all aspects of self-stabilization, safety and security, recovery oriented systems and programming.
Distributed Systems for System Architects
Title | Distributed Systems for System Architects PDF eBook |
Author | Paulo Veríssimo |
Publisher | Springer Science & Business Media |
Pages | 636 |
Release | 2012-12-06 |
Genre | Computers |
ISBN | 1461516633 |
The primary audience for this book are advanced undergraduate students and graduate students. Computer architecture, as it happened in other fields such as electronics, evolved from the small to the large, that is, it left the realm of low-level hardware constructs, and gained new dimensions, as distributed systems became the keyword for system implementation. As such, the system architect, today, assembles pieces of hardware that are at least as large as a computer or a network router or a LAN hub, and assigns pieces of software that are self-contained, such as client or server programs, Java applets or pro tocol modules, to those hardware components. The freedom she/he now has, is tremendously challenging. The problems alas, have increased too. What was before mastered and tested carefully before a fully-fledged mainframe or a closely-coupled computer cluster came out on the market, is today left to the responsibility of computer engineers and scientists invested in the role of system architects, who fulfil this role on behalf of software vendors and in tegrators, add-value system developers, R&D institutes, and final users. As system complexity, size and diversity grow, so increases the probability of in consistency, unreliability, non responsiveness and insecurity, not to mention the management overhead. What System Architects Need to Know The insight such an architect must have includes but goes well beyond, the functional properties of distributed systems.
Guide to Reliable Distributed Systems
Title | Guide to Reliable Distributed Systems PDF eBook |
Author | Amy Elser |
Publisher | Springer Science & Business Media |
Pages | 733 |
Release | 2012-01-15 |
Genre | Computers |
ISBN | 1447124154 |
This book describes the key concepts, principles and implementation options for creating high-assurance cloud computing solutions. The guide starts with a broad technical overview and basic introduction to cloud computing, looking at the overall architecture of the cloud, client systems, the modern Internet and cloud computing data centers. It then delves into the core challenges of showing how reliability and fault-tolerance can be abstracted, how the resulting questions can be solved, and how the solutions can be leveraged to create a wide range of practical cloud applications. The author’s style is practical, and the guide should be readily understandable without any special background. Concrete examples are often drawn from real-world settings to illustrate key insights. Appendices show how the most important reliability models can be formalized, describe the API of the Isis2 platform, and offer more than 80 problems at varying levels of difficulty.
Reliable Distributed Systems
Title | Reliable Distributed Systems PDF eBook |
Author | Amy Elser |
Publisher | Springer Science & Business Media |
Pages | 685 |
Release | 2005-03-25 |
Genre | Computers |
ISBN | 0387215093 |
Explains fault tolerance in clear terms, with concrete examples drawn from real-world settings Highly practical focus aimed at building "mission-critical" networked applications that remain secure
Advances in Distributed Systems
Title | Advances in Distributed Systems PDF eBook |
Author | Sacha Krakowiak |
Publisher | Springer |
Pages | 517 |
Release | 2003-06-26 |
Genre | Computers |
ISBN | 3540464751 |
In 1992 we initiated a research project on large scale distributed computing systems (LSDCS). It was a collaborative project involving research institutes and universities in Bologna, Grenoble, Lausanne, Lisbon, Rennes, Rocquencourt, Newcastle, and Twente. The World Wide Web had recently been developed at CERN, but its use was not yet as common place as it is today and graphical browsers had yet to be developed. It was clear to us (and to just about everyone else) that LSDCS comprising several thousands to millions of individual computer systems (nodes) would be coming into existence as a consequence both of technological advances and the demands placed by applications. We were excited about the problems of building large distributed systems, and felt that serious rethinking of many of the existing computational paradigms, algorithms, and structuring principles for distributed computing was called for. In our research proposal, we summarized the problem domain as follows: “We expect LSDCS to exhibit great diversity of node and communications capability. Nodes will range from (mobile) laptop computers, workstations to supercomputers. Whereas mobile computers may well have unreliable, low bandwidth communications to the rest of the system, other parts of the system may well possess high bandwidth communications capability. To appreciate the problems posed by the sheer scale of a system comprising thousands of nodes, we observe that such systems will be rarely functioning in their entirety.