内容简介
本书是一本覆盖面广的专著,内容包含了有关集群计算的体系结构、网络、协议和I/O、进程调度、资源共享和负载平衡,以及目前典型的集群系统剖析。其中每章都是由该研究领域的名的专家撰写,因而具有高的学术价值和学术指导意义。 本书聚集了高性能集群计算领域中100多位的从业者所做出的贡献。实质上,对该领域中每一个与系统相关的关键问题本书都提供了的信息。在高性能并行计算领域中,无论您是一位开发者、研究者、管理员、教师、学生,还是一个管理者,本书都是一本难得的经典书籍。
目录
Ⅰ Requirements and General Issues 1 Cluster computing at a Glance 2 Cluster Setup and its Administration 3 Constructing Scalable Services 4 Dependable Clustered Computing 5 Deploying a High Throughput Computing Cluster 6 Performance Models and Simulation 7 metacomputing: Harnessing Informal Supercomputers 8 Specifying Resources and Services in metacomputing Systems Ⅱ Networking, Protocols, and I/O 9 High Speed Networks 10 Lightweight Messaging Systems 11 Active Messages 12 Xpress Transport Protocol 13 Congestion Management in ATM Clusters 14 Load Balancing Over Networks 15 Multiple Path Communication 16 Network RAM 17 Distributed Shared Memory 18 Parallel I/O for Clusters: Methodologies and Systems 19 Software RAID and Parallel Filesystems Ⅲ Process Scheduling, Load Sharing, and Balancing 20 Job and Resource Management Systems 21 Scheduling Parallel Jobs on Clusters 22 Load Sharing and Fault Tolerance Manager 23 Parallel Program Scheduling Techniques 24 Customized Dynamic Load Balancing 25 Mapping and Scheduling on Heterogeneous Systems Ⅳ Representative Cluster Systems 26 Beowulf 27 RWC PC Cluster Ⅱ and Score Cluster System Software 28 COMPaS: A Pentium Pro PC-based SMP Cluster 29 The NanOS Cluster Operating System 30 BSP-based Adaptive Parallel Processing 31 MARS: An Adaptive Parallel Programming Environment 32 The Gardens Approach to Adaptive Parallel Computing 33 The ParPar System: A Software MPP 34 Pitt Parallel Computer 35 The RS/6000 SP System: A Scalable Parallel Cluster 36 A Scalable and Highly Available Cluster Web Server Index