Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Previous Chapter

Table of Contents

List of Contributors

Acknowledgments

Part 1: Programming and Tools

Chapter 1: Formal analysis techniques for reliable GPU programming: current solutions and call to action

Acknowledgments

1 GPUs in Support of Parallel Computing

2 A quick introduction to GPUs

3 Correctness issues in GPU programming

4 The need for effective tools

5 Call to Action

Chapter 2: SnuCL: A unified OpenCL framework for heterogeneous clusters

Acknowledgments

3 Overview of SnuCL framework

4 Memory management in SnuCL Cluster

5 SnuCL extensions to OpenCL

6 Performance evaluation

Chapter 3: Thread communication and synchronization on massively parallel GPUs

2 Coarse-Grained Communication and Synchronization

3 Built-In Atomic Functions on Regular Variables

4 Fine-Grained Communication and Synchronization

5 Conclusion and Future Research Direction

Chapter 4: Software-level task scheduling on GPUs

Acknowledgments

1 Introduction, Problem Statement, and Context

2 Nondeterministic behaviors caused by the hardware

3 SM-centric transformation

4 Scheduling-enabled optimizations

5 Other scheduling work on GPUs

6 Conclusion and future work

Chapter 5: Data placement on GPUs

3 Memory specification through MSL

4 Compiler support

5 Runtime support

Part 2: Algorithms and Applications

Chapter 6: Biological sequence analysis on GPU

2 Pairwise Sequence Comparison and Sequence-Profile Comparison

3 Design aspects of GPU solutions for biological sequence analysis

4 GPU Solutions for Pairwise Sequence Comparison

5 GPU Solutions for Sequence-Profile Comparison

6 Conclusion and perspectives

Chapter 7: Graph algorithms on GPUs

1 Graph representation for GPUs

2 Graph traversal algorithms: the breadth first search (BFS)

3 The single-source shortest path (SSSP) problem

4 The APSP problem

5 Load Balancing and Memory Accesses: Issues and Management Techniques

Chapter 8: GPU alignment of two and three sequences

2 GPU architecture

3 Pairwise alignment

4 Alignment of three sequences

Chapter 9: Augmented Block Cimmino Distributed Algorithm for solving tridiagonal systems on GPU

2 ABCD Solver for tridiagonal systems

3 GPU implementation and optimization

4 Performance evaluation

5 Conclusion and future work

Chapter 10: GPU computing applied to linear and mixed-integer programming

Acknowledgments

2 Operations Research in Practice

3 Exact Optimization Algorithms

4 Metaheuristics

Conflicts of Interest

Chapter 11: GPU-accelerated shortest paths computations for planar graphs

3 Partitioned Approaches

4 Computational Complexity Analysis

5 Experiments and results

About the Authors

Chapter 12: GPU sorting algorithms

2 Generic Programming Strategies for GPU

3 Sorting algorithms

Chapter 13: MPC: An effective floating-point compression algorithm for GPUs

Acknowledgments

3 Experimental results

4 Summary and Conclusions

Chapter 14: Adaptive sparse matrix representation for efficient matrix-vector multiplication

2 Sparse matrix-vector multiplication

3 GPU architecture and programming model

4 Optimization principles for SpMV

5 Platform (Adaptive Runtime System)

6 Results and analysis

Part 3: Architecture and Performance

Chapter 15: A framework for accelerating bottlenecks in GPU execution with assist warps

Acknowledgments

4 The CABA Framework

5 A Case for CABA: Data Compression

8 Other Uses of the CABA Framework

Chapter 16: Accelerating GPU accelerators through neural algorithmic transformation

2 Neural transformation for GPUs

3 Instruction-set-architecture design

4 Neural accelerator: design and integration

5 Controlling quality trade-offs

Chapter 17: The need for heterogeneous network-on-chip architectures with GPGPUs: A case study with photonic interconnects

3 The Need for Heterogeneous Interconnections

4 Characterization of GPGPU Performance

Chapter 18: Accurately modeling GPGPU frequency scaling with the CRISP performance model

Acknowledgments

2 Motivation and related work

3 GPGPU DVFS performance model

Part 4: Power and Reliability

Chapter 19: Energy and power considerations of GPUs

2 Evaluation methodology

3 Power profiling of regular and irregular programs

4 Affecting power and energy on GPUs

About the authors

Chapter 20: Architecting the last-level cache for GPUs using STT-MRAM nonvolatile memory

4 Two-Part L2 Cache Architecture

5 Dynamic Write Threshold Detection Mechanism

6 Implementation

7 Evaluation Result

Chapter 21: Power management of mobile GPUs

Acknowledgments

2 GPU Power Management for Mobile Games

3 GPU Power Management for GPGPU Applications

4 Future Outlook

Chapter 22: Advances in GPU reliability research

2 Evaluating GPU Reliability

3 Hardware Reliability Enhancements

4 Software Reliability Enhancements

Chapter 23: Addressing hardware reliability challenges in general-purpose GPUs

2 GPGPUs Architecture

3 Modeling and Characterizing GPGPUs Reliability in the Presence of Soft Errors [25]

4 RISE: Improving the Streaming Processors’ Reliability Against Soft Errors in GPGPUs [36]

5 Mitigating the Susceptibility of GPGPUs to PVs [43]

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

3.147.205.154