O'Reilly logo

Heterogeneous Computing with OpenCL by Dana Schaa, Perhaad Mistry, David R. Kaeli, Lee Howes, Benedict Gaster

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 6. Dissecting a CPU/GPU OpenCL Implementation
This chapter discusses a specific mapping of OpenCL to the combination of a Phenom II CPU and a Radeon 6970 GPU. The aim is to show how OpenCL's model maps in a specific case to give the reader some context about its execution on real hardware. We also discuss some optimizations necessary for efficient execution on such hardware.
Keywords APU, CPU, GPU, optimization, Phenom, Radeon

Introduction

In Chapter 3, we discussed trade-offs present in different architectures, many of which support the execution of OpenCL programs. The design of OpenCL is such that the model maps capably to a wide range of architectures, allowing for tuning and acceleration of kernel code. In this chapter, we discuss OpenCL's ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required