Book contents
- Frontmatter
- Dedication
- Contents
- Figures
- Tables
- Examples
- Preface
- 1 Introduction to GPU Kernels and Hardware
- 2 Thinking and Coding in Parallel
- 3 Warps and Cooperative Groups
- 4 Parallel Stencils
- 5 Textures
- 6 Monte Carlo Applications
- 7 Concurrency Using CUDA Streams and Events
- 8 Application to PET Scanners
- 9 Scaling Up
- 10 Tools for Profiling and Debugging
- 11 Tensor Cores
- Appendix A A Brief History of CUDA
- Appendix B Atomic Operations
- Appendix C The NVCC Compiler
- Appendix D AVX and the Intel Compiler
- Appendix E Number Formats
- Appendix F CUDA Documentation and Libraries
- Appendix G The CX Header Files
- Appendix H AI and Python
- Appendix I Topics in C++
- Index
1 - Introduction to GPU Kernels and Hardware
Published online by Cambridge University Press: 04 May 2022
Summary
The key to parallel programming is sharing a task between many cooperating threads running in parallel. A chart shows how, since 2003, the Moore's law growth in computing performance has come to depend on parallel computing. The chapter includes a simple introductory CUDA example that performs numerical integration using 1,000,000,000 threads, giving a speed-up of about 1000 compared with a single CPU thread. Key CUDA concepts, including thread blocks, thread grids and warps, are introduced. The hardware differences between conventional CPU architectures and GPUs are then discussed. Optimisations in memory caching on GPUs are also explained, as memory access time is often a key performance constraint. Finally, the use of OpenMP to share a single task across all cores of a multicore CPU is discussed.
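As a flavour of the integration example the summary refers to, here is a minimal CUDA sketch (not the book's actual listing): it estimates the integral of sin(x) over [0, π] (exact value 2) with the midpoint rule, one sample point per thread. The choice of integrand, the launch configuration and the atomicAdd reduction are illustrative assumptions made for brevity.

```cuda
// Minimal sketch of GPU numerical integration: estimate the integral
// of sin(x) over [0, pi] with the midpoint rule, one sample per thread.
#include <cstdio>

__global__ void integrate(float *sum, int steps, float step_size)
{
    // Unique index of this thread across the whole grid
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid < steps) {
        float x = (tid + 0.5f) * step_size;   // midpoint of sub-interval tid
        atomicAdd(sum, sinf(x) * step_size);  // accumulate this strip's area
    }
}

int main()
{
    const int   steps     = 1 << 24;             // ~16.8 million sample points
    const float step_size = 3.14159265f / steps;

    float *d_sum;
    cudaMalloc(&d_sum, sizeof(float));
    cudaMemset(d_sum, 0, sizeof(float));

    const int threads = 256;                             // threads per block
    const int blocks  = (steps + threads - 1) / threads; // blocks in the grid
    integrate<<<blocks, threads>>>(d_sum, steps, step_size);

    float h_sum = 0.0f;
    cudaMemcpy(&h_sum, d_sum, sizeof(float), cudaMemcpyDeviceToHost);
    printf("integral ~ %f (exact 2)\n", h_sum);
    cudaFree(d_sum);
    return 0;
}
```

A tuned version would replace the single global atomicAdd with per-block reductions in shared memory; that kind of thread-block and warp-level cooperation is exactly what the chapter goes on to introduce.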
- Type: Chapter
- Information: Programming in Parallel with CUDA: A Practical Guide, pp. 1–21. Publisher: Cambridge University Press. Print publication year: 2022