AI Training

Fundamentals of Accelerated Computing with CUDA Python Training

BONUS! LEXX Online Subscription Included: All LEXX Live students receive complimentary ninety (90) day access to the LEXX Online learning platform, which hosts hundreds of expert asynchronous training courses in Cybersecurity, IT, Soft Skills, and Management and more!

Course Overview

In this one-day, instructor-led CUDA course in Washington, DC Metro, Tysons Corner, VA, Columbia, MD or Live Online, students will learn the fundamental tools and techniques for running GPU-accelerated Python applications using CUDA® GPUs and the Numba compiler. Participants will work though dozens of hands-on coding exercises and, at the end of the training, implement a new workflow to accelerate a fully functional linear algebra program originally designed for CPUs, observing impressive performance gains. After taking this course, learners will be able to:

GPU-accelerate NumPy ufuncs with a few lines of code.
Configure code parallelization using the CUDA thread hierarchy.
Write custom CUDA device kernels for maximum performance and flexibility.
Use memory coalescing and on-device shared memory to increase CUDA kernel bandwidth.

Schedule

Currently, there are no public classes scheduled. Please contact a LEXX LIVETraining Consultant to discuss hosting a private class at 301-258-8200.

Program Level

Beginner

Prerequisites

All learners are expected to have:

Basic Python competency, including familiarity with variable types, loops, conditional statements, functions, and array manipulations
NumPy competency, including the use of ndarrays and ufuncs
No previous knowledge of CUDA programming is required

Course Outline

Module 1: Introduction

Module 2: Introduction to CUDA Python with Numba

Begin working with the Numba compiler and CUDA programming in Python.
Use Numba decorators to GPU-accelerate numerical Python functions.
Optimize host-to-device and device-to-host memory transfers.

• Module 3: Custom CUDA Kernels in Python with Numba

Learn CUDA’s parallel thread hierarchy and how to extend parallel program possibilities.
Launch massively parallel custom CUDA kernels on the GPU.
Utilize CUDA atomic operations to avoid race conditions during parallel execution.

• Module 4: Multidimensional Grids, and Shared Memory for CUDA Python with Numba

Learn multidimensional grid creation and how to work in parallel on 2D matrices.
Leverage on-device shared memory to promote memory coalescing while reshaping 2D matrices.

• Module 5: Final Review

+ View More

LEXX Live is registered with the National Association of State Boards of Accountancy (NASBA) as a sponsor of continuing professional education on the National Registry of CPE Sponsors. State boards of accountancy have final authority on the acceptance of individual courses for CPE credit. Complaints re-garding registered sponsors may be submitted to the National Registry of CPE Sponsors through its web site: www.nasbaregistry.org

Starting at $500

ATTENTION

For GSA pricing or Contractor quotes call 301-258-8200 – Option 4

Download/Print Brochure

Lock In Our Best Price Ever!

Complete the form below to get our best price ever for this course!

"*" indicates required fields

Price Match Guarantee

We’ll match any competitor’s price quote. Call 301-258-8200 Option 4.

Brand

Topic

CCA - Certified CMMC Assessor

CRISC (Certified in Risk and Information Systems Control)

CCP - Certified CMMC Professional

CGRC (Certified in Governance, Risk and Compliance)

Lean Six Sigma Black Belt Bootcamp

CCISO - Certified Chief Information Security Officer (CCISO)

CGRC (Certified in Governance, Risk and Compliance)

CISM (Certified Information Security Manager)

AI+ Security Level 3

AI+ Security Compliance

AI+ Architect

Microsoft 365 Excel Level 3

AI+ Ethical Hacker

MS-4012: Microsoft 365 Copilot Chat Interactive Experience for Executives

ITIL® 4 Specialist: Create, Deliver, Support

ITIL ® 4 Foundation

AZ-900T00: Microsoft Azure Fundamentals

CISA (Certified Information Systems Auditor)

CompTIA Network+

Certified Kubernetes Administrator (CKA)

AZ-104T00: Microsoft Azure Administrator

(AWS DevOps Certified Engineer- Professional DevOps Engineering on AWS)

Maryland

LEXX Live Certified Training Centers