Skip to main content

Module 4 - Kernel Optimization

GPU kernel writing and optimization - occupancy, tiling, tensor cores, Flash Attention, kernel fusion, and Triton.