Module 4 - Kernel Optimization
GPU kernel writing and optimization - occupancy, tiling, tensor cores, Flash Attention, kernel fusion, and Triton.