Module 3 - Model Serving

REST and gRPC APIs, batching, model servers (Triton, TorchServe), and low-latency inference.
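
As a rough illustration of the batching topic named above, here is a minimal sketch of size-based request batching in plain Python. The names `make_batches`, `toy_model`, and `serve` are hypothetical and are not taken from Triton or TorchServe. Production model servers typically also apply a maximum queue delay so that a partly filled batch is not held indefinitely; that part is omitted here.

```python
from typing import Callable, List, Sequence


def make_batches(requests: Sequence[float], max_batch_size: int) -> List[List[float]]:
    """Group pending requests into batches of at most max_batch_size items."""
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be at least 1")
    return [
        list(requests[i:i + max_batch_size])
        for i in range(0, len(requests), max_batch_size)
    ]


def toy_model(batch: List[float]) -> List[float]:
    # Hypothetical stand-in for a real model: doubles each input.
    return [2.0 * x for x in batch]


def serve(
    requests: Sequence[float],
    model: Callable[[List[float]], List[float]],
    max_batch_size: int,
) -> List[float]:
    """Call the model once per batch and return outputs in request order."""
    outputs: List[float] = []
    for batch in make_batches(requests, max_batch_size):
        outputs.extend(model(batch))
    return outputs


if __name__ == "__main__":
    # Five requests with max_batch_size=2 result in three model calls.
    print(serve([1.0, 2.0, 3.0, 4.0, 5.0], toy_model, max_batch_size=2))
```

The point of the sketch is the trade-off: a larger `max_batch_size` means fewer model calls and usually better throughput, but each request may wait longer before its batch runs.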