Skip to main content

Module 3 - Model Serving

REST and gRPC APIs, batching, model servers (Triton, TorchServe), and low-latency inference.