Skip to main content

5 docs tagged with "streaming"

View all tags

gRPC with Python

Protocol Buffers, gRPC service definitions, streaming, interceptors, and when to use gRPC over REST.

Streaming LLM Responses

Streaming LLM output in Python - server-sent events, async generators, FastAPI streaming endpoints, and building real-time chat UIs.

Streaming Responses

Implementing and optimizing streaming for real-time LLM response delivery - SSE, chunking strategies, backpressure, tool use streaming, and production patterns for perceived performance.