Chat Streaming (SSE)

The chat system uses Server-Sent Events (SSE) for real-time streaming of AI responses, creating the “typing” effect users expect.

| Feature | SSE | WebSocket |
| --- | --- | --- |
| Direction | Server → Client | Bidirectional |
| Complexity | Simple HTTP | Requires upgrade handshake |
| Reconnection | Automatic | Manual |

For AI chat, we only need server → client. SSE is simpler and works perfectly.

| Event | When | Data |
| --- | --- | --- |
| `message_start` | Stream begins | `{ conversation_id, message_id }` |
| `message_delta` | Each token chunk | `{ content: "token text" }` |
| `message_end` | Stream complete | `{ tokens_used, cost_usd, citations }` |
| `error` | On failure | `{ code, message }` |
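
Assuming standard SSE framing (`event:` and `data:` lines, frames separated by a blank line), a minimal parser for one frame might look like this — a sketch, not the actual parsing in `api/chatClient.ts`:

```typescript
// Parse a single SSE frame into its event name and JSON payload.
// Minimal sketch: ignores `id:` and `retry:` fields and multi-line data.
function parseSSEFrame(frame: string): { event: string; data: any } {
  let event = "message"; // SSE default event name when none is given
  let data = "";
  for (const line of frame.split("\n")) {
    if (line.startsWith("event:")) event = line.slice(6).trim();
    else if (line.startsWith("data:")) data += line.slice(5).trim();
  }
  return { event, data: data ? JSON.parse(data) : null };
}
```

For example, `parseSSEFrame('event: message_delta\ndata: {"content":"hel"}')` yields the event name `message_delta` and the payload `{ content: "hel" }`.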
  1. User sends message → optimistic update (message appears immediately)
  2. Frontend POSTs to /api/v1/chat/stream with Accept: text/event-stream
  3. Backend creates StreamingResponse with async generator
  4. NutritionChatAgent builds RAG context → streams tokens from LLM
  5. Frontend accumulates tokens via useSSE hook → renders in real-time
  6. On complete → full message saved to state with citations and metadata
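
The token accumulation in steps 5–6 can be modeled as a pure reducer over the events from the table above. This is a sketch; the field names and state shape are assumptions, not the actual types in `useChat.ts`/`useSSE.ts`:

```typescript
// Hypothetical in-flight message state, derived from the event payloads above.
interface DraftMessage {
  id: string | null;
  content: string;
  done: boolean;
  citations: unknown[];
}

const initialDraft: DraftMessage = { id: null, content: "", done: false, citations: [] };

// Fold one SSE event into the draft message.
function applyEvent(msg: DraftMessage, event: string, data: any): DraftMessage {
  switch (event) {
    case "message_start":
      return { ...msg, id: data.message_id };
    case "message_delta":
      return { ...msg, content: msg.content + data.content }; // append token chunk
    case "message_end":
      return { ...msg, done: true, citations: data.citations ?? [] };
    default:
      return msg;
  }
}
```

Keeping this step pure makes the "typing" effect trivial to render: each `message_delta` produces a new state, and React re-renders with the longer `content` string.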
Frontend (TypeScript):

| File | Purpose |
| --- | --- |
| `hooks/useChat.ts` | All chat state management |
| `hooks/useSSE.ts` | SSE streaming state |
| `api/chatClient.ts` | `streamMessage()` with SSE parsing |
| `components/chat/ChatInput.tsx` | Fixed input at viewport bottom |
Backend (Python):

| File | Purpose |
| --- | --- |
| `api/v1/chat.py` | SSE endpoint + conversation CRUD |
| `agents/nutrition_chat.py` | AI agent with streaming |
| `services/context_builder.py` | RAG context assembly |

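
The endpoint itself is Python (a FastAPI `StreamingResponse` wrapping an async generator), but the frames it emits can be sketched language-agnostically. Here it is in TypeScript for consistency with the other examples; the payload values are placeholders, not real output:

```typescript
// Encode one SSE frame: event name, JSON data, blank-line terminator.
function sseFrame(event: string, data: object): string {
  return `event: ${event}\ndata: ${JSON.stringify(data)}\n\n`;
}

// Sketch of the frame sequence api/v1/chat.py produces for a short response.
function chatStream(tokens: string[]): string[] {
  const frames = [sseFrame("message_start", { conversation_id: "c-1", message_id: "m-1" })];
  for (const token of tokens) {
    frames.push(sseFrame("message_delta", { content: token }));
  }
  frames.push(sseFrame("message_end", { tokens_used: tokens.length, cost_usd: 0.0001, citations: [] }));
  return frames;
}
```

One frame per token keeps latency low: the client can paint each `message_delta` the moment it arrives instead of waiting for the full completion.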
  • Optimistic updates — user messages appear before API response
  • Abort streaming — cancel mid-response via AbortController
  • Citations — knowledge sources returned in message_end
  • Meal references — referenced meals rendered as mini-cards
  • Fixed input — chat input stays at viewport bottom (position: fixed)
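
The abort feature maps directly onto `AbortController`: the controller's signal is handed to `fetch`, and calling `abort()` tears down the in-flight stream. A minimal sketch — the wrapper name and shape are hypothetical, and the real wiring lives in `api/chatClient.ts`:

```typescript
// Hypothetical cancellable stream handle around the SSE request.
function createChatStream(url: string) {
  const controller = new AbortController();
  const start = () =>
    // Passing the signal lets abort() cancel both the pending request
    // and any in-progress read of the response body.
    fetch(url, {
      method: "POST",
      headers: { Accept: "text/event-stream" },
      signal: controller.signal,
    });
  return { start, abort: () => controller.abort(), signal: controller.signal };
}
```

When `abort()` fires, the pending `fetch` (or body read) rejects with an `AbortError`, which the UI can treat as a user cancellation rather than a failure.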