- Real-time streaming responses
- Multi-model support (OpenAI, Azure, local LLMs)
- Configurable prompts and endpoints
- Persistent conversation history
- Theme switching (light/dark)
- Markdown and code block rendering
- Plugin-friendly architecture
- Minimal Next.js & Tailwind stack