Ollama Bot is a Node.js-based AI agent designed to run on Discord servers, leveraging the Ollama CLI and local LLM models for generating conversational responses. It establishes a persistent chat context, allowing users to maintain topic continuity over multiple messages. Administrators can define custom prompts, set model parameters, and restrict commands to specific roles. The bot supports multiple LLM models, automatically manages message queues for high throughput, and logs interactions for audit purposes. Installation involves cloning the repository, installing dependencies via npm, and configuring environment variables such as the Discord bot token and Ollama settings. Once deployed, the bot listens for slash commands, forwards queries to the Ollama model, and posts generated replies directly in Discord channels.