llama-cpp-agent is an open-source Python framework for running AI agents entirely offline on top of the llama.cpp inference engine. It provides fast, low-latency local inference and supports a modular plugin system, configurable memory, and task execution. Developers can integrate custom tools, switch between different local LLM models, and build privacy-focused conversational assistants without depending on external services.
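
The snippet below is a minimal sketch of that offline workflow: load a local GGUF model with llama-cpp-python, wrap it in a provider, and ask the agent for a chat response. The class and method names (LlamaCppAgent, LlamaCppPythonProvider, MessagesFormatterType, get_chat_response) follow the llama-cpp-agent documentation as I recall it, so treat them as assumptions and verify against the current docs; the model path is a placeholder.

```python
# Minimal offline chat agent sketch (names per llama-cpp-agent docs; verify
# against your installed version). The GGUF path below is a placeholder.
from llama_cpp import Llama
from llama_cpp_agent import LlamaCppAgent, MessagesFormatterType
from llama_cpp_agent.providers import LlamaCppPythonProvider

# Load a local GGUF model with llama-cpp-python; nothing leaves the machine.
model = Llama(model_path="models/mistral-7b-instruct.Q4_K_M.gguf", n_ctx=4096)

# Wrap the model in a provider and hand it to the agent.
provider = LlamaCppPythonProvider(model)
agent = LlamaCppAgent(
    provider,
    system_prompt="You are a concise, privacy-preserving assistant.",
    predefined_messages_formatter_type=MessagesFormatterType.CHATML,
)

print(agent.get_chat_response("Summarize why local inference helps privacy."))
```

Swapping models is a matter of pointing `model_path` at a different GGUF file; the agent and provider wiring stays the same.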
Notte is a developer-centric, open-source Python framework for orchestrating AI agents powered by large language models. It provides built-in memory modules for storing and retrieving conversational context, flexible tool integration for external APIs or custom functions, and a planning engine that sequences tasks. With Notte you can rapidly prototype conversational assistants, data-analysis bots, or automated workflows, with cross-platform support and the freedom to extend the framework itself.
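
The description does not show Notte's own API, so the sketch below uses deliberately hypothetical names to illustrate how the three pieces it mentions (a memory store, pluggable tools, and a planner that sequences tasks) typically interact in an agent loop. Nothing here should be read as Notte's actual interface.

```python
# Illustrative agent-loop sketch (NOT Notte's API): a memory store, pluggable
# tools, and a simple plan-then-execute loop. All names are hypothetical.
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Memory:
    """Keeps conversational context that later turns can retrieve."""
    turns: list[str] = field(default_factory=list)

    def add(self, text: str) -> None:
        self.turns.append(text)

    def context(self, last_n: int = 5) -> str:
        return "\n".join(self.turns[-last_n:])


@dataclass
class Agent:
    llm: Callable[[str], str]               # any text-in/text-out model call
    tools: dict[str, Callable[[str], str]]  # external APIs or custom functions
    memory: Memory = field(default_factory=Memory)

    def run(self, goal: str) -> str:
        # "Planning": ask the model for ordered steps, then execute each one,
        # routing a step to a tool when its name appears in the step text.
        plan = self.llm(f"Break this goal into numbered steps:\n{goal}")
        self.memory.add(f"goal: {goal}\nplan: {plan}")
        result = ""
        for step in (s for s in plan.splitlines() if s.strip()):
            tool_name = next((t for t in self.tools if t in step.lower()), None)
            if tool_name:
                result = self.tools[tool_name](step)
            else:
                result = self.llm(f"{self.memory.context()}\nDo this step: {step}")
            self.memory.add(f"{step} -> {result}")
        return result


# Example wiring with your own model callable and one tool:
# agent = Agent(llm=my_local_model, tools={"search": my_search_fn})
# print(agent.run("Find recent articles on agent memory and summarize them"))
```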