AI Voice Agent is a simple yet powerful open-source project that transforms spoken input into natural language responses using state-of-the-art AI models. It captures user speech through a microphone, applies OpenAI Whisper to transcribe audio into text, sends the text to the ChatGPT API for intelligent dialogue generation, and then uses a text-to-speech engine such as Coqui TTS to convert the AI response back into spoken audio. This continuous loop delivers seamless, real-time voice interaction and can be adapted for virtual assistants, accessibility tools, or IoT device control.