AI Voice Agent is a simple yet powerful open-source project that transforms spoken input into natural language responses using state-of-the-art AI models. It captures user speech through a microphone, applies OpenAI Whisper to transcribe audio into text, sends the text to the ChatGPT API for intelligent dialogue generation, and then uses a text-to-speech engine such as Coqui TTS to convert the AI response back into spoken audio. This continuous loop delivers seamless, real-time voice interaction and can be adapted for virtual assistants, accessibility tools, or IoT device control.
Two-Way Voice for Bard is a Chrome extension designed to enhance your experience with Google Bard. This innovative tool enables voice interaction, allowing you to ask questions and receive spoken responses. It's perfect for users who prefer a hands-free experience, making communication feel more like a conversation than a query. By eliminating the need for typing, it fosters a more engaging interaction with AI, leveraging advanced voice recognition technologies for seamless communication.