A conversational AI that simulates speaking with the Mexican artist Frida Kahlo, using speech recognition and dynamic listening.
- Dynamic Listening: Detects when you start and stop speaking
- Natural Voice: Uses Frida's persona with OpenAI's high-quality text-to-speech
- Filler Statements: Plays short responses immediately while generating a full answer
- Interactive Conversations: Frida asks questions to keep the dialog engaging
- Python 3.6+
- OpenAI API key
- PyAudio
- Clone this repository
- Install dependencies:
pip install -r requirements.txt - Set your OpenAI API key:
export OPENAI_API_KEY="your_key_here"
Basic usage:
python whisper_test.py
--model "gpt-4": Use a different model for responses (default: gpt-3.5-turbo)--skip-welcome: Skip the welcome message--tts-voice "nova": Change the OpenAI TTS voice (default: shimmer)
shimmer(default): Female voicenova: Female voicealloy: Non-binary voiceecho: Male voicefable: Male voiceonyx: Male voice
Basic usage with default settings:
python whisper_test.py
Use GPT-4 for higher quality responses:
python whisper_test.py --model "gpt-4"
Try a different voice:
python whisper_test.py --tts-voice "nova"
- The program listens using your microphone
- When you speak, it dynamically detects your voice and records
- When you stop speaking, it automatically stops recording
- Your speech is transcribed using OpenAI Whisper
- A quick filler phrase plays immediately to keep the conversation flowing
- Frida responds using GPT (asking questions ~50% of the time)
- The response is spoken using OpenAI's text-to-speech