python -m venv .venv
.venv/Scripts/activate
pip install -r requirements.txt
Can't pay for expensive servers so the AI is executed on Google Colab
Use "Echidna 13B (United)" (WARNING - slightly unfiltered...)
It will take around 10-25 minutes to load the model.
Paste the generated link into the ENDPOINT variable located in mic_vad_streaming.py
The link will look like https://crude-nevertheless-seconds-experts.trycloudflare.com/
Create an ElevenLabs Account and copy your API key into your .env file in the variable "ELEVEN_LABS_KEY"
python main.py
- Keep a transcript of all the things the AI said
- Have a filter mode. This is a a barrier from KoboldAI and "elevenlabs TTS"
- Make an interface (Vtube studio??)
- Implement a deafen feature so the AI doesnt pick up your reactions to its input or miscellaneous noises (audio compression)