Basic interactive LLM to voice pipeline project

Currently changing lots of things so README is probably outdated in some parts

Create an issue and i might update it & help

Demo (turn on the sound)

2024-03-07.03-34-04.mp4

Requirements

llama-cpp-python (+ compatible models)
TTS
PyAudio
OpenAi whisper
===For the bot part:
discord-ext-voice-recv
dev version of discord.py
virtual audio cable + voicemeeter
python_dotenv

Install

clone repo
create venv and pip install requirements.txt into it
- dev version of discord.py
find devices (TTS input, output) (Im currently using VoiceMeeter Input, VoiceMeeter Out)
python get_audio_devices.py
copy the names of the devices into appropriate place in main.py and bot.py
main.py gets the input, bot.py gets the output
create .env and set TOKEN="your discord bot token"
download llama.cpp compatible models into ./models
edit config.json

Usage

launch both main and bot
press enter in main to start
join vc and ping bot join
@yourbot join
it probably wont work because i forgot to mention something

Notes

Currently using/tested mistral dolphin 2.1 and nous hermes 2
For tts using ~~Xtts_v2~~ Vits (theres probably better options)
Often came across torch not detecting CUDA
To fix torch, install using this

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.gitignore		.gitignore
README.md		README.md
bot.py		bot.py
config.json		config.json
get_audio_devices.py		get_audio_devices.py
main.py		main.py
requirements-discordpy.txt		requirements-discordpy.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Basic interactive LLM to voice pipeline project

Currently changing lots of things so README is probably outdated in some parts

Create an issue and i might update it & help

Demo (turn on the sound)

Requirements

Install

Usage

Notes

About

Releases

Packages

Languages

zigzag1001/LLM-to-TTS

Folders and files

Latest commit

History

Repository files navigation

Basic interactive LLM to voice pipeline project

Currently changing lots of things so README is probably outdated in some parts

Create an issue and i might update it & help

Demo (turn on the sound)

Requirements

Install

Usage

Notes

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages