Chatbot with OpenAI and Google Cloud Speech

This repository contains a chatbot built using the OpenAI API, with integrated audio processing and speech recognition via the Google Cloud Speech API. It allows users to capture audio, transcribe it into text, and then generate automatic responses.

Demo

Watch the demonstration of the Chatbot here:

Features:

Audio Capture:

Records audio from the microphone in real-time and saves it as a .wav file.

Speech Transcription:

Uses Google Cloud Speech API to transcribe the recorded audio into text.

ChatGPT Interaction:

Sends the transcription to OpenAI's GPT-3.5 model and returns a response based on the provided content.

GPT Response to Audio

Converts the GPT-generated responses into speech using Google Text-to-Speech (gTTS), allowing users to hear the chatbot’s reply.

* Flask Integration

Libraries:

openai
google-cloud-speech

Credential Requirements:

OpenAI API Key
Google Cloud Speech credentials (JSON file)

AJAX for Real-Time Chat

In this project, AJAX is used to create a seamless real-time chat experience. The chat is updated dynamically as messages are exchanged between the user and the bot, without requiring the page to reload.

TODO as improvements:

• DB Implementations

Consider adding a database to store user conversations for tracking history, user progress, and analytics. Storing interactions could enable personalized responses and suggestions based on previous chats.

• Improving Performance with Celery (Optional)

To boost performance, especially when handling multiple requests, you can use Celery for background jobs.

Handling API calls in the background to improve responsiveness
Enhancing scalability
Increasing response speed.
Combined with AJAX, it offers a smoother user experience by providing real-time updates.

Instalation

Clone this repository:

git clone https://github.com/narumikat/chatbot-openai.git

Install the dependencies:

pip install - r requirements.txt

Set up credentials:

Add your OpenAI API Key in the .env file:

OPENAI_API_KEY=your_openai_api_key

Download the Google Cloud service account JSON file for Speech-to-Text API access, and export its path as an environment variable

GOOGLE_CREDENTIALS_PATH=path/to/your-google-credentials.json

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.idea		.idea
.venv		.venv
__pycache__		__pycache__
static		static
templates		templates
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
app.py		app.py
input_audio.wav		input_audio.wav
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chatbot with OpenAI and Google Cloud Speech

Demo

Features:

Audio Capture:

Speech Transcription:

ChatGPT Interaction:

GPT Response to Audio

* Flask Integration

Libraries:

Credential Requirements:

AJAX for Real-Time Chat

TODO as improvements:

• DB Implementations

• Improving Performance with Celery (Optional)

Instalation

About

Releases

Packages

Languages

narumikat/chatbot-openai

Folders and files

Latest commit

History

Repository files navigation

Chatbot with OpenAI and Google Cloud Speech

Demo

Features:

Audio Capture:

Speech Transcription:

ChatGPT Interaction:

GPT Response to Audio

* Flask Integration

Libraries:

Credential Requirements:

AJAX for Real-Time Chat

TODO as improvements:

• DB Implementations

• Improving Performance with Celery (Optional)

Instalation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages