Skip to content

Claytonn7/speecht5-finetune

Repository files navigation

SpeechT5-finetuning

This repository contains code and instructions for fine-tuning Microsoft SpeechT5 model for Hindi and English. The finetuned model is available here on Hugging Face: https://huggingface.co/clayton07/speecht5_finetuned_hindi_mono

Prerequisites

  • Python 3.8+
  • CUDA-compatible GPU (recommended)
  • Git
  • Jupyter Notebook
  • 50GB+ disk space for datasets and models

Installation

  1. Clone the repository:
git clone https://github.com/Claytonn7/speecht5-finetune.git
cd speecht5-finetune
  1. Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
  1. Install required packages:
pip install -r requirements.txt
  1. Start Jupyter Notebook:
jupyter notebook

Contributing

  1. Fork the repository
  2. Create your feature branch
  3. Commit your changes
  4. Push to the branch
  5. Create a new Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

For any questions or issues, please open an issue in the repository.

Releases

No releases published

Packages

No packages published