[go: nahoru, domu]

Skip to content

aibolem/Audio-to-Audio-translation

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Speech and Speech Translation App

This is a Flask web app that allows you to translate audio and video files to an audio output in any languag of your choice using OpenAI's whisper, gpt-3.5-turbo and Eleven labs. It uses the pytube, moviepy, and pydub libraries to download and process the videos, and nltk for tokenizing the text.

Watch a simple demo of the project:

6428e6f5e849b34fcd6d5b53_webm.1.mp4

Installation

Clone this repository and navigate to the project directory:

bash Copy code git clone https://github.com//.git cd Create a virtual environment and install the required packages:

''' Copy code python -m venv venv source venv/bin/activate # on Windows, use "venv\Scripts\activate" pip install -r requirements.txt '''

Usage

''' Run the app locally using Flask: arduino Copy code export FLASK_APP=app.py export FLASK_ENV=development flask run ''' Navigate to http://localhost:5000/ in your web browser to access the app.

Features

Upload audio or video files or links. Transcribe audio or video and generate a text output. Chuck words of over 3000 tokens. Use the OpenAI API to translate. Use Elevenlabs API to convert text to audio.

Dependencies

Flask Flask-SocketIO Flask-Bootstrap PyTube moviepy pydub nltk OpenAI API key Elevenlabs API key

Contributing

Contributions to this project are welcome. To contribute, please follow these steps: Fork this repository. Create a new branch: git checkout -b my-new-branch Make your changes and commit them: git commit -m "Add some feature" Push to the branch: git push origin my-new-branch Create a new pull request. Please include a clear description of your changes and their purpose.

Releases

No releases published

Packages

 
 
 

Languages

  • Python 65.8%
  • HTML 15.4%
  • JavaScript 10.0%
  • CSS 8.8%