[go: nahoru, domu]

Skip to content
#

gtts-api

Here are 53 public repositories matching this topic...

The dataset is Fer 2013 and can be viewed in Kaggle. We give an image as an input and get an output of the emotion and corresponding songs via an audio. The given audio files are for the happy and angry emotions. These files can also be created for all the remaining five emotions as well, as is clear in the code.

  • Updated Sep 4, 2021
  • Jupyter Notebook

This project aims to assist visually impaired individuals by providing a solution to convert images into spoken language. Leveraging deep learning and natural language processing, the system processes images, generates descriptive captions, and converts these captions into audio output.

  • Updated Oct 16, 2023
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the gtts-api topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gtts-api topic, visit your repo's landing page and select "manage topics."

Learn more