Hongwen hongwen-sun

🎯

Focusing

254 followers · 163 following

stable-audio-tools Public
Forked from Stability-AI/stable-audio-tools

Generative models for conditional audio generation

Python MIT License Updated Jul 25, 2024
e2-tts-pytorch Public
Forked from lucidrains/e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch

Python MIT License Updated Jul 22, 2024
BigVGAN Public
Forked from NVIDIA/BigVGAN

Official implementation of BigVGAN in PyTorch

Python MIT License Updated Jul 10, 2024
CosyVoice Public
Forked from FunAudioLLM/CosyVoice

Multilingual large voice generation model with full-stack support for inference, training, and deployment.

Python Apache License 2.0 Updated Jul 10, 2024
ChatTTS Public
Forked from 2noise/ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Python Other Updated Jul 9, 2024
Matcha-TTS Public
Forked from shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook MIT License Updated Jul 8, 2024
LivePortrait Public
Forked from KwaiVGI/LivePortrait

Make one portrait alive!

Python MIT License Updated Jul 8, 2024
SenseVoice Public
Forked from FunAudioLLM/SenseVoice

Multilingual Voice Understanding Model

Python Other Updated Jul 8, 2024
stable-audio-metrics Public
Forked from Stability-AI/stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python MIT License Updated Jul 6, 2024
fish-speech Public
Forked from fishaudio/fish-speech

Brand new TTS solution

Python Other Updated Jul 3, 2024
soundstorm-pytorch Public
Forked from lucidrains/soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch

Python MIT License Updated Jun 25, 2024
Make-An-Audio-3 Public
Forked from Text-to-Audio/Make-An-Audio-3

Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers

Python Updated Jun 23, 2024
trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python Apache License 2.0 Updated Jun 18, 2024
Index-1.9B Public
Forked from bilibili/Index-1.9B

Python Apache License 2.0 Updated Jun 15, 2024
pyvideotrans Public
Forked from jianchang512/pyvideotrans

Translate a video from one language to another and add dubbing.

Python GNU General Public License v3.0 Updated Jun 11, 2024
ChatTTS-ui Public
Forked from jianchang512/ChatTTS-ui

A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use.

Python Other Updated Jun 10, 2024
k-diffusion Public
Forked from crowsonkb/k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Python MIT License Updated Jun 7, 2024
langchain Public
Forked from langchain-ai/langchain

⚡ Building applications with LLMs through composability ⚡

Python MIT License Updated Jun 6, 2024
GLM-4 Public
Forked from THUDM/GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python Apache License 2.0 Updated Jun 6, 2024
parler-tts Public
Forked from huggingface/parler-tts

Inference and training library for high-quality TTS models.

Python Apache License 2.0 Updated Jun 4, 2024
madmom Public
Forked from CPJKU/madmom

Python audio and music signal processing library

Python Other Updated Jun 4, 2024
harmonixset Public
Forked from urinieto/harmonixset

The Harmonix Set: Beats, Downbeats, and Structural Annotations for Pop Music

Jupyter Notebook MIT License Updated Jun 4, 2024
CLAP Public
Forked from LAION-AI/CLAP

Contrastive Language-Audio Pretraining

Python Creative Commons Zero v1.0 Universal Updated Jun 3, 2024
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

Python GNU Affero General Public License v3.0 Updated May 29, 2024
Qwen-VL Public
Forked from QwenLM/Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python Other Updated May 28, 2024
opengpts Public
Forked from langchain-ai/opengpts

Rich Text Format MIT License Updated May 21, 2024
Bark-Voice-Cloning Public
Forked from KevinWang676/Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Jupyter Notebook MIT License Updated May 11, 2024
all-in-one Public
Forked from mir-aidj/all-in-one

All-In-One Music Structure Analyzer

Python MIT License Updated May 9, 2024
StableTTS Public
Forked from KdaiP/StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python MIT License Updated Apr 17, 2024
AudioLDM2 Public
Forked from haoheliu/AudioLDM2

Text-to-Audio/Music Generation

Python Other Updated Mar 31, 2024
