stable-audio-tools Public
Forked from Stability-AI/stable-audio-tools
Generative models for conditional audio generation
Python MIT License Updated Jul 25, 2024
e2-tts-pytorch Public
Forked from lucidrains/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
Python MIT License Updated Jul 22, 2024
BigVGAN Public
Forked from NVIDIA/BigVGAN
Official implementation of BigVGAN in PyTorch
Python MIT License Updated Jul 10, 2024
CosyVoice Public
Forked from FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capability.
Python Apache License 2.0 Updated Jul 10, 2024
ChatTTS Public
Forked from 2noise/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Python Other Updated Jul 9, 2024
Matcha-TTS Public
Forked from shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Jupyter Notebook MIT License Updated Jul 8, 2024
LivePortrait Public
Forked from KwaiVGI/LivePortrait
Make one portrait alive!
Python MIT License Updated Jul 8, 2024
SenseVoice Public
Forked from FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Python Other Updated Jul 8, 2024
stable-audio-metrics Public
Forked from Stability-AI/stable-audio-metrics
Metrics for evaluating music and audio generative models, with a focus on long-form, full-band, and stereo generations.
Python MIT License Updated Jul 6, 2024
fish-speech Public
Forked from fishaudio/fish-speech
Brand new TTS solution
Python Other Updated Jul 3, 2024
soundstorm-pytorch Public
Forked from lucidrains/soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Python MIT License Updated Jun 25, 2024
Make-An-Audio-3 Public
Forked from Text-to-Audio/Make-An-Audio-3
Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers
trl Public
Forked from huggingface/trl
Train transformer language models with reinforcement learning.
Python Apache License 2.0 Updated Jun 18, 2024
pyvideotrans Public
Forked from jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing.
Python GNU General Public License v3.0 Updated Jun 11, 2024
ChatTTS-ui Public
Forked from jianchang512/ChatTTS-ui
A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use.
Python Other Updated Jun 10, 2024
k-diffusion Public
Forked from crowsonkb/k-diffusion
Karras et al. (2022) diffusion models for PyTorch
Python MIT License Updated Jun 7, 2024
langchain Public
Forked from langchain-ai/langchain
⚡ Building applications with LLMs through composability ⚡
Python MIT License Updated Jun 6, 2024
GLM-4 Public
Forked from THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
Python Apache License 2.0 Updated Jun 6, 2024
parler-tts Public
Forked from huggingface/parler-tts
Inference and training library for high-quality TTS models.
Python Apache License 2.0 Updated Jun 4, 2024
madmom Public
Forked from CPJKU/madmom
Python audio and music signal processing library
Python Other Updated Jun 4, 2024
harmonixset Public
Forked from urinieto/harmonixset
The Harmonix Set: Beats, Downbeats, and Structural Annotations for Pop Music
Jupyter Notebook MIT License Updated Jun 4, 2024
CLAP Public
Forked from LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Python Creative Commons Zero v1.0 Universal Updated Jun 3, 2024
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Python GNU Affero General Public License v3.0 Updated May 29, 2024
Qwen-VL Public
Forked from QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Python Other Updated May 28, 2024
Bark-Voice-Cloning Public
Forked from KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Jupyter Notebook MIT License Updated May 11, 2024
all-in-one Public
Forked from mir-aidj/all-in-one
All-In-One Music Structure Analyzer
Python MIT License Updated May 9, 2024
StableTTS Public
Forked from KdaiP/StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
Python MIT License Updated Apr 17, 2024
AudioLDM2 Public
Forked from haoheliu/AudioLDM2
Text-to-Audio/Music Generation
Python Other Updated Mar 31, 2024