Starred repositories

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,582 1,019 Updated Jun 26, 2024

FAIR Sequence Modeling Toolkit 2

Python 634 65 Updated Jul 25, 2024

SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

Python 938 59 Updated Jan 27, 2024

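This is a fully automatic (audio) video translation project. It uses Whisper to recognize speech, a large AI model to translate the subtitles, and finally merges the subtitles with the video to produce the translated video.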

Python 805 80 Updated Jul 24, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,474 368 Updated Jul 18, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 80,162 6,120 Updated Jul 25, 2024

A Python library for editing subtitle files

Python 290 38 Updated Jun 30, 2024

Faster Whisper transcription with CTranslate2

Python 10,469 881 Updated Jul 24, 2024

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1,014 54 Updated Apr 22, 2024

Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data (CVPR 24); Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer (ECCV 2024)

Python 180 6 Updated Jul 2, 2024

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 1,176 89 Updated Jul 17, 2024

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 1,263 140 Updated Jul 24, 2024

Multilingual Voice Understanding Model

Python 1,580 146 Updated Jul 24, 2024

A version of CosyVoice for use in a Windows environment

Python 209 27 Updated Jul 21, 2024

Bark Voice Cloning and Voice Cloning for Chinese Speech

Jupyter Notebook 2,617 373 Updated Jul 8, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 2,651 246 Updated Jul 23, 2024

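Deploys yolov8 with OpenCV for face and keypoint detection plus face quality assessment, with both C++ and Python versions; the programs run with only the opencv library as a dependency, completely removing any dependency on deep learning frameworks.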

Python 180 35 Updated Jan 8, 2024

yolov8 face detection with landmark

Python 460 62 Updated Apr 2, 2024

Easy-to-use Face Analysis Tool

Python 4 Updated Jul 7, 2024

Bring portraits to life!

Python 8,033 730 Updated Jul 24, 2024

Brand new TTS solution

Python 6,362 500 Updated Jul 23, 2024

Shell 7,068 969 Updated Jul 24, 2024

Populate library namespace without incurring immediate import costs

Python 118 19 Updated Jul 14, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 12,780 1,058 Updated Jul 25, 2024

Enjoy the magic of Diffusion models!

Python 5,947 532 Updated Jul 12, 2024

Code for FreeTraj, a tuning-free method for trajectory-controllable video generation

Python 73 2 Updated Jul 24, 2024

Luma Web Examples, use lumalabs.ai captures directly in your three.js or other WebGL projects!

TypeScript 304 33 Updated Mar 6, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Python 10,974 979 Updated Jul 24, 2024

Access large archives as a filesystem efficiently, e.g., TAR, RAR, ZIP, GZ, BZ2, XZ, ZSTD archives

Python 675 35 Updated Jun 2, 2024

Windows File System Proxy - FUSE for Windows

C 6,810 492 Updated Jun 20, 2024