

Is it possible to use transformers.js to implement audio source separation tasks? #788

Open
asasas234 opened this issue Jun 2, 2024 · 2 comments
Labels
question Further information is requested

Comments

@asasas234

Question

Hello, I have a beginner's question.

I want to implement, in the browser, removal of the human voice from a video's audio track while keeping the background sound. The idea is to load an audio source separation model with transformers.js, separate the background sound from the vocals, and then return only the background sound.

However, I couldn't find any relevant examples in the documentation, so I was wondering whether this can be implemented. If so, what learning or research path would you recommend?

Looking forward to your reply

@asasas234 asasas234 added the question Further information is requested label Jun 2, 2024
@xenova
Owner
xenova commented Jun 3, 2024

Hi there 👋 This library serves as a JavaScript port of the Python transformers library, so if you know of a model where you can do this, we can certainly look into it! Is something like https://huggingface.co/speechbrain/sepformer-wham what you're looking for?

@asasas234
Author

Yes, SpeechBrain looks good, but I think Demucs would be best. However, I'm a beginner in machine learning and Python, so I'm looking for the simplest solution that can achieve my goal.
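[Editor's note: independent of which separation model ends up being supported, the last step the question describes (returning only the background) is straightforward once a vocal estimate exists. A minimal sketch of that post-processing step, assuming mono time-domain waveforms at the same sample rate; `removeVocals` is an illustrative helper, not part of transformers.js:]

```javascript
// Given the original mixture and a model's estimated vocal track,
// recover the background by subtracting the vocals sample-by-sample.
// Assumes both are mono Float32Arrays at the same sample rate; if the
// vocal estimate is shorter, missing samples are treated as silence.
function removeVocals(mixture, vocals) {
  const background = new Float32Array(mixture.length);
  for (let i = 0; i < mixture.length; i++) {
    background[i] = mixture[i] - (vocals[i] ?? 0);
  }
  return background;
}
```

In practice, models like Demucs emit separate stems directly (vocals, drums, bass, other), in which case the non-vocal stems can simply be summed instead of subtracting; the subtraction above is the fallback when only a vocal estimate is available.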
