[go: nahoru, domu]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ISSUE for v2]: dynamic volume not working properly, again #1296

Open
2 of 7 tasks
Kuuko-fokkusugaru opened this issue Aug 1, 2024 · 2 comments
Open
2 of 7 tasks
Labels
v.2 for version.2.x

Comments

@Kuuko-fokkusugaru
Copy link
Sponsor
Kuuko-fokkusugaru commented Aug 1, 2024

Voice Changer Version

vcclient_win_cuda_2.0.450-alpha.zip

Operational System

Windows 11 Pro

GPU

3080Ti

CUDA Version

12.5.78

Read carefully and check the options

  • If you use win_cuda_torch_cuda edition, setup cuda? see here
  • If you use win_cuda edition, setup cuda and cudnn? see here
  • If you use mac edition, client is not launched automatically. Use chrome to open application.?
  • I've tried to change the Chunk Size
  • I've tried to set the Index to zero
  • I've read the tutorial
  • I've tried to extract to another folder (or re-extract) the .zip file

Does pre-installed model work?

YES

Model Type

RVC PTH

Issue Description

2.0.50 dynamic volume doesn't work anymore. It went back to the default behavior.
Linear mode only reduces the whole average volume of the output.
Analyzing both outputs, sqrt and linear, and increasing the extremely low volume of linear to match sqrt, it ends with the same exact results as sqrt. There is a slightly change in volume, but is not like in 2.0.44. The differences in the volume of 2.0.44 are gone :(

The first track shows sqrt, and the second track shows linear after volume amplification.

image

Application Screenshot

No response

Logs on console

Not needed

@w-okada
Copy link
Owner
w-okada commented Aug 1, 2024

I intended to trace the inputted volume.
Is it not sufficient to use gain to address the overall low volume?

@w-okada w-okada added the v.2 for version.2.x label Aug 1, 2024
@Kuuko-fokkusugaru
Copy link
Sponsor Author

Here is a comparison.
The first track is the original audio.
Second track is v2.0.50 with linear volume type and output gain 6.
Third track is v2.0.44 with linear ratio volume type and output gain 1.
While I feel like there are some changes in volume, even at gain 6, the volume seemed much closer to the original in v2.0.44. The lower and higher volume parts had a very similar amplitude. The main issue of v2.0.44 was the glitch "pop" sound when silence happened but the way that it managed the volume was PERFECT. v2.0.50 feels very close to the original sqrt. I guess it's hard to make it work like in the v2.0.44 without the glitch sound but I did like that at 1 gain input and 1 gain output, the volume was 1:1 with the one being outputted from the original microphone. And surely I can use gain to correct the volume, but I still feel like it's quite off. I have marked some green zones where you can see that the volume change did a great job at keeping the same original levels and how the new version can't keep that.

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
v.2 for version.2.x
Projects
None yet
Development

No branches or pull requests

2 participants