Plugin that needs to be adjusted to specific voice

DSP, Plugin and Host development discussion.

Post

Hi, I'm working on a plugin that processes the human voice in various ways. It is based on concepts from artificial intelligence (neural networks), and I already have a working proof of concept. It isn't a plugin yet and runs from the command line only, but it looks like it is going to work fine.

For this thread, let's just assume it really can turn terrible singing into stellar singing. It does need to be adjusted to the user's voice, though: they have to go through a singing exercise of under five minutes and then have that recording processed by my software. After this setup phase, they can run the plugin in real time from their DAW.

My question to you: would you consider using a technology with such an entry barrier, or would it turn you off? Thank you for your opinions, Martin

Post

Hi Martin. Autotune, Revoice or Melodyne already make me sing well without any adjustment.

Is your adjustment a one-time thing, or do I have to do it every time I record something new (because my room may have changed, or I'm having a bad day, or I get older and my voice gets deeper)?

Why 5 minutes? Is there room to bring that time down?

I'd say it's the result that decides whether this time is worth it or not.

Post

Thank you, these are all good points that I will try to take into account during development.
Autotune and Melodyne mostly work on pitch, while I focus more on timbre. There are a lot of people who don't sing off pitch but whose voice is somewhat weak or awkward … including me :hihi:

Post

Hahaha okay. If it makes me sound like Beyonce, I wouldn't consider a longer one-time adjustment a deal breaker :lol:

Post

music technology is instrumentation, so i'd expect most new plugin experiences to involve some learning curve, which the user hopefully finds engaging and informative.

having recently experimented with cepstra, i remember the first example i saw by kurzweil, in the 90s. this used data for another throat to do the beyonce thing. many cepstral applications involve fitting a phoneme to data.

i am surprised there aren't already plugins with "dataset" products that process a weak performance into an ideal one. i would think that, for many users, such a plug might require them to tailor their performance somewhat to get the right phoneme in some cases.

even if not for realtime, apps like melodyne could be replaced with an effect that turned some dribble into product, allowing automation of params, or simply using the source wav for phoneme keying.


my related note..

i baked the cepstral filter into impulses and trigger those at pitch rate for an oscillator.. for those who may have encountered julius smith's "commuted synthesis example" which has been posted for years.. this produces good spectral contouring but limits resynthesis to having the entire series of harmonics.. the definite plus is that it is latency free and super efficient since we're just rendering tables.
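
A rough sketch of the "bake the cepstral filter into an impulse, trigger it at pitch rate" idea, in Python with NumPy. This is only one way to read the description above, not the poster's actual code: the function names, the FFT size, the number of cepstral coefficients and the random stand-in frame are all made up for illustration, and the minimum-phase folding is an assumption chosen so that the impulse starts at sample 0 (which is what keeps the table latency free).

```python
import numpy as np

def min_phase_impulse(frame, n_fft=1024, n_ceps=40):
    """Bake a cepstrally smoothed spectral envelope into a minimum-phase impulse.

    n_fft and n_ceps are illustrative values, not taken from the thread.
    """
    windowed = frame * np.hanning(len(frame))
    log_mag = np.log(np.abs(np.fft.rfft(windowed, n_fft)) + 1e-9)
    ceps = np.fft.irfft(log_mag, n_fft)  # real cepstrum of the frame

    # Keep only low quefrencies (the smooth envelope) and fold the cepstrum
    # onto its causal half, which yields a minimum-phase spectrum.
    fold = np.zeros(n_fft)
    fold[0] = 1.0
    fold[1:n_ceps] = 2.0
    spectrum = np.exp(np.fft.rfft(ceps * fold))

    ir = np.fft.irfft(spectrum, n_fft)
    return ir / np.max(np.abs(ir))

def render_voice(ir, f0, sr, seconds):
    """Overlap-add one copy of the baked impulse per pitch period."""
    n = int(sr * seconds)
    out = np.zeros(n + len(ir))
    period = sr / f0  # pitch period in samples, may be fractional
    t = 0.0
    while t < n:
        start = int(t)  # truncated to whole samples: slight pitch jitter
        out[start:start + len(ir)] += ir
        t += period
    return out[:n]

# Stand-in for one frame of a recorded vowel (hypothetical input data).
rng = np.random.default_rng(0)
frame = rng.standard_normal(1024)

ir = min_phase_impulse(frame)
audio = render_voice(ir, f0=100.0, sr=44100, seconds=0.5)
```

Because every pitch period replays the same table, the output always carries the full harmonic series shaped by the envelope, which matches the limitation mentioned above. A real oscillator would also want sub-sample trigger positions instead of the integer truncation used here.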

..i have no intention of producing commercial vst again (happy with old school dev environ) but this is so simple to implement.. the breath component is simply written as a wavetable of filtered noise. users could build their own formant set and trade realistic sounding voices on a very light platform.
you come and go, you come and go. amitabha neither a follower nor a leader be tagore "where roads are made i lose my way" where there is certainty, consideration is absent.
