h a l f b a k e r yWe don't have enough art & classy shit around here.
add, search, annotate, link, view, overview, recent, by name, random
news, help, about, links, report a problem
browse anonymously,
or get an account
and write.
register,
|
|
|
Autotune allows people who can't sing to perform vocals. This idea goes a step further, allowing people who struggle to articulate a sentence to sound coherent.
It uses existing generative text software, combined with existing audio deepfake technology, so that, when you mumble something into
the microphone, well-formed words come out of the speakers, in your voice and in tune.
This would make possible a new game, where contestants would try to sing prompts of which the software could make no sense.
Please log in.
If you're not logged in,
you can see what this page
looks like, but you will
not be able to add anything.
Destination URL.
E.g., https://www.coffee.com/
Description (displayed with the short name and URL.)
|
|
How do you know this is not already in general use? If its good you could never tell. Watch the lips? |
|
|
MacOS has a function that allows you to record your voice and use it for entered text. After a bit of training, its uncanny. Combine with existing Chat AI and you are baked, but not at the speed that would allow public speaking. Thats a minor speed bump, though. Soon come. |
|
|
[a1] I found my recorded voice very accurate but totally enervated. I like it. I may record some responses to the disinterested Sri Lankan who picks up the Help Line phone. Im going to try it out on people who know me, and wait for the accusations of drug use. |
|
|
What would happen if you layered the Speak Text output with autotune? Do you get Milli Vanilli again? |
|
|
I think there should be a feature which either curtails a lisp or replaces the "s" sounding words, if that's not homophobic. |
|
|
//Do you get Milli Vanilli again?// If Mill Vanilli fall over in a forest, does someone else make a sound? |
|
|
[4and20] What would you replace the s words with? Who are we being kind to here? Ill help but I dont know who this helps. |
|
|
[hippo] Yes, I hear cheering and James Brown. |
|
|
//MacOS has a function that allows you to record your voice and use it for entered text.// |
|
|
Huh, very tempting to use that for remote powerpoint presentations. If I'm tempted, it's already happening. Forewarned is forearmed, I'll have to think of ways to check that who's presenting is actually presenting. |
|
|
Change your real speaking voice to match.. or just wait. The simulation will get better and all the time exposure to real speaking voices declines. Already, huge chunks of youtube are generated voices. TikTok takes real voices and chops them up to remove gaps and changes speed. Podcast apps do the same thing. What fraction of speech that young people hear is real vs manipulated? |
|
|
Once that MacOS ability is trained initially, it listens and refines as you use it. As I understand it the work is done in the cloud IRT, but the data resides on your device and is not dependent upon a cloud connection to operate once socialized, and theres no data retained in the cloud. Anyone who knows me could tell that it is a canned voice construct but strangers might worry for my health yet accept the voice as real. Pretty good as to timbre and pitch, gravelly-ness and noise, but bad at pacing and expression. |
|
| |