Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Anyone who has used elevenlabs for voice generation has found this to be the case. Voice to voice seems like magic.


Elevenlabs isn’t remotely close to how good this voice sounds. I’ve tried to use it extensively before and it just isn’t natural. This voice from openAI and even the one chatGPT has been using is natural.


When have you last used it. I used a few weeks ago to create a fake podcast as a side project recently and it sounded pretty good with their highest end model with cranked up tunings.


About 3 months ago for that exact use case.


My point isn’t necessarily elevenlabs being good or bad, it’s the difference between its text to voice and voice to voice generations. The latter is incredibly expressive and just shows how much is lacking in our ability to encode inflection in text.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: