@RobertJene

⌚ Timestamps
0:00 - introduction
0:14 - Suno Bark
1:22 - Valle-X
3:00 - StyleTTS2
4:07 - CoquiTTS - XTTS
5:40 - Tortoise TTS

@SawunirMasunta

AI-generated writing looks way more natural after running it through Walter Writes AI Humanizer

@vikramr60

Coqui TTS is not open source, means it can't be used for commercial purposes,only for research and educational

@dontrez8412

Pretty good results. I didn't think that first Bark voice bad, though. Thanks for the comparisons.

@chaks2432

I built a GUI for XTTS using flask and svelte and finally got rvc running yesterday. Got inspired by your audiobok_maker, but it was missing some features I figured could be pretty useful (Like allowing users to edit text inside the GUI, add/delete/reorder lines), I'm pretty happy with the result, even if the UI looks like crap and it's still a little buggy. I also got everything to run together, so I don't need the ai-voice-cloning webUI running for it to work

@spiritual_audiobooks

A Open Source  local, fast neural text to speech system that sounds great is Piper TTS.

@Raghav_Kapoor_

3:21 , how to use this voice ?

@Jonathan_Dawson

Hi can you please make a tutorial of the Audiobook Maker, or how to create such pipelines? In particular you mentioned something along the lines of "RVCS" which seemed to make a dramatic difference in the last voice that you demonstrated! How is it done?

@Edward_ZS

What option runs the fastest 

And do any of these work without a GPU

@nielsieboy19

From what I've seen StyleTTS does a much better job of cloning a voice, it's also an order of magnitude faster than Tortoise. Only thing holding it back are the absolutely mental VRAM requirements for training and multilingual models (which are being worked on by the community).

@Lenox-bp3lu

Which one of these did you use for the accent conversion at the start of this video? Please and thank you

@Morpheus.999

i can't find the  audio sample at 3:21 , how could download this?

@StudioPersimmon

Tortoise wins out, but I'm very interested in seeing more from Eleven's. From these examples, Coqui definitely seems to get the closest to your voice out the box, but the actual quality of the audio sounds very low. Is there a way to set the bitrate?

@bestcureremedies

Hi, could you make a video showing step by step how to install one of these open-source apps on a windows 11 computer and a demo how to use it once it is installed. Good work!

@ozerune

Could you do one of these for 2025? Curious to see how much the landscape has changed in the last year.

@RobertJene

6:15 - what do you mean, pipeline from Tortoise TTS to RVC? Like you train a model in Tortoise and then use it in RVC or something?

@ea02ca6f

Why not order the links in the description in the same order they are mentioned in the video with missing links added?

@poco7193

With tortoise TTS I have been issues with training it. I will upload my audio for training and go through the first two steps smoothly, when I actually try to run the training it freezes with some text then just never unfreezes no matter how long I wait. Also I was wanting to know what the 2nd software you were using in this video to make the tortoise tts sound smoother. I am trying to make a podcast for a school project and desperately need a smooth tts for some of my characters

@robertbutcher222

Sorry if this is a bad question for this sort of this video, but is there a way to use one of these in Linux Mint or Ubuntu to read selected text? I like to have selectable text read to me when I highlight text with the mouse cursor. There is a script I could make, if I find the instructions again, but the voice is very robotic. So, I was wondering if one of these could somehow be used, preferably offline.

@ferysery

hi . where can i get my hands on ur AUDIOBOOK MAKER windows desktop app?