Actually Useful AI

Welcome! 🤖

Our community focuses on programming-oriented, hype-free discussion of Artificial Intelligence (AI) topics. We aim to curate content that truly contributes to the understanding and practical application of AI, making it, as the name suggests, "actually useful" for developers and enthusiasts alike.

Be an active member! 🔔

We highly value participation in our community. Whether it's asking questions, sharing insights, or sparking new discussions, your engagement helps us all grow.

What can I post? 📝

In general, anything related to AI is acceptable. However, we encourage you to strive for high-quality content.

What is not allowed? 🚫

🔊 Sensationalism: "How I made $1000 in 30 minutes using ChatGPT - the answer will surprise you!"
♻️ Recycled Content: "Ultimate ChatGPT Prompting Guide" that is the 10,000th variation on "As a (role), explain (thing) in (style)"
🚮 Blogspam: Anything the mods consider crypto/AI bro success porn sigma grindset blogspam

General Rules 📜

Members are expected to engage in on-topic discussions, and exhibit mature, respectful behavior. Those who fail to uphold these standards may find their posts or comments removed, with repeat offenders potentially facing a permanent ban.

While we appreciate focus, a little humor and off-topic banter, when tasteful and relevant, can also add flavor to our discussions.

Related Communities 🌐

General

Chat

!chatgpt@lemmy.world

Image

Open Source

!fosai@lemmy.world

Please message @sisyphean@programming.dev if you would like us to add a community to this list.

Icon base by Lord Berandas under CC BY 3.0 with modifications to add a gradient

Building a TTS model for the first time? Good guides? (poptalk.scrubbles.tech)

submitted 10 months ago by scrubbles@poptalk.scrubbles.tech to c/auai@programming.dev

Hey all, brand new to this community, excited to be here!

I've stumbled my way through SD, and I currently have text-generation-webui and now SillyTavern up and running. I'm having lots of fun with all of this and learning how the pieces work together!

I've made a few models elsewhere, but for some reason I'm having trouble wrapping my head around TTS models. I have a voice I want to make a model for, and I currently have some videos to work from. I'm very familiar with editing audio and video, but stripping out their voice second by second sounds exhausting tbh.

I was wondering if anyone has any good guides on their process for making a TTS model? Are there steps that can be automated while still producing decent results? How much audio of a person speaking do I need? Should I run any specific tools to clean up the audio? I'm completely new, so any and all advice would be great.

I want to run it locally and "plug it in" to my existing cluster, so I'll also need the model to work with a tool that is compatible with the programs above (and I'll take advice there too if you have it!)

Thanks!
