Actually Useful AI


OpenAI's Whisper has been out for a couple of months. Does anyone know how I can use it without busting out the command line? (lemmy.dbzer0.com)

submitted 1 year ago* (last edited 1 year ago) by ArkyonVeil@lemmy.dbzer0.com to c/auai@programming.dev


Greetings Citizens of Hopefully Useful AI.

It has come to my attention that there are plenty of videos, as well as workflows, that would get so much better if it were possible to textify their audio content.

That being said, I hear that Whisper has been, at least for the past 9 months or so, the cream of the crop when it comes to speech recognition. It is also open source to boot (shocker).

Therefore, I'd be quite pleased to know if anyone has created a way to make use of the model more easily, because dedicating mental space to remembering specific ad hoc commands does not make for a good long-term tool.

For reference, I can throw 24 GB of VRAM at the problem if need be, and I am running a Windows machine. Is there anything like Oobabooga or A1111 for this? (A standard program would work just as nicely.) Any pointers would be very much appreciated.

Type in your answer, and ENRICH the future of Lemmy with your knowledge. (As well as answer one's question, pretty please.)

Thank you very much for reading and have a most fine of days!


[–] boothin@kbin.social 6 points 1 year ago (1 children)

https://github.com/ahmetoner/whisper-asr-webservice
This project might be of interest to you. It is a web service/API for transcribing with Whisper. You can either use its web page or make programmatic calls to the API.
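For the programmatic route, here is a rough sketch of what a call might look like from Python using only the standard library. The details are assumptions rather than anything confirmed in this thread: it assumes the service is running locally at `http://localhost:9000`, that it accepts a multipart upload in a form field named `audio_file` at a `/asr` endpoint, and that `output=json` returns an object with a `text` key. Check the project's README for the actual endpoint and parameters. The helper names (`build_asr_url`, `encode_multipart`, `transcribe`) are made up for illustration.

```python
import json
import urllib.parse
import urllib.request
import uuid
from pathlib import Path


def build_asr_url(base_url: str, task: str = "transcribe", output: str = "json") -> str:
    """Build the request URL. The /asr path and query names are assumptions."""
    query = urllib.parse.urlencode({"task": task, "output": output})
    return f"{base_url.rstrip('/')}/asr?{query}"


def encode_multipart(field: str, filename: str, payload: bytes, boundary: str) -> bytes:
    """Encode a single file as a multipart/form-data request body."""
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail


def transcribe(audio_path: str, base_url: str = "http://localhost:9000") -> str:
    """Upload an audio file and return the transcript text (assumed JSON shape)."""
    path = Path(audio_path)
    boundary = uuid.uuid4().hex
    # "audio_file" is the assumed form field name; adjust to match the service.
    body = encode_multipart("audio_file", path.name, path.read_bytes(), boundary)
    request = urllib.request.Request(
        build_asr_url(base_url),
        data=body,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        result = json.loads(response.read().decode("utf-8"))
    # Assumes the JSON response carries the transcript under "text".
    return result.get("text", "")
```

With the service running, something like `print(transcribe("interview.mp3"))` (a hypothetical file name) should print the transcript, so there are no commands to memorize beyond starting the service once.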

[–] ArkyonVeil@lemmy.dbzer0.com 1 points 1 year ago

Oh, this is quite interesting. Quite interesting indeed! I approve of this. Seems like it may be exactly what I was looking for.

Much appreciated!