GitHub - pluja/whishper: Transcribe any audio to text with an easy UI. Powered by OpenAI's Whisper, LibreTranslate, Sveltekit and Golang.
github.com
external-link
Transcribe any audio to text with an easy UI. Powered by OpenAI's Whisper, LibreTranslate, Sveltekit and Golang. - GitHub - pluja/whishper: Transcribe any audio to text with an easy UI. Powered...

Hi everyone!

A few days ago I released Whishper, a new version of a project I’ve been working for about a year now.

It’s a self-hosted audio transcription suite, you can transcribe audio to text, generate subtitles, translate subtitles and edit them all from one UI and 100% locally (it even works offline).

I hope you like it, check out the website for self-hosting instructions: https://whishper.net

@crazygoat@lemmy.world
link
fedilink
English
19M

Even this is an good sound to text converter and a good ai transcription service

ares35
link
fedilink
41Y

how does whisper do transcribing technical documents. like for lawyers, doctors, engineers and what not? or speakers with heavy accents?

@pluja@lemmy.world
creator
link
fedilink
English
41Y

Whisper models have a very good WER (word error ratio) for languages like Spanish, English, French… if you use the english-only models it also improves. Check out this page on the docs:

https://whishper.net/reference/models/#languages-and-accuracy

@UberMentch@lemmy.world
link
fedilink
English
3
edit-2
1Y

Would love to deploy this, but unfortunately I’m running server equipment that apparently doesn’t support MongoDB 5 (Error message MongoDB 5.0+ requires a CPU with AVX support, and your current system does not appear to have that!). Tried deploying with both 4.4.18 and 4.4.6 and can’t get it to work. If anybody has some recommendations, I’d appreciate hearing them!

Edit: Changed my proxmox environment processor to host, fixed my issue.

@pluja@lemmy.world
creator
link
fedilink
English
11Y

I’m glad you were able to solve the problem, I add the comment I made to another user with the same problem:

Didn’t know about this problem. I’ll try to add a MariaDB alternative database option soon.

Konraddo
link
fedilink
English
11Y

Just tried this out but couldn’t get it to work until downgrading mongo to 4.4.6 because my NAS doesn’t ha``ve AVX support. But then, mongo stays unhealthy. No idea why.

@pluja@lemmy.world
creator
link
fedilink
English
11Y

Didn’t know about this problem. I’ll try to add a MariaDB alternative database option soon to solve this.

@optissima@lemmy.world
link
fedilink
English
11Y

I am looking for open source live transcription software, does this offer that, or is it only file-based?

Obinice
link
fedilink
English
31Y

I’ve been looking for a tool to do this for YEARS, my god! Years!!! ❤️❤️

@Axiochus@lemmy.world
link
fedilink
English
31Y

Oh, awesome! Does it do speaker detection? That’s been one of my main gripes with Whisper.

@pluja@lemmy.world
creator
link
fedilink
English
4
edit-2
1Y

Unfortunately, not yet. Whisper per se is not able to do that. Currently, there are few viable solutions for integration, and I’m looking at this one, but all current solutions I know about need GPU for this.

jherazob
link
fedilink
21Y

VERY understandable, requiring a GPU would limit it’s application and spread, i hope a good GPU-less solution is found eventually

@tvcvt@lemmy.ml
link
fedilink
English
21Y

This is excellent timing for me. I was just taking a break from working on setting up whisper.cpp with a web front end to transcribe interviews. This is a much nicer package than I ever had a chance of pulling together. Nice work!

Create a post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  1. Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

  • 1 user online
  • 31 users / day
  • 80 users / week
  • 216 users / month
  • 845 users / 6 months
  • 1 subscriber
  • 1.42K Posts
  • 8.13K Comments
  • Modlog