Free Open-Source Artificial Intelligence@lemmy.world · English · 1 year ago

Whisper Large-v3 Release

github.com

Even_Adder@lemmy.dbzer0.com to Free Open-Source Artificial Intelligence@lemmy.world · English · 1 year ago

`large-v3` release · openai/whisper · Discussion #1762

github.com

We're pleased to announce the latest iteration of Whisper, called large-v3. Whisper-v3 has the same architecture as the previous large models except the following minor differences: The input uses ...

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

The large-v3 model shows improved performance across a wide variety of languages. The plot below includes all languages where Whisper large-v3 achieves an error rate below 60% on Common Voice 15 and Fleurs, showing a 10% to 20% reduction in errors compared to large-v2.
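To make the error-rate figures concrete, here is a minimal sketch of how a word error rate and a relative error reduction could be computed. It assumes the "10% to 20% reduction" is relative to the large-v2 error rate, and that the metric is word-level (some languages are usually scored with character error rate instead). The function names `word_error_rate` and `relative_error_reduction` are hypothetical helpers for illustration, not part of Whisper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_error_reduction(old_rate: float, new_rate: float) -> float:
    """Fraction of the old error rate that the new model removes."""
    if old_rate <= 0:
        raise ValueError("old_rate must be positive")
    return (old_rate - new_rate) / old_rate


if __name__ == "__main__":
    # Made-up sentences, purely illustrative; not taken from any benchmark.
    print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
    # Made-up rates: going from 10% to 8% error is a 20% relative reduction.
    print(relative_error_reduction(0.10, 0.08))
```

Under this reading, a language where large-v2 scored a 10% error rate would land somewhere between 8% and 9% with large-v3.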


TheLordHumungus@lemmy.world · English · 1 year ago · 1 upvote, 7 downvotes
DEATH TO ABOMINABLE INTELLIGENCE!

