Whisper-large-v3

New | Audio Analysis

Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation, proposed in the paper "Robust Speech Recognition via Large-Scale Weak Supervision" by Alec Radford et al. from OpenAI. Trained on >5M hours of labeled data, Whisper demonstrates a strong ability to generalise to many datasets and domains in a zero-shot setting.

About the Whisper-large-v3 model

Published on Hugging Face

01/11/2023


Audio price

0.00004083 / second
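
To put the per-second rate in perspective, here is a minimal sketch that converts it into a cost for a given audio length. It assumes billing is strictly linear in audio duration, with no minimum charge or rounding, which the listing does not confirm. The currency is not stated in the listing either, so the result is a bare number. The names PRICE_PER_SECOND and transcription_cost are illustrative only.

```python
from decimal import Decimal

# Listed per-second rate. The currency is not given in the listing.
PRICE_PER_SECOND = Decimal("0.00004083")


def transcription_cost(duration_seconds):
    """Return the cost for audio of the given length in seconds.

    Assumes linear per-second billing with no minimum charge or rounding.
    """
    # Convert via str() so float inputs do not carry binary rounding noise.
    return PRICE_PER_SECOND * Decimal(str(duration_seconds))


if __name__ == "__main__":
    print("1 minute:", transcription_cost(60))
    print("1 hour:  ", transcription_cost(3600))
```

Under these assumptions, one hour of audio comes to roughly 0.147 in the listed currency unit.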


Output formats
json, verbose_json, text
Context sizes
Unknown
Parameters
1.54B

Try the model out by experimenting with it.