Transcription
Description
Evaluates transcription models on multi-lingual, multi-speaker audio with varying levels of background noise across multiple business domains such as software-development, finance, classifieds, food-delivery, and healthcare. The dataset consists of 150 unique audio samples, with each sample being augmented to generate a low and a high noise version.Provider
ProsusLanguage
English, Hindi, Portuguese, Polish, Afrikaans and DutchEvaluation
Accuracy is reported as 1 - WER (Word Error Rate).Data Statistics
Number of Samples450
Collection PeriodAugust 2024
SyntheticYes
Language
The language of the conversation.
Noise
The level of background noise in the audio sample.
Domain
The business domain of the conversation.
Results based on 0 entries.
Last updated: Invalid Date
# | Model | Provider | Size | Accuracy |
---|---|---|---|---|
No results. |
Rows per page
Page 1 of 0