Transcription

Description

Evaluates transcription models on multi-lingual, multi-speaker audio with varying levels of background noise across multiple business domains such as software-development, finance, classifieds, food-delivery, and healthcare. The dataset consists of 150 unique audio samples, with each sample being augmented to generate a low and a high noise version.

Provider

Prosus

Language

English, Hindi, Portuguese, Polish, Afrikaans and Dutch

Evaluation

Accuracy is reported as 1 - WER (Word Error Rate).

Data Statistics

Number of Samples450

Collection PeriodAugust 2024

SyntheticYes

Language

The language of the conversation.

Noise

The level of background noise in the audio sample.

Domain

The business domain of the conversation.

Results based on 0 entries.

Last updated: Invalid Date

#	Model	Provider	Size	Accuracy
No results.

Rows per page

Page 1 of 0