Whisper is an open-source automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. It is designed to be robust to accents, background noise and technical language, and can transcribe and translate speech in multiple languages into English. It is a simple end-to-end approach, implemented as an encoder-decoder Transformer. It is also capable of performing language identification and phrase-level timestamps. It is designed to be easy to use and have high accuracy, allowing developers to add voice interfaces to more applications.
글로벌 순위
#26 4
국가 순위
82 2
범주 등급
4
액세스
1.5B
반등률
42.24%
액세스당 페이지 수
4.36
평균 진료 시간
00:04:22
액세스 35.39K 가격 모델 Free
액세스 887.72K 가격 모델 Free
액세스 23.94K 가격 모델 Freemium