Whisper (OpenAI)

⭐ 102.2k MIT Python 20250601

OpenAI’s benchmark for speech recognition, with 99% accuracy and support for 100+ languages, enabling offline operation on local devices.

📋 Info

GitHub Stars⭐ 102.2k Stars
LicenseMIT
LanguagePython
Version20250601
Updated2026-03-15

📖 Overview

OpenAI Whisper is the industry standard for automatic speech recognition, boasting 102k Stars on GitHub. It achieves a subtitle generation accuracy of up to 99% for English and over 95% for Chinese. It supports automatic recognition and transcription in more than 100 languages. It can run offline locally, ensuring data privacy and security. Various model sizes are available, ranging from tiny to large, allowing users to choose based on their hardware. Faster-whisper (an accelerated version of CTranslate2) and whisper.cpp (the C++ version) further optimize inference speed. The subtitle features in almost all video tools are built upon Whisper, making it a fundamental tool for AI short-video workflows.

✨ Features

  • 99% recognition accuracy (the highest in the industry)
  • Speech recognition / ASR for over 100 languages
  • Offline operation on local devices — ensuring data security
  • Models available in various sizes, from tiny to large.
  • faster-whisper provides 4x acceleration for workflow inference.

Advertisement

🚀 Quick Start

brew install ffmpeg
pip install setuptools-rust

🔗 Related Tools