
Starmel/OpenSuperWhisper


OpenSuperWhisper

OpenSuperWhisper is a macOS application that provides real-time audio transcription using the Whisper model. It offers a seamless way to record and transcribe audio with customizable settings and keyboard shortcuts.

Features

  • 🎙️ Real-time audio recording and transcription
  • 🧠 Two transcription engines: Whisper and Parakeet — download models directly from the app
  • ⌨️ Global keyboard shortcuts — key combination or single modifier key (e.g. Left ⌘, Right ⌥, Fn)
  • ✊ Hold-to-record mode — hold the shortcut to record, release to stop
  • 📁 Drag & drop audio files for transcription with queue processing
  • 🎤 Microphone selection — switch between built-in, external, Bluetooth and iPhone (Apple Continuity) mics from the menu bar
  • 🌍 Support for multiple languages with auto-detection
  • 🇯🇵🇨🇳🇰🇷 Asian language autocorrect (via the autocorrect library)

Installation

brew update # Optional
brew install opensuperwhisper

Alternatively, download the app from the GitHub releases page.

Requirements

  • macOS (Apple Silicon/ARM64)

Support

If you encounter any issues or have questions, please:

  1. Check the existing issues in the repository
  2. Create a new issue with detailed information about your problem
  3. Include system information and logs when reporting bugs

Building locally

To build locally, run the following commands:

git clone git@github.com:Starmel/OpenSuperWhisper.git
cd OpenSuperWhisper
git submodule update --init --recursive
brew install cmake libomp rust ruby
gem install xcpretty
./run.sh build

If you run into problems, consult .github/workflows/build.yml, the CI workflow that builds the app automatically on GitHub.
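
Before running ./run.sh build, it can help to confirm that the tools installed above are actually on your PATH. The following is a minimal sketch, not part of the repository: the helper name missing_tools is hypothetical, and the command names (cargo for the rust package, xcpretty for the gem) are assumptions about what those packages install.

```shell
# Print the name of each given tool that cannot be found on PATH.
# missing_tools is a hypothetical helper, not something shipped with the repo.
# libomp is a library rather than a command, so it is not checked here.
missing_tools() {
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            printf '%s\n' "$tool"
        fi
    done
}

# Example (assumed command names):
#   missing_tools git cmake cargo ruby xcpretty
```

If the function prints nothing, every listed command was found; any printed name points at a package to reinstall.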

Contributing

Contributions are welcome! Please feel free to submit pull requests or create issues for bugs and feature requests.

Contribution TODO list

  • Streaming transcription
  • Custom dictionary / keyword boosting (#19)
  • Intel macOS compatibility (#15)
  • Agent mode (#14)
  • Background app (#8)
  • Support long-press single key audio recording (#18)

License

OpenSuperWhisper is licensed under the MIT License. See the LICENSE file for details.

Whisper Models

You can download Whisper model files (.bin) from the Whisper.cpp Hugging Face repository. Place the downloaded .bin files in the app's models directory. On first launch, the app will attempt to copy a default model automatically, but you can add more models manually.
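
Adding a model manually comes down to copying a .bin file into the models directory. The sketch below shows one way to do that with a small sanity check. The function name install_model is hypothetical, and the location of the app's models directory is not stated here, so it is passed in as an argument rather than assumed.

```shell
# Copy a downloaded Whisper model file into a models directory.
# Usage: install_model <path/to/model.bin> <models_dir>
# install_model is a hypothetical helper; <models_dir> must be supplied
# by the caller because this README does not state the app's actual path.
install_model() {
    src=$1
    dest_dir=$2
    # Only accept files with the .bin extension mentioned above.
    case "$src" in
        *.bin) ;;
        *) echo "expected a .bin model file: $src" >&2; return 1 ;;
    esac
    # Refuse to continue if the source file does not exist.
    [ -f "$src" ] || { echo "no such file: $src" >&2; return 1; }
    # Create the destination if needed, then copy the model into it.
    mkdir -p "$dest_dir" && cp "$src" "$dest_dir/"
}
```

For example, install_model ~/Downloads/ggml-base.bin "$MODELS_DIR", where the file name is illustrative and MODELS_DIR is a placeholder you set to the app's models directory.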
