How to Install Buzz – Step-by-Step Guide (2025)

Buzz is a free, open-source audio transcription tool that brings OpenAI’s Whisper models to your Linux desktop. It runs completely offline, ensuring your files never leave your machine—crucial for privacy. Buzz processes everything locally using AI models trained on diverse languages and accents, supporting batch transcription, real-time recording, and multiple output formats.

Why Install Buzz on Linux?

Linux users value control, privacy, and efficiency—Buzz delivers all three. Key benefits:

  • GPU acceleration for speed or CPU-only processing for compatibility
  • Seamless integration with PulseAudio and PipeWire
  • Live transcription of system audio or microphone input
  • No cloud dependencies or monthly fees
  • Offline processing for complete privacy

System Requirements

Before installation, ensure your system meets these requirements:

  • Python: 3.8 or higher
  • FFmpeg: For audio processing
  • RAM: 1GB (tiny model) to 10GB+ (large model)
  • GPU: CUDA (NVIDIA) or ROCm (AMD) for acceleration (optional)
  • Storage: ~500MB for application and models
  • Recommended: 8GB RAM, dual-core processor for medium models

Install Dependencies on Ubuntu

Ubuntu and Debian-based distributions use APT for dependency installation:

sudo apt update
sudo apt install python3 python3-pip ffmpeg portaudio19-dev

Verification: Run python3 --version and ffmpeg -version to confirm installation.

Install Dependencies on Fedora

Fedora uses DNF with slightly different package names:

sudo dnf install python3 python3-pip ffmpeg portaudio-devel

Verification: Check versions with python3 --version and ffmpeg -version.

Install Dependencies on Arch Linux

Arch Linux users benefit from current packages through pacman:

sudo pacman -S python python-pip ffmpeg portaudio

Note: Arch uses python instead of python3 as the default command.

Install Whisper and Buzz via Pip

The universal installation method across all distributions uses pip:

pip3 install openai-whisper
pip3 install buzz-captions

Configure PATH

If pip shows a PATH warning, add ~/.local/bin to your PATH:

export PATH="$HOME/.local/bin:$PATH"
source ~/.bashrc

Launch Buzz: Run buzz in your terminal.

Install Buzz Using Flatpak

Flatpak provides distribution-independent installation with automatic dependency management:

flatpak remote-add --if-not-exists flathub https://flathub.org/repo/flathub.flatpakrepo
flatpak install flathub com.github.chidiwilliams.buzz

Launch: Run flatpak run com.github.chidiwilliams.buzz or use your application menu.

Configure Buzz for Live Transcription

After launching Buzz, configure these options:

  • Select your microphone from the input device dropdown
  • Choose a Whisper model based on hardware:
    • Tiny/Base: Older machines
    • Small/Medium: Balance accuracy with performance
    • Large: Best accuracy, requires substantial RAM

Command-Line Usage

buzz --model medium --device cuda

Options:

  • --device cuda: GPU acceleration (NVIDIA)
  • --device cpu: CPU-only processing
  • --language en: Set language explicitly

Output formats: TXT, SRT, VTT for subtitling.

Troubleshooting Common Errors

Dependency Conflicts

If pip reports version incompatibilities, create a virtual environment:

python3 -m venv buzz-env
source buzz-env/bin/activate
pip install openai-whisper buzz-captions

Microphone Access Errors

Check audio group membership:

groups | grep audio
sudo usermod -aG audio $USER

For PipeWire systems:

systemctl --user status pipewire-pulse

GPU Acceleration Issues

NVIDIA GPU users:

  • Install CUDA toolkit and cuDNN libraries
  • Verify with nvidia-smi
  • Install PyTorch with CUDA support:
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

AMD GPU users: Install rocm-hip-sdk, then reinstall PyTorch with ROCm support.

Verify GPU recognition:

python3 -c "import torch; print(torch.cuda.is_available())"

Should return True. If issues persist, fall back to CPU processing.

Conclusion

You now have a powerful offline transcription system on your Linux machine. Buzz converts audio into accurate text without compromising privacy or requiring subscription fees.