Whisperly User Guide

Introduction

Whisperly is a fully local, cross-platform desktop application that converts audio files to text using advanced AI technology. All audio processing is done locally on your computer, ensuring privacy and data security.

Privacy First: All processing is local, your audio never leaves your device
Multi-format Support: MP3, WAV, FLAC, M4A, AAC, OGG, WMA
High Quality: Powered by OpenAI Whisper model
Cross-platform: Windows, macOS, Linux
One-time Purchase: Lifetime license, no subscription
Offline Operation: No internet required after initial setup

Getting Started

Download Whisperly from the official website
Run the installer and follow the setup wizard
On first launch, the AI model (~1GB) will be downloaded
After download, you can use Whisperly fully offline

*Internet is only required for the initial model download.

Main Interface

Welcome Screen: Large icon, "Select Audio File" button, drag & drop support
Workspace: Left - file info/progress/export; Right - transcription results and editor
Status Indicators: Local processing, Ready, Processing, Loading, etc.

Audio Processing

Select audio file (multiple formats supported)
Automatic format detection and audio analysis
Preprocessing (convert to 16kHz mono)
AI transcription to generate text
Post-processing (punctuation, timestamps, formatting)

Supports auto language detection, quality/speed settings, CPU/GPU selection

Settings & Configuration

Model Selection: Tiny/Base/Small/Medium/Large (speed vs. accuracy)
Compute Device: Auto/CPU/NVIDIA GPU/Apple Metal
Output Format: TXT/SRT/JSON/CSV
Output Directory: Custom save path

Click "Save Settings" after changes. Some settings require restart.

Export & Copy

Export as TXT, SRT, JSON, CSV
One-click copy transcription to clipboard

After transcription, click "Export"
Select format and save location

Troubleshooting & Optimization

Model download failed: Check network and disk space
Slow processing: Close other apps, try smaller model or GPU acceleration
Unsupported audio: Convert to MP3/WAV and ensure file is not corrupted
Crash: Update version, check system requirements, try CPU-only mode

Optimization: Prefer GPU, close unnecessary apps, choose suitable model, ensure good cooling.

Keyboard Shortcuts

Ctrl+O: Open audio file
Ctrl+S: Save settings
Ctrl+Q: Quit application
F1: Open help
Ctrl+C: Copy transcription
Ctrl+E: Export transcription
Ctrl+,: Open settings
Enter: Save settings
Escape: Cancel/close panel

Advanced Features

Batch processing: Multiple files sequentially, batch export
Audio analysis: RMS, zero-crossing rate, spectral centroid, SNR
Multi-language: Auto/manual selection, 99+ languages, mixed content
Quality metrics: Confidence score for each segment
Custom dictionary: Add specialized terms for better accuracy

Support & Updates

Version and license info in settings
Auto update check when online, free lifetime upgrades
For help, visit the official website or contact support