Pre-beta access

Transcribe. Diarize.
Stay Private.

On-device AI that converts audio into structured, speaker-labeled text. No internet required. Your data never leaves your machine.

Designed for simplicity. Built for power.

A clean, focused interface that gets out of your way.

WhisperCore user interface (screenshot)

Everything runs on your hardware.

Transcription

Powered by Whisper AI models. Supports 99+ languages with high accuracy, running entirely on your GPU or CPU.

Speaker Diarization

Automatically identifies and labels different speakers in your audio. Know who said what, without manual tagging.

GPU Accelerated

CUDA-accelerated inference for NVIDIA GPUs. Transcribe hours of audio in minutes, not hours.

100% Private

Zero cloud processing. No telemetry. Your audio files are processed locally and never touch the internet.

Multiple Formats

Export transcripts to TXT, SRT, advanced ASS subtitles, and Word-level SRT. Compatible with MP3, WAV, FLAC, MP4, MKV, and more.
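The SRT format itself is simple enough to sketch. Here is a minimal, illustrative Python example (not WhisperCore's internal exporter) that renders timed segments as an .srt document:

```python
def to_srt_timestamp(seconds):
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1_000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

def segments_to_srt(segments):
    """Render (start, end, text) tuples as numbered SRT cues."""
    blocks = []
    for i, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{i}\n{to_srt_timestamp(start)} --> {to_srt_timestamp(end)}\n{text}\n"
        )
    return "\n".join(blocks)

print(segments_to_srt([(0.0, 2.5, "Hello, world."), (2.5, 5.0, "Second line.")]))
```

Word-level SRT uses the same structure with one cue per word; ASS adds styling on top.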

Open Models

Uses OpenAI's Whisper models (tiny to large). Choose the quality-versus-speed trade-off that fits your hardware.

Three steps. Zero cloud.

From audio file to structured transcript — everything happens on your machine.

1

Drop your file

Drag & drop or browse for any audio or video file. Supports MP3, WAV, M4A, FLAC, MP4, and more.

2

Process locally

WhisperCore transcribes on your GPU or CPU using Whisper AI models. Optional speaker diarization identifies who said what.

3

Get your transcript

Export to TXT, SRT, ASS, or word-level SRT. Speaker labels are embedded automatically. Open the output folder and you're done.

Pick the right model for your hardware.

Smaller models are faster. Larger models are more accurate. We recommend medium for most users.

Model                  Size     VRAM     Best For
tiny                   39 MB    ~1 GB    Quick drafts, low-end hardware
base                   74 MB    ~1 GB    Fast transcription, decent quality
small                  244 MB   ~2 GB    Balanced speed & quality
medium (recommended)   769 MB   ~5 GB    Best balance for most users
large-v3-turbo (new)   800 MB   ~6 GB    High accuracy, faster speed
large-v3               1.5 GB   ~10 GB   Maximum accuracy, high-end GPU
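The table above reduces to a simple lookup. A hypothetical helper (VRAM figures taken from the table) that picks the most accurate model fitting a given VRAM budget:

```python
# Approximate VRAM needs in GB, from the model table above.
MODEL_VRAM = [
    ("tiny", 1), ("base", 1), ("small", 2),
    ("medium", 5), ("large-v3-turbo", 6), ("large-v3", 10),
]

def pick_model(vram_gb):
    """Return the most accurate model that fits in the given VRAM budget."""
    best = "tiny"  # CPU-friendly fallback when nothing else fits
    for name, need in MODEL_VRAM:
        if need <= vram_gb:
            best = name  # entries are ordered smallest to largest
    return best

print(pick_model(6))   # an RTX 3060-class card with 6 GB -> large-v3-turbo
```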

Powered by Faster-Whisper.

We use the optimized CTranslate2 backend to deliver state-of-the-art speeds.

Inference Speed (RTF)

Standard Whisper
4x Real-time
WhisperCore (GPU)
50x Real-time

Using the tiny model on an NVIDIA RTX 3060. A higher real-time multiple is better.
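"Nx real-time" is just audio duration divided by processing time. A quick sketch of the arithmetic behind the numbers above (the 72-second figure is illustrative):

```python
def realtime_factor(audio_seconds, processing_seconds):
    """Speed as a multiple of real time: 50x means an hour of audio in ~72 s."""
    return audio_seconds / processing_seconds

# One hour of audio processed in 72 seconds -> 50x real-time.
print(realtime_factor(3600, 72))   # 50.0
# The same hour at 4x real-time takes 900 seconds (15 minutes).
print(realtime_factor(3600, 900))  # 4.0
```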

Word Error Rate (WER)

Long Interview: ~16%
Clear Monologue: ~0.16%

Accuracy matches OpenAI's official implementation while running 10x faster.
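Word error rate is the word-level edit distance between reference and hypothesis, divided by the reference length. A self-contained sketch of the standard computation (illustrative, not WhisperCore's evaluation harness):

```python
def wer(reference, hypothesis):
    """Word error rate: (substitutions + insertions + deletions) / reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    # Classic dynamic-programming edit distance over words.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,        # deletion
                          d[i][j - 1] + 1,        # insertion
                          d[i - 1][j - 1] + cost)  # substitution or match
    return d[len(ref)][len(hyp)] / len(ref)

# One dropped word out of six -> WER of 1/6 (~16.7%).
print(wer("the cat sat on the mat", "the cat sat on mat"))
```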

System Requirements

WhisperCore runs on Windows 10/11 (64-bit). A GPU is recommended for best performance.

Minimum

  • OS: Windows 10 (64-bit)
  • CPU: Intel i5 / AMD Ryzen 5
  • RAM: 8 GB
  • Storage: 2 GB free
  • GPU: Not required (CPU mode)
  • Model: tiny / base

Works with your files.

Import a wide range of audio formats and export to popular subtitle and text formats.

Audio Input

MP3 WAV M4A FLAC AAC OGG MP4 MOV MKV

Export Output

TXT SRT ASS Word-SRT

Pricing is coming soon.

Plans will be published once the product reaches beta.

Current Status

Pre-beta hardening in progress

Pre-beta
  • Early access by waitlist
  • Reliability hardening focus
  • No public pricing yet
Join waitlist

Team & Enterprise

Contact us for pilot requirements

Coming soon
  • Deployment planning support
  • Offline-first deployment options
  • No public pricing yet
Contact Sales

Frequently Asked Questions

Is WhisperCore really 100% offline?

Yes. All transcription and diarization runs entirely on your local hardware using AI models stored on your device. The only network request is a one-time license verification during activation. After that, no internet connection is needed.

Do I need an NVIDIA GPU?

No. WhisperCore works on CPU as well. However, an NVIDIA GPU with CUDA support significantly speeds up transcription — often 5-10x faster than CPU-only processing. AMD GPUs are not currently supported for acceleration.

Which Whisper models are supported?

All official OpenAI Whisper models: tiny, base, small, medium, large-v1, large-v2, large-v3, and large-v3-turbo. Smaller models run faster, while larger models provide higher accuracy. We recommend medium for most users.

How does speaker diarization work?

WhisperCore uses a separate on-device model to detect and label different speakers in your audio. It automatically assigns labels like "Speaker 1" and "Speaker 2", which are embedded into your exported transcripts and subtitles. The diarization model also runs locally: no cloud processing whatsoever.
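The merge step can be illustrated generically: given diarization turns (speaker, start, end) and transcript segments, assign each segment the speaker whose turn overlaps it most. A hypothetical sketch, not WhisperCore's actual pipeline:

```python
def overlap(a_start, a_end, b_start, b_end):
    """Length of the time overlap between two intervals, in seconds."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))

def label_segments(segments, turns):
    """Attach the best-overlapping speaker label to each transcript segment.

    segments: list of (start, end, text); turns: list of (speaker, start, end).
    """
    labeled = []
    for start, end, text in segments:
        speaker = max(turns, key=lambda t: overlap(start, end, t[1], t[2]))[0]
        labeled.append((speaker, text))
    return labeled

turns = [("Speaker 1", 0.0, 4.0), ("Speaker 2", 4.0, 9.0)]
segments = [(0.5, 3.5, "Hi there."), (4.2, 8.0, "Hello!")]
print(label_segments(segments, turns))
```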

What is the current release status?

WhisperCore is currently in pre-beta hardening. Access is limited and pricing is not yet published. Join the waitlist to receive beta availability updates.

Will my data be used for training?

Absolutely not. Your audio files and transcription output never leave your device. We don't have access to your data, and we never will. We use pre-trained open-source models — no user data is involved in any training process.

Up and running in minutes.

1

Join the pre-beta waitlist

Enter your email below to receive beta access updates and rollout notifications.

2

Download & install

Run the installer — WhisperCore will set up Python, CUDA, and all dependencies automatically.

3

Activate your license

If required for your build, complete one-time activation. Core transcription remains local.

4

Drop a file, get your transcript

Drag any audio or video file into WhisperCore. Your transcript is ready in minutes.

Get early access.

Sign up for pre-beta access updates. Pricing is coming soon.

We'll notify you when beta slots and pricing details are available.