tapWhisper — NVIDIA Parakeet ONNX

Specifications

Size ~120 MB - 400 MB

Architecture Transducer (TDT)

Latency Low

Language English

Developer / Creator

NVIDIA (NeMo team), Sherpa ONNX community

License

Model: CC BY 4.0; Sherpa ONNX runtime: Apache-2.0

Download Source

Verified Repository Source

Hugging Face Hub / Sherpa ONNX model catalog

NVIDIA Parakeet TDT 0.6B v3

Exact runtime artifacts

sherpa-onnx-nemo-parakeet-tdt-0.6b-v3-int8.tar.bz2
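
The artifact above is a bzip2-compressed tar archive, so it has to be unpacked before the runtime can load it. A minimal sketch using only the Python standard library follows; the helper name extract_model and the destination directory are illustrative assumptions, not part of tapWhisper.

```python
import tarfile
from pathlib import Path

# Archive name as listed under "Exact runtime artifacts".
ARCHIVE_NAME = "sherpa-onnx-nemo-parakeet-tdt-0.6b-v3-int8.tar.bz2"


def extract_model(archive: Path, dest: Path) -> list[str]:
    """Unpack a .tar.bz2 model archive into dest.

    Returns the sorted archive-relative names of the regular files it contains.
    (Hypothetical helper for illustration.)
    """
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, mode="r:bz2") as tar:
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions: refuse members that would escape dest.
            tar.extractall(dest, filter="data")
        else:
            tar.extractall(dest)
        return sorted(m.name for m in tar.getmembers() if m.isfile())
```

Usage, assuming the archive has already been downloaded to the working directory: extract_model(Path(ARCHIVE_NAME), Path("models")) returns the list of extracted files, which can be checked before the model is loaded.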

Model Overview

NVIDIA's Parakeet is a speech-to-text model optimized for English. tapWhisper uses a build quantized to INT8 in ONNX format, which runs in-process through the Sherpa ONNX engine with no separate server. It is intended for accurate, low-latency English dictation in coding, business, and general use.
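
To illustrate what running in-process through Sherpa ONNX can look like, here is a minimal sketch using the sherpa-onnx Python package. It is not tapWhisper's own code. The file names inside the model directory (encoder.int8.onnx, decoder.int8.onnx, joiner.int8.onnx, tokens.txt) are assumptions about the archive layout, the helper names are hypothetical, and the input is assumed to be a mono 16-bit PCM WAV file.

```python
import wave
from pathlib import Path


def model_files(model_dir):
    """Map recognizer arguments to files in model_dir (assumed file names)."""
    d = Path(model_dir)
    return {
        "encoder": str(d / "encoder.int8.onnx"),
        "decoder": str(d / "decoder.int8.onnx"),
        "joiner": str(d / "joiner.int8.onnx"),
        "tokens": str(d / "tokens.txt"),
    }


def transcribe(model_dir, wav_path, num_threads=2):
    """Transcribe one mono 16-bit WAV file in-process and return the text."""
    # Third-party imports are kept local so model_files() works without them.
    import numpy as np
    import sherpa_onnx

    recognizer = sherpa_onnx.OfflineRecognizer.from_transducer(
        **model_files(model_dir),
        num_threads=num_threads,
        model_type="nemo_transducer",
    )

    with wave.open(str(wav_path), "rb") as f:
        if f.getnchannels() != 1 or f.getsampwidth() != 2:
            raise ValueError("expected mono 16-bit PCM audio")
        sample_rate = f.getframerate()
        pcm = np.frombuffer(f.readframes(f.getnframes()), dtype=np.int16)

    # Scale int16 samples to float32 in [-1, 1).
    samples = pcm.astype(np.float32) / 32768.0

    stream = recognizer.create_stream()
    stream.accept_waveform(sample_rate, samples)
    recognizer.decode_stream(stream)
    return stream.result.text
```

Because the recognizer is an ordinary object in the calling process, a dictation app could build it once at startup and reuse it for every utterance, which avoids paying the model load time on each request.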

Available Model Variants

Model Name	File Size	RAM Usage	Format/Quant	Languages	Description
NVIDIA Parakeet	465 MB	1.4 GB	INT8 (ONNX)	English	NVIDIA Parakeet TDT 0.6B v3 model, aimed at accurate English dictation for coding and business use.
