Whisper.cpp Benchmark Report: Complete Performance Analysis on Legacy Hardware (Intel Core i5-460M) #3752
di-halt
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Whisper.cpp Benchmark Report: Complete Performance Analysis on Legacy Hardware (Intel Core i5-460M)
1. Introduction
This report provides a comprehensive and exhaustive benchmark of
whisper.cppperformance on a legacy Intel Core i5-460M (Arrandale) mobile processor from 2010. The test covers all major model sizes (tinytolarge-v3) and all relevant quantization types (q2_k,q3_k,q4_0,q4_1,q4_k,q5_0,q5_1,q5_k,q6_k,q8_0,f16). The goal is to empirically determine the optimal configuration for speed, accuracy, and disk footprint on old, resource-constrained hardware, and to identify which formats are beneficial and which are detrimental.2. Test Environment
-march=native -O3whisper.cpp(latest master as of testing)samples/jfk.wav(11.0 seconds, 16-bit PCM, 16 kHz Mono, clean English speech)3. Results: All Models & All Quantizations
The table below shows the total processing time, real-time factor (RTF), and subjective transcription quality. Lower
Total Timeand higherRTFare better. Models that failed or produced unacceptable output are explicitly marked."I, I, I, I..."GGML_ASSERT. Incompatible. DO NOT USE.large-v3-turbo-platinum-ml.4. Analysis: The Good, The Bad, and The Ugly Formats
Based on the comprehensive data, we can definitively categorize the quantization types:
The Good (Always Use These):
q4_0: UNDISPUTED CHAMPION. Fastest inference time on legacy hardware across all model sizes. Provides excellent quality identical to higher bitrates. The reduced memory bandwidth requirement is the key to its success on DDR3 systems. This is the only quantization you need.q8_0: Reliable and high quality, but strictly worse thanq4_0on this hardware. It is larger and slower with zero quality benefit for transcription. Use only ifq4_0is unavailable.q3_k(Situational): Viable for extreme disk space saving on thebasemodel. It offers acceptable quality but is significantly slower thanq4_0.The Bad & The Ugly (Avoid At All Costs):
q5_0,q5_1: CATASTROPHICALLY SLOW. On this CPU, 5-bit formats are 3x to 5.5x slower thanq4_0. The added computational overhead of unpacking completely destroys performance. NEVER USE THESE.q4_1: Consistently slower thanq4_0on all models with no quality gain. Obsolete.q4_k,q5_k,q6_k: These "K-quants" provide no performance benefit over standardq4_0on this legacy architecture. They are often slower and larger.q2_k: Extreme compression leads to model hallucination and gibberish onbaseandtiny. It is marginally usable onsmallandmediumbut remains very slow. Avoid.f16: Only useful as a source file for conversion toq4_0. Never use for inference.5. Final Recommendations for Legacy Hardware Users
Based on the data, the following models are the only files needed for a complete and optimized speech recognition toolkit on an i5-460M (or similar era) system:
tiny-mlq4_0base-mlq4_0large-v3-turboq4_0Beta Was this translation helpful? Give feedback.
All reactions