Speed up `length` of text strings by 01mf02 · Pull Request #417 · 01mf02/jaq

01mf02 · 2026-03-26T11:04:07Z

This is an improved version of #394 which does not use core::str::from_utf8, but rather implements the same technique as in BurntSushi/bstr#223. That should allow it to achieve much more stable results for strings that contain a few invalid UTF-8 sequences, but which are still mostly valid UTF-8.

Furthermore, this removes a few casts for length calculation, which previously could truncate the length of very large strings.

Results:

$ hyperfine -L v main,length-str2 "target/release/jaq-{v} -n '[\"\" | limit(100000; recurse(. + \"a\")) | length] | add'"
Benchmark 1: target/release/jaq-main -n '["" | limit(100000; recurse(. + "a")) | length] | add'
  Time (mean ± σ):      2.495 s ±  0.009 s    [User: 2.481 s, System: 0.003 s]
  Range (min … max):    2.487 s …  2.512 s    10 runs
 
Benchmark 2: target/release/jaq-length-str2 -n '["" | limit(100000; recurse(. + "a")) | length] | add'
  Time (mean ± σ):     156.4 ms ±   1.6 ms    [User: 152.4 ms, System: 3.1 ms]
  Range (min … max):   154.6 ms … 162.1 ms    19 runs
 
Summary
  target/release/jaq-length-str2 -n '["" | limit(100000; recurse(. + "a")) | length] | add' ran
   15.96 ± 0.17 times faster than target/release/jaq-main -n '["" | limit(100000; recurse(. + "a")) | length] | add'

In the pure ASCII case, calculating string length is now up to 16 times faster than in main.

Speed up length of valid UTF-8 text strings.

f1ed503

01mf02 mentioned this pull request Mar 26, 2026

Speed up length of valid UTF-8 text strings. #394

Closed

01mf02 changed the title ~~Speed up length of valid UTF-8 text strings, take 2.~~ Speed up length of text strings Mar 26, 2026

01mf02 merged commit c0da08b into main Mar 26, 2026
4 checks passed

01mf02 deleted the length-str2 branch March 26, 2026 16:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Speed up `length` of text strings#417

Speed up `length` of text strings#417
01mf02 merged 1 commit into
mainfrom
length-str2

01mf02 commented Mar 26, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

01mf02 commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

01mf02 commented Mar 26, 2026 •

edited

Loading