perf/blob io by 2e0byo · Pull Request #240 · EbbLabs/mopidy-tidal

2e0byo · 2026-04-30T13:22:45Z

This PR is on top of #239, though if that's rejected it will rebase on top of #238.

It replaces multiple chunk-sized records with a single record for the body, allocated up front with the correct size, and then buffered with sqlite's blobopen function (which returns a file pointer to a blob record). I've kept it compatible with the old format and it continues to play cached data in that format.

This should lead to improved DB performance, as previously the DB fragmented very quickly, and hopefully will lead to reduced RAM usage.

I've also taken the opportunity to clean up dead / redundant code from the development process of the proxy.

From using the cache for several months I've noticed:

RAM usage sometimes kills mopidy on my raspberry pi (the ram headroom is marginal anyway and I'm running a pretty hevayweight NixOS I should tune, but it's definitely cache related)
if gstreamer crashes, pykka can re-spawn a thread with a now stale DB connection

Gstreamer crashes (based on seeking around aggressively) seem to go away when I used the default HTTP sink and not curl.

Seeking is kept in the calling interface to avoid needing to provide an abstraction over multiple file pointer. Note that the type is wrong, and Blob doesn't support .readinto, annoyingly.

Sqlite already has a concept of treating a blob field like a file and buffering writes to it, so let's use that rather than thousands of tiny rows. That should keep the db much more performant. I've kept the chunking for now so it will work with the older approach, or if we decide to split into smaller chunk sizes like 128 MiB. But much of that code will now be a no-op.

When I first wrote this I split the logic from the impl and developed against a dict-based cache for simplicity. When that cache worked in the proxy I then parametrised tests over it and the sqlite impl until the latter reached parity. That development cache is pointless (in fact python apparently first shipped with sqlite because Guido didn't want to write a hashmap: it's almost performant enough for the job), and has long been deleted. This commit gets rid of the ABC, which no longer offers any useful separation. At the same time we can commit to SQLite directly in the API, which simplifies things somewhat.

2e0byo added 20 commits April 28, 2026 15:16

feat: Lazy[T] type.

44b4597

fix: allow storing None in Lazy.

ac6cd5e

refactor: explicit lazy playback cache access for a uri.

6557b7e

fix: use track's remote url when starting proxy.

c72f44b

refactor: make .active_session type safe.

5d4b769

fix: use sorted offsets everywhere.

b333ca9

perf: use blobopen for reading chunks.

7299266

refactor: move file pointer interface up.

1101d29

refactor: add .open_range.

2f9c59d

perf: use file pointers directly.

d6a2d05

Seeking is kept in the calling interface to avoid needing to provide an abstraction over multiple file pointer. Note that the type is wrong, and Blob doesn't support .readinto, annoyingly.

fix: start logic.

565bd25

chore: better logging.

990c97e

perf: avoid draining buffer if not required.

ba38bf9

chore: get rid of commented code.

a541ead

test: decouple from chunk size.

1f6ca58

chore: cleanup.

48e5e61

chore: fixups.

9fbf179

refactor: use writer directly.

05f1905

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf/blob io#240

perf/blob io#240
2e0byo wants to merge 20 commits into
EbbLabs:mainfrom
2e0byo:perf/blob-io

2e0byo commented Apr 30, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

2e0byo commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

2e0byo commented Apr 30, 2026 •

edited

Loading