Data track schema metadata by ladvoc · Pull Request #1159 · livekit/rust-sdks

ladvoc · 2026-06-10T22:57:57Z

Adds support for associating schema metadata with a published data track and storing/retrieving schema definitions.

Schema storage is built on-top of data blobs, a general purpose mechanism for storing large (in the order of KBs), arbitrary data blobs in a room:

Protocol: Data blobs protocol#1595
SFU implementation: https://github.com/livekit/livekit/tree/raja_async_attributes

Protocol additions for schema metadata:

Data tracks schema metadata protocol#1553

github-actions · 2026-06-10T22:58:11Z

Changeset

The following package versions will be affected by this PR:

Package	Bump
`livekit`	`patch`
`livekit-datatrack`	`patch`
`livekit-ffi`	`patch`

Adds request ID fields

ladvoc · 2026-06-17T19:11:29Z

For local testing:

Use this SFU branch: https://github.com/livekit/livekit/tree/ladvoc/schema-metadata
Build and run with data blobs support enabled: LIVEKIT_CONFIG="enable_participant_data_blob: true" ./bin/livekit-server --dev

1egoman

Generally makes sense to me!

I think DataTrackSchemaId being a non primative value is key here to future interface evolution (like adding a generic type with a default which when set explictly can be used to assert the type of payloads later on).

1egoman · 2026-06-22T18:50:56Z

+    const DATA_BLOB_REQUEST_TIMEOUT: Duration = Duration::from_secs(5);
+
+    /// Stores an arbitrary blob of data on the server, keyed by `key`.
+    async fn store_data_blob(&self, key: proto::DataBlobKey, contents: Bytes) -> EngineResult<()> {


question: Should this be RoomResult<()>, not EngineResult<()>? There's a few other instances of this in the file as well.

1egoman · 2026-06-22T19:18:55Z

+    ///   nor validated against its [encoding](DataTrackSchemaId::encoding), so
+    ///   the caller is responsible for ensuring it is well-formed.
+    ///
+    pub async fn define_schema(&self, id: DataTrackSchemaId, definition: String) -> RoomResult<()> {


thought: Are you confident that schemas will / should always be text? Or should definition: String be some sort of bytes object, like Vec<u8> or bytes::Bytes (or maybe impl Into<bytes::Bytes>)?

Yes, while the wire format (frame content) will often be binary, I can't think of any case where the schema definition language would be binary. @stephen-derosa, wdyt?

1egoman · 2026-06-22T19:25:03Z

+    let (pub_room, _) = rooms.pop().unwrap();
+    let (_, mut sub_room_event_rx) = rooms.pop().unwrap();
+
+    let schema_id = DataTrackSchemaId::new("my_schema", DataTrackSchemaEncoding::JsonSchema);


question: Is this DataTrackSchemaId::new(...) expected to be a typical pattern, maybe in the case that you know that a schema already exists on the SFU end? Or is the idea that in practice, a user would always call LocalParticipant::deine_schema and this is just being done this way for testing?

stephen-derosa

in general looks good, need to give it a deeper dive

stephen-derosa · 2026-06-23T15:32:31Z

  optional SubscribeDataTrackError error = 1;
 }
+
+// MARK: - Schemas


FMU, what is this // MARK: notation?

This is recognized by most editors (see the minimap and navigator in Cursor/Vscode).

stephen-derosa · 2026-06-23T15:39:14Z

    }

+    #[test]
+    fn test_frame_encoding_mapping() {


test_empty_frame_encoding ?

stephen-derosa · 2026-06-23T15:46:57Z

+    ///
+    /// Called by a publisher to make a schema available to subscribers, who can
+    /// later look up its definition via [`get_schema`](Self::get_schema). Define a
+    /// schema before publishing any data track that references it, so that


what is the behavior if a data track with a schema is published before the schema itself is published?

The track will be published normally and carry the schema ID. If you care about retrieving it on the subscriber end, it is recommended to store the schema before publishing the track to ensure it is available right away, but if you don't need the actual definition for you application, you can simply use schema ID as an identifier.

ladvoc added 13 commits June 5, 2026 15:32

Pin protocol to ladvoc/schema-metadata

ff47c71

Expose schema and frame encoding

d28a823

Clean up conversion

419e094

Add unit test

2099af1

Document encoding enum cases

0a48860

Export under api module

d5832d8

Define conversion to data blob key

327140d

Prototype

0099197

Expose over FFI

48cd8a1

Changeset

df2d4f7

Add E2E test cases

2248728

Document core types

682a63c

Document store and get methods

ea5c92e

github-actions Bot and others added 14 commits June 10, 2026 22:58

generated protobuf

b3e6fb7

Update protocol

8f74a90

Adds request ID fields

Wire up request/response

49b4fd9

Update protocol

fc19418

Move data blob methods to local participant

5ab44b2

Update protocol

ea63124

Derive eq and hash on data blob proto types

ab8d654

Wire up

c31b6fd

Note to make internal

e56d21b

Use request ID

125a7d4

Make data blobs private, test schema storage only

a3b39ca

Add accessors for schema and frame encoding

33baaf7

Test publish with metadata

18381f2

Don't log error

62bf4bb

ladvoc mentioned this pull request Jun 17, 2026

Data track schema metadata livekit/client-sdk-cpp#176

Draft

Expose data track fields over FFI

a271c9d

ladvoc marked this pull request as ready for review June 18, 2026 18:35

ladvoc requested review from 1egoman, MaxHeimbrock, alan-george-lk, lukasIO, stephen-derosa and xianshijing-lk as code owners June 18, 2026 18:35

1egoman reviewed Jun 22, 2026

View reviewed changes

stephen-derosa reviewed Jun 23, 2026

View reviewed changes

ladvoc and others added 4 commits June 23, 2026 14:34

Merge remote-tracking branch 'origin/main' into ladvoc/schema-metadata

14af23c

Merge remote-tracking branch 'origin/main' into ladvoc/schema-metadata

24236c3

Pin protocol

d560373

generated protobuf

aed1fed

Conversation

ladvoc commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 10, 2026

Changeset

Uh oh!

ladvoc commented Jun 17, 2026

Uh oh!

1egoman left a comment

Choose a reason for hiding this comment

Uh oh!

1egoman Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

1egoman Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

ladvoc Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

1egoman Jun 22, 2026

Choose a reason for hiding this comment

Uh oh!

stephen-derosa left a comment

Choose a reason for hiding this comment

Uh oh!

stephen-derosa Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

ladvoc Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

stephen-derosa Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

stephen-derosa Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

ladvoc Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ladvoc commented Jun 10, 2026 •

edited

Loading

1egoman Jun 22, 2026 •

edited

Loading

ladvoc Jun 23, 2026 •

edited

Loading