SecureMR-Samples/samples/mnistwild at main · Pico-Developer/SecureMR-Samples

Sample: MNIST Recognition

This sample builds a SecureMR application that recognizes handwritten digits appearing in the VST camera feed. Unlike the other samples, both the preprocessing and inference pipelines are described entirely in JSON (mnist_pipeline.json) and deserialized at runtime to create the SecureMR pipeline graph.

The experience renders three virtual TVs in front of the user. The cropped digit, its predicted class, and the confidence score are displayed on screen in real time.

Assets

Model: mnist.serialized.bin (generated from the classic MNIST network)
Pipeline spec: mnist_pipeline.json
Display mesh: tv.gltf

All assets are packaged under assets/mnistwild and loaded through Android's AAssetManager.

Runtime Flow

Framework setup
MnistWildApp spins up a FrameworkSession, creates required global tensors, and loads the JSON pipeline specification plus GLTF assets on a background thread.
Inference pipeline (JSON)
The mnist_pipeline.json specification expands to the following operator chain:
- RECTIFIED_VST_ACCESS acquires synchronized left/right RGB frames, timestamps, and camera parameters.
- GET_AFFINE computes an affine transform from hard-coded source/destination points so that the drawing area can be cropped from the high-resolution VST frame.
- APPLY_AFFINE crops the digit region (crop_rgb_tensor), after which CONVERT_COLOR converts it to grayscale.
- Two ASSIGNMENT operators duplicate buffers for later use: one keeps the RGB crop for rendering, the other prepares a float copy.
- ARITHMETIC_COMPOSE normalizes the grayscale image to [0, 1] by dividing by 255.
- RUN_MODEL_INFERENCE feeds the normalized tensor into the MNIST model and produces predicted_score (confidence) and predicted_class (digit index).
Tensors marked as placeholders in the JSON (cropped_image, predicted_score, predicted_class) are bound to global tensors so the app can read the results directly after each submission.
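To make the preprocessing steps above more concrete, here is a minimal standalone C++ sketch of the math behind two of them: deriving a 2x3 affine matrix from point correspondences (what GET_AFFINE conceptually computes) and scaling 8-bit grayscale values to [0, 1] (the ARITHMETIC_COMPOSE step). It does not use the SecureMR API. The names Point, Affine, AffineFromPoints, ApplyAffine, and NormalizeToUnitRange are hypothetical, and the use of exactly three point pairs is an assumption, since the number of points in the JSON is not stated here.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical helper types for illustration; not part of the SecureMR API.
struct Point {
    float x;
    float y;
};

// Row-major 2x3 matrix [a b c; d e f]: u = a*x + b*y + c, v = d*x + e*y + f.
using Affine = std::array<float, 6>;

// Solves one matrix row (three unknowns) with Cramer's rule.
// `t` holds the target coordinate (u or v) of each destination point.
static std::array<float, 3> SolveRow(const std::array<Point, 3>& s,
                                     const std::array<float, 3>& t, float det) {
    const float a = (t[0] * (s[1].y - s[2].y) - s[0].y * (t[1] - t[2]) +
                     (t[1] * s[2].y - t[2] * s[1].y)) / det;
    const float b = (s[0].x * (t[1] - t[2]) - t[0] * (s[1].x - s[2].x) +
                     (s[1].x * t[2] - s[2].x * t[1])) / det;
    const float c = (s[0].x * (s[1].y * t[2] - s[2].y * t[1]) -
                     s[0].y * (s[1].x * t[2] - s[2].x * t[1]) +
                     t[0] * (s[1].x * s[2].y - s[2].x * s[1].y)) / det;
    return {a, b, c};
}

// Computes the affine matrix that maps three source points onto three
// destination points. The source points must not be collinear.
Affine AffineFromPoints(const std::array<Point, 3>& src,
                        const std::array<Point, 3>& dst) {
    const float det = src[0].x * (src[1].y - src[2].y) -
                      src[0].y * (src[1].x - src[2].x) +
                      (src[1].x * src[2].y - src[2].x * src[1].y);
    assert(std::fabs(det) > 1e-6f);
    const auto row_u = SolveRow(src, {dst[0].x, dst[1].x, dst[2].x}, det);
    const auto row_v = SolveRow(src, {dst[0].y, dst[1].y, dst[2].y}, det);
    return {row_u[0], row_u[1], row_u[2], row_v[0], row_v[1], row_v[2]};
}

// Maps a single point through the affine matrix.
Point ApplyAffine(const Affine& m, Point p) {
    return {m[0] * p.x + m[1] * p.y + m[2], m[3] * p.x + m[4] * p.y + m[5]};
}

// Scales 8-bit grayscale values to floats in [0, 1] by dividing by 255.
std::vector<float> NormalizeToUnitRange(const std::vector<std::uint8_t>& gray) {
    std::vector<float> out;
    out.reserve(gray.size());
    for (std::uint8_t value : gray) {
        out.push_back(static_cast<float>(value) / 255.0f);
    }
    return out;
}
```

As a usage illustration with made-up coordinates, mapping the corners (100, 200), (300, 200), (100, 400) of a 200x200 pixel region onto (0, 0), (28, 0), (0, 28) yields a uniform scale of 0.14 with a translation of (-14, -28). The 28x28 target is the classic MNIST input size and is assumed here rather than taken from the sample's JSON.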
Render pipeline (C++)
A native pipeline (CreateRenderPipeline) maps the inference outputs onto the GLTF TV:
- Updates dynamic textures so the RGB crop appears on the TV screen.
- Draws the predicted class and score using RenderCommand_DrawText.
- Positions three GLTF instances (digit text, score text, TV) via pose tensors to keep the visualization fixed in front of the user.
Execution loop
Two worker threads call RunInferencePipeline (~20 Hz) and RunRenderPipeline (~25 Hz) once all pipelines finish initializing.
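The execution loop can be pictured with a small standard-library sketch: two threads, each invoking a callback at a roughly fixed rate until a shared flag is cleared. This is only an illustration of the pattern, not the sample's actual code. RunAtRate, LoopCounts, and RunWorkersFor are hypothetical names, and the counter increments stand in for the real RunInferencePipeline and RunRenderPipeline calls.

```cpp
#include <atomic>
#include <cassert>
#include <chrono>
#include <functional>
#include <thread>

// Calls `step` at roughly `hz` iterations per second until `running` is false.
// Scheduling against absolute deadlines keeps the average rate from drifting
// when a single step takes longer than expected.
void RunAtRate(double hz, const std::atomic<bool>& running,
               const std::function<void()>& step) {
    assert(hz > 0.0);
    using Clock = std::chrono::steady_clock;
    const auto period = std::chrono::duration_cast<Clock::duration>(
        std::chrono::duration<double>(1.0 / hz));
    auto next_tick = Clock::now();
    while (running.load()) {
        step();
        next_tick += period;
        std::this_thread::sleep_until(next_tick);
    }
}

struct LoopCounts {
    int inference;
    int render;
};

// Runs both loops for `run_time` and reports how often each callback fired.
// The lambdas are placeholders for the sample's pipeline submissions.
LoopCounts RunWorkersFor(std::chrono::milliseconds run_time) {
    std::atomic<bool> running{true};
    std::atomic<int> inference_runs{0};
    std::atomic<int> render_runs{0};

    std::thread inference_thread(
        [&] { RunAtRate(20.0, running, [&] { ++inference_runs; }); });
    std::thread render_thread(
        [&] { RunAtRate(25.0, running, [&] { ++render_runs; }); });

    std::this_thread::sleep_for(run_time);
    running.store(false);  // ask both loops to exit after their current step
    inference_thread.join();
    render_thread.join();
    return {inference_runs.load(), render_runs.load()};
}
```

Running the two loops independently means a slow inference step does not stall rendering; the render loop simply redraws the most recent results it can read.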

How to Build and Run

Build and install
Connect your PICO 4 Ultra device and run:
```
./gradlew :samples:mnistwild:installDebug
```
Run the app
Launch the "MnistWild" app from the PICO launcher. The app will request permissions for the VST camera; grant them to start the recognition pipeline.

Customizing the JSON Pipeline

Adjust src_points in the JSON if the digit capture area shifts in the physical world.
Replace mnist.serialized.bin to use a retrained model; the binding names in the JSON must match those of the ONNX/TensorRT export (input_1, _538, _539 in the default export).
Additional preprocessing steps (e.g., blurring, adaptive thresholding) can be inserted by editing the JSON without touching the C++ side, as long as the placeholder tensors remain consistent.

Because the JSON is deserialized at runtime, developers can iterate on pipeline topology and operators without recompiling the sample; copying an updated mnist_pipeline.json to the writable path is enough to override the bundled asset.
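The override behaviour described above amounts to a "prefer the writable copy, otherwise fall back to the bundled asset" lookup. The following C++17 sketch shows that logic under stated assumptions: ResolvePipelineSpec and ReadFile are hypothetical helpers, the writable directory is passed in by the caller because its actual location is not given here, and the bundled contents are passed as a string rather than read through AAssetManager so the example stays self-contained.

```cpp
#include <cassert>
#include <filesystem>
#include <fstream>
#include <sstream>
#include <string>
#include <system_error>

namespace fs = std::filesystem;

// Reads a whole file into a string.
std::string ReadFile(const fs::path& path) {
    std::ifstream in(path, std::ios::binary);
    std::ostringstream buffer;
    buffer << in.rdbuf();
    return buffer.str();
}

// Returns the contents of `file_name` from `writable_dir` when such a file
// exists, otherwise returns `bundled_spec` (the copy shipped with the app).
// Hypothetical helper: the sample's real lookup may differ.
std::string ResolvePipelineSpec(const fs::path& writable_dir,
                                const std::string& file_name,
                                const std::string& bundled_spec) {
    const fs::path candidate = writable_dir / file_name;
    std::error_code ec;  // non-throwing overload: a missing file is not an error
    if (fs::is_regular_file(candidate, ec)) {
        return ReadFile(candidate);
    }
    return bundled_spec;
}
```

With a lookup like this, removing the file from the writable directory restores the bundled pipeline on the next launch.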
