Defined in: modules/computer_vision/TextToImageModule.ts:15
Module for text-to-image generation tasks.
BaseModule
generateFromFrame: (
frameData, ...args) =>any
Defined in: modules/BaseModule.ts:53
Process a camera frame directly for real-time inference.
This method is bound to a native JSI function after calling load(),
making it worklet-compatible and safe to call from VisionCamera's
frame processor thread.
Performance characteristics:
- Zero-copy path: When using
frame.getNativeBuffer()from VisionCamera v5, frame data is accessed directly without copying (fastest, recommended). - Copy path: When using
frame.toArrayBuffer(), pixel data is copied from native to JS, then accessed from native code (slower, fallback).
Usage with VisionCamera:
const frameOutput = useFrameOutput({
pixelFormat: 'rgb',
onFrame(frame) {
'worklet';
// Zero-copy approach (recommended)
const nativeBuffer = frame.getNativeBuffer();
const result = model.generateFromFrame(
{ nativeBuffer: nativeBuffer.pointer, width: frame.width, height: frame.height },
...args
);
nativeBuffer.release();
frame.dispose();
}
});Frame data object with either nativeBuffer (zero-copy) or data (ArrayBuffer)
...any[]
Additional model-specific arguments (e.g., threshold, options)
any
Model-specific output (e.g., detections, classifications, embeddings)
Frame for frame data format details
BaseModule.generateFromFrame
nativeModule:
any=null
Defined in: modules/BaseModule.ts:16
Internal
Native module instance (JSI Host Object)
BaseModule.nativeModule
delete():
void
Defined in: modules/BaseModule.ts:81
Unloads the model from memory and releases native resources.
Always call this method when you're done with a model to prevent memory leaks.
void
BaseModule.delete
forward(
input,imageSize?,numSteps?,seed?):Promise<string>
Defined in: modules/computer_vision/TextToImageModule.ts:163
Runs the model to generate an image described by input, and conditioned by seed, performing numSteps inference steps.
The resulting image, with dimensions imageSize×imageSize pixels, is returned as a base64-encoded string.
string
The text prompt to generate the image from.
number = 512
The desired width and height of the output image in pixels.
number = 5
The number of inference steps to perform.
number
An optional seed for random number generation to ensure reproducibility.
Promise<string>
A Base64-encoded string representing the generated PNG image.
protectedforwardET(inputTensor):Promise<TensorPtr[]>
Defined in: modules/BaseModule.ts:62
Internal
Runs the model's forward method with the given input tensors. It returns the output tensors that mimic the structure of output from ExecuTorch.
Array of input tensors.
Promise<TensorPtr[]>
Array of output tensors.
BaseModule.forwardET
getInputShape(
methodName,index):Promise<number[]>
Defined in: modules/BaseModule.ts:72
Gets the input shape for a given method and index.
string
method name
number
index of the argument which shape is requested
Promise<number[]>
The input shape as an array of numbers.
BaseModule.getInputShape
interrupt():
void
Defined in: modules/computer_vision/TextToImageModule.ts:195
Interrupts model generation. The model is stopped in the nearest step.
void
staticfromCustomModel(sources,onDownloadProgress?,inferenceCallback?):Promise<TextToImageModule>
Defined in: modules/computer_vision/TextToImageModule.ts:78
Creates a Text to Image instance with user-provided model binaries.
Use this when working with a custom-exported diffusion pipeline.
Internally uses 'custom' as the model name for telemetry.
An object containing the pipeline source paths.
(progress) => void
Optional callback to monitor download progress, receiving a value between 0 and 1.
(stepIdx) => void
Optional callback triggered after each diffusion step.
Promise<TextToImageModule>
A Promise resolving to a TextToImageModule instance.
The native model contract for this method is not formally defined and may change between releases. Refer to the native source code for the current expected tensor interface.
staticfromModelName(namedSources,onDownloadProgress?):Promise<TextToImageModule>
Defined in: modules/computer_vision/TextToImageModule.ts:40
Creates a Text to Image instance for a built-in model.
An object specifying the model name, pipeline sources, and optional inference callback.
(stepIdx) => void
(progress) => void
Optional callback to monitor download progress, receiving a value between 0 and 1.
Promise<TextToImageModule>
A Promise resolving to a TextToImageModule instance.
import { TextToImageModule, BK_SDM_TINY_VPRED_512 } from 'react-native-executorch';
const tti = await TextToImageModule.fromModelName(BK_SDM_TINY_VPRED_512);