
feat: Improved LLM public API #243

Merged
pweglik merged 8 commits into v0.4.0-rc1 from
@pw/improvements-in-llm-api
May 8, 2025

Conversation

@pweglik
Contributor

@pweglik pweglik commented May 6, 2025

Added `generate` function, and some little reformats

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation update (improves or adds clarity to existing documentation)

Tested on

  • iOS
  • Android

Related issues

#226

Checklist

  • I have performed a self-review of my code
  • I have updated the documentation accordingly
  • My changes generate no new warnings

@pweglik pweglik requested review from chmjkb, jakmro and mkopcins May 6, 2025 10:40
@pweglik pweglik self-assigned this May 6, 2025
@pweglik pweglik linked an issue May 6, 2025 that may be closed by this pull request
@pweglik pweglik changed the title Improved LLM public API feat: Improved LLM public API May 6, 2025
Base automatically changed from @nk/qwen-3 to v0.4.0-rc1 May 7, 2025 07:21
@pweglik pweglik requested a review from NorbertKlockiewicz May 7, 2025 08:35
Comment thread docs/docs/typescript-api/LLMModule.md Outdated
Alternatively, you can use `runInference`. It provides direct access to the model, without any wrapper, so the input string is passed straight into the model. If you're not sure what are implications of that, you're better off with `sendMessage`
Alternatively, you can use the `generate` method. It lets you simply pass chat messages and receive a completion from the model. It doesn't provide any message history management.

If you need the raw model, without any wrappers, you can use `forward`. It provides direct access to the model, so the input string is passed straight into the model. It may be useful for working with models that aren't fine-tuned for chat completions. If you're not sure what the implications of that are, you're better off with `sendMessage`.
Contributor

Suggested change
If you need the raw model, without any wrappers, you can use `forward`. It provides direct access to the model, so the input string is passed straight into the model. It may be useful for working with models that aren't fine-tuned for chat completions. If you're not sure what the implications of that are, you're better off with `sendMessage`
If you need the raw model, without any wrappers, you can use `forward`. It provides direct access to the model, so the input string is passed straight into the model. It may be useful for working with models that aren't fine-tuned for chat completions. If you're not sure what the implications of that are, you're better off with `sendMessage`.

Collaborator

I think you can mention here that you need to add special tokens to make it work
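
The split between the three entry points discussed above could be illustrated with a short sketch. This is a toy mock, not the library's implementation: the `Message` shape, the template string, and the stand-in "model" are all invented here to show the difference in responsibility between `sendMessage`, `generate`, and `forward`.

```typescript
// Toy mock of the three API levels -- NOT the library's real implementation.
// Types, the template string, and the "model" below are illustrative only.

type Message = { role: 'system' | 'user' | 'assistant'; content: string };

// Stand-in for the real LLM: just echoes its prompt.
const toyModel = (prompt: string): string => `echo(${prompt})`;

// forward(): raw access. The input string goes straight into the model,
// so the caller must supply any chat template / special tokens themselves.
function forward(input: string): string {
  return toyModel(input);
}

// generate(): applies a chat template to the messages, but keeps no history.
function generate(messages: Message[]): string {
  const prompt = messages
    .map((m) => `<|${m.role}|>${m.content}`) // hypothetical template
    .join('\n');
  return toyModel(prompt);
}

// sendMessage(): manages conversation history on top of generate().
const history: Message[] = [];
function sendMessage(content: string): string {
  history.push({ role: 'user', content });
  const reply = generate(history);
  history.push({ role: 'assistant', content: reply });
  return reply;
}
```

With this split, `sendMessage` is the safe default, `generate` suits one-shot chat completions, and `forward` is reserved for callers who format the prompt (including special tokens) themselves.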

Comment thread examples/llm/App.tsx Outdated
import { View, StyleSheet, Text } from 'react-native';

enum ModelType {
enum ModeType {
Contributor

Maybe just `Mode` instead of `ModeType`?

Comment thread src/controllers/LLMController.ts Outdated
this.responseCallback('');
this.isGeneratingCallback(true);
await this.nativeModule.runInference(input);
console.log('INPUT:', input);
Contributor

Remove the `console.log`.

@NorbertKlockiewicz
Contributor

Can you also bump huggingface/jinja to the newest version (0.5.0)? The current one has problems with the Qwen 3 tokenizer.

@pweglik pweglik requested a review from NorbertKlockiewicz May 7, 2025 11:09
Comment thread README.md
Collaborator

I believe this is wrong: you're supposed to pass all the special tokens to `forward()`, but the prompt definition above doesn't do that. Change this call to `generate()` (not sure if this one is ok).
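
For context on the special-token point: a raw `forward` call only makes sense if the prompt already contains the model's chat-template tokens. A hedged sketch of what that preprocessing might look like — the `<s>`/`[INST]` tokens below are Llama-2-style and purely illustrative; the real tokens come from the model's tokenizer config:

```typescript
// Illustrative only: wrap messages in Llama-2-style special tokens before a
// raw forward() call. The actual token strings depend on the model in use.
type ChatMsg = { role: 'user' | 'assistant'; content: string };

function applyChatTemplate(messages: ChatMsg[]): string {
  return messages
    .map((m) =>
      m.role === 'user'
        ? `<s>[INST] ${m.content} [/INST]` // user turn opens an instruction
        : ` ${m.content} </s>`             // assistant turn closes the sequence
    )
    .join('');
}

// A README example might then call something like (hypothetical usage):
//   await llm.forward(applyChatTemplate([{ role: 'user', content: prompt }]));
// whereas generate() is expected to do this templating internally.
```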

Comment thread src/controllers/LLMController.ts Outdated
messages: Message[],
tokenizerConfig: any,
tools?: LLMTool[],
template_flags?: Object
Collaborator

Can you make `template_flags` camelCase?

Contributor Author

Sorry, I missed it. Maybe we should enforce camelCase with ESLint? Can you create an issue for it?

Collaborator

Yep, will do

@pweglik pweglik requested a review from chmjkb May 7, 2025 12:49
@pweglik pweglik merged commit 270faa6 into v0.4.0-rc1 May 8, 2025
1 check passed
@pweglik pweglik deleted the @pw/improvements-in-llm-api branch May 8, 2025 14:04

Development

Successfully merging this pull request may close these issues.

Allow users to manage conversation history in LLM

3 participants