Feat/coreml #308
💡 Codex Review
Here are some automated review suggestions for this pull request.
```swift
#else
    package func getModel(modelURL: URL) -> Zenz? {
        self.zenzStatus = "zenz-v2 model unavailable on this platform"
        return nil
```
Non-CoreML builds never load zenz models
When `ZenzaiCoreML` is not set, `getModel` now always marks the model as unavailable and returns nil, but the rest of the pipeline still checks `zenzaiMode.enabled`. On Zenzai/ZenzaiCPU builds this means the llama backend is never loaded and `convertToLattice` falls through to the non-zenz path, so enabling `zenzaiMode` no longer produces any zenz candidates or next-character predictions even though the trait and weight URL are provided. This is a regression from the previous behavior.
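For illustration, a minimal sketch of the trait gating this finding points at, assuming the `Zenzai`, `ZenzaiCPU`, and `ZenzaiCoreML` trait names are available as compilation conditions; the enclosing type and the loader helpers are placeholders, not the actual implementation:

```swift
// Sketch only: placement and helper names are hypothetical.
extension KanaKanjiConverter {
    package func getModel(modelURL: URL) -> Zenz? {
        #if ZenzaiCoreML
        // CoreML-backed zenz model
        return getCoreMLModel(modelURL: modelURL)
        #elseif Zenzai || ZenzaiCPU
        // llama.cpp-backed zenz model (the previous behavior the comment refers to)
        return getLlamaModel(modelURL: modelURL)
        #else
        self.zenzStatus = "zenz-v2 model unavailable on this platform"
        return nil
        #endif
    }
}
```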
Force-pushed from 5a07f7d to ddfa88d.
Force-pushed from aaf2dd8 to 32560bc.
Adds support for CoreML-based Zenzai conversion, leveraging the `ZenzCoreMLBackend` for improved performance and efficiency on compatible devices. This includes:

- Integration of the CoreML model for stateful generation.
- A new `ZenzContext+CoreML.swift` to manage the CoreML inference process.
- Updates to `Package.swift` to include the `swift-transformers` dependency.
- Conditional compilation to switch between the CoreML and CPU/Metal implementations based on the `ZenzaiCoreML` flag.
- Added tokenizer files and marks the mlpackage for LFS.

The changes aim to provide a faster and more resource-efficient Zenzai conversion method on Apple platforms with Neural Engine support.
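For context, a minimal sketch of what a stateful CoreML decode step can look like with the macOS 15 stateful-prediction API, assuming a compiled zenz model; the feature names (`input_ids`, `logits`) are placeholders, not the actual schema used by `ZenzContext+CoreML.swift`:

```swift
import CoreML

// Sketch of one stateful decode step; input/output names are assumptions.
func decodeStep(model: MLModel, state: MLState, tokens: [Int32]) throws -> MLMultiArray {
    // Pack the token ids into a [1, n] multi-array.
    let ids = try MLMultiArray(shape: [1, NSNumber(value: tokens.count)], dataType: .int32)
    for (i, token) in tokens.enumerated() {
        ids[[0, NSNumber(value: i)]] = NSNumber(value: token)
    }
    let input = try MLDictionaryFeatureProvider(dictionary: ["input_ids": ids])
    // The recurrent state (e.g. a KV cache) lives in `state` between calls.
    let output = try model.prediction(from: input, using: state)
    return output.featureValue(for: "logits")!.multiArrayValue!
}

// Usage: create the state once with model.makeState(), then call decodeStep per step.
```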
This reverts commit 63f31ac.
Reason: the swift-transformers package uses Combine, which is not compatible with Windows.
Enables building the application with CoreML support for enhanced performance on Apple devices. This allows users to leverage CoreML acceleration by using the `--zenzai-coreml` flag during installation.
Removes the explicit LFS bootstrap step from CI workflows, as it is no longer required due to changes in how dependencies are managed. This simplifies the CI configuration and reduces build times.

Adds documentation for development setup in multiple languages, covering build instructions, testing, and devcontainer usage. This enhances the developer experience by providing clear and comprehensive instructions in English, Japanese, and Korean.

Improves Zenzai integration, adding CoreML support and cross-platform compatibility with Swift 6 concurrency. This allows for high-precision neural Kana-Kanji conversion on Apple devices using CoreML, while ensuring compatibility and performance on both Darwin and Linux platforms.
- replace the blockingAsync result capture with a lock-protected sendable box so Task.detached no longer trips Swift 6.3 data-race checks
- align the CoreML macOS linker platform_version hint with the 15.5 XCFramework minimum to remove stale version mismatch warnings
- keep the existing CoreML sync bridge behavior intact while restoring a successful ZenzaiCoreML build
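A minimal sketch of the lock-protected sendable box pattern described above, assuming `blockingAsync` bridges an async operation into a synchronous call; the real code in this PR may differ in detail:

```swift
import Dispatch
import Foundation

// A box that is safe to hand to Task.detached: access is serialized by the lock,
// so marking it @unchecked Sendable does not hide a real data race.
final class ResultBox<Value>: @unchecked Sendable {
    private let lock = NSLock()
    private var value: Value?
    func set(_ newValue: Value) { lock.lock(); value = newValue; lock.unlock() }
    func take() -> Value? { lock.lock(); defer { lock.unlock() }; return value }
}

func blockingAsync<Value: Sendable>(_ body: @escaping @Sendable () async -> Value) -> Value {
    let box = ResultBox<Value>()
    let done = DispatchSemaphore(value: 0)
    Task.detached {
        box.set(await body())
        done.signal()
    }
    done.wait()
    return box.take()!
}
```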
- reintroduce the CoreML-facing request option shims and converter entry points that upstream main now expects from AncoSession and prediction flows
- reconnect the Zenz prompt/context helpers after rebasing so llama-only helpers stay out of CoreML builds while shared APIs remain available
- remove the duplicate fixed-size heap implementation and expose tokenizer APIs needed by the CoreML context so both default and ZenzaiCoreML builds succeed
- remove the zenz-CoreML Swift package dependency and load the stateful 8-bit CoreML assets directly from the Skyline23/zenz-coreml Hugging Face repo
- download tokenizer and mlpackage assets into a local cache, compile the mlpackage to mlmodelc, and reuse the compiled model on subsequent launches
- stop install_cli.sh from expecting a copied CoreML framework and document that CoreML assets are fetched from Hugging Face at runtime
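A minimal sketch of the compile-and-reuse step described above, assuming the assets were already downloaded into a local cache directory; the file names and layout here are illustrative, not the loader's actual structure:

```swift
import CoreML
import Foundation

// Compile a downloaded .mlpackage to .mlmodelc once and reuse it on later launches.
func compiledModelURL(packageURL: URL, cacheDirectory: URL) throws -> URL {
    let cached = cacheDirectory.appendingPathComponent("zenz.mlmodelc")
    if FileManager.default.fileExists(atPath: cached.path) {
        return cached  // already compiled on a previous launch
    }
    // MLModel.compileModel(at:) writes the compiled model to a temporary location.
    let temporary = try MLModel.compileModel(at: packageURL)
    try FileManager.default.createDirectory(at: cacheDirectory, withIntermediateDirectories: true)
    try FileManager.default.moveItem(at: temporary, to: cached)
    return cached
}
```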
- switch the default HF CoreML artifact selection to the working stateful FP16 package while keeping 8-bit available via environment override
- make the loader follow the current Hugging Face main artifact layout and materialize mlpackage resources into a local cache before compiling them to mlmodelc
- align the CoreML stateful decode path with the current app reference so fresh-cache zenz_evaluate produces stable text instead of collapsing into repeated replacement tokens
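As a rough illustration of the environment override mentioned above (the variable name and artifact identifiers below are hypothetical, not what the loader actually reads):

```swift
import Foundation

// Hypothetical override: fall back to the stateful FP16 artifact unless the
// environment explicitly selects another package (e.g. the 8-bit one).
let artifact = ProcessInfo.processInfo.environment["ZENZ_COREML_ARTIFACT"] ?? "stateful-fp16"
```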
- move CoreML and llama-specific model lifecycle helpers out of KanaKanjiConverter.swift so the main converter flow reads as backend-agnostic logic
- switch ZenzCoreMLService from NSLock-based state management to an actor to make the CoreML path easier to reason about during review
- keep behavior unchanged while making the backend split more explicit for follow-up cleanup around shared Zenz APIs
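A minimal sketch of the NSLock-to-actor shape this commit describes (method and property names are illustrative; note that the actor refactor was later pulled out of this PR's scope):

```swift
import CoreML
import Foundation

// Actor isolation replaces manual NSLock bookkeeping around the loaded model.
actor ZenzCoreMLServiceSketch {
    private var model: MLModel?

    func loadIfNeeded(from compiledModelURL: URL) throws -> MLModel {
        if let model {
            return model  // reuse the already-loaded model
        }
        let loaded = try MLModel(contentsOf: compiledModelURL)
        self.model = loaded
        return loaded
    }
}
```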
- reintroduce the upstream session-scoped predictive input and stable prediction candidate caches inside KanaKanjiConverter so prediction view behavior matches main again
- wire the existing CoreML backend path into the restored session state flow instead of bypassing the cache-aware converter logic
- fix the failing AncoSession and Scenario prediction stability tests that were breaking the latest macOS and ubuntu CI runs
- pin swift-transformers to the revision with the Android localized string fallback
- make Android CI resolve a deterministic compatibility commit instead of a moving branch head
- preserve the existing CoreML and llama.cpp package structure
- pin swift-transformers to the latest main revision e5e227b
- keep the repo on the upstream Android-compatible Hub/Tokenizer stack
- retain the existing CoreML and llama.cpp integration in this branch
- remove CoreML-only stub returns from the Zenz wrapper type
- gate unsupported predictive input and typo generation at the converter call sites
- keep backend capability differences local to the integration layer for easier review
- revert the latest-main pin that regressed non-mac CI
- keep the known-good Android-compatible revision for this PR
- defer the upstream swift-transformers main upgrade until its cross-platform issues are resolved
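For reference, a sketch of what such a deterministic revision pin looks like in `Package.swift`; the repository URL and the placeholder hash below are assumptions, not the exact pin used on this branch:

```swift
// swift-tools-version: 6.0
import PackageDescription

let package = Package(
    name: "AzooKeyKanaKanjiConverter",
    dependencies: [
        // Pinning to an exact commit keeps CI resolution deterministic,
        // unlike a branch dependency that follows a moving head.
        .package(
            url: "https://github.com/huggingface/swift-transformers",
            revision: "<full-commit-hash>"  // placeholder; use the known-good revision
        )
    ]
)
```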
Follow-up notes after the latest cleanup pass:

Future cleanup items that still fit this PR's scope:

I also tested upgrading swift-transformers to its latest main. Exact failure reasons from the latest-main attempt (…):

Because of that, the branch stays pinned to the known-good compatibility revision for swift-transformers.
@ensan-hcl Could you please re-review my CoreML branch? I removed the actor-based refactoring and the README changes from this PR to keep the scope focused. I will submit those improvements in a separate follow-up PR.
PR Summary (feat/coreml)
Overview
This branch isolates CoreML integration (no actor refactor) and makes it buildable/usable via the CLI with a CoreML-only flag. Sync APIs remain primary; async wrappers are thin bridges.
Key changes
- `Zenz` and `ZenzCoreMLService` marked `@unchecked Sendable` to unblock async bridging.
- `blockingAsync` now accepts `@Sendable` closures via `Task.detached`; the CoreML personalization handle is mapped explicitly to a tuple.
- When the `ZenzaiCoreML` trait is used, the macOS linker gets a `-platform_version macos 15.0 15.0` hint to silence xcframework version warnings while keeping the default macOS 13 baseline for non-CoreML builds.
- The install flow takes `--zenzai-coreml` (plus optional `--debug`).
- `AzooKeyKanaKanjiConverter_KanaKanjiConverterModuleWithDefaultDictionary.bundle` is installed into `/usr/local/bin/`.
- The CoreML build does not require `llama.framework`; for Zenzai/ZenzaiCPU builds, it is still expected.
- Add Support Page for English/Korean.
How to build and install (CoreML)
Known warnings / notes
- `updateIfRequired(options:)` in `KanaKanjiConverter` is deprecated, but behavior is unchanged; it can be refactored later.
- Minor warnings remain (`intention` in `InputTable.swift`, redundant `try` in `ZenzContext+CoreML.swift`) with no behavioral impact.