
Commit f443ab9

docs: add 'Intel resources worth knowing' section to both READMEs
Curated, opinionated list of upstream Intel-side resources behind the stack, grouped into:
- OpenVINO toolkit (docs, openvino runtime, openvino.genai, NNCF)
- OpenVINO Model Server (model_server repo + demos, optimum-intel)
- Pre-converted models (OpenVINO HuggingFace org)
- Hardware (Core Ultra Series 2 family, Arc Graphics)
- Cloud / remote (Intel Tiber AI Cloud, optional)
- Community (OpenVINO blog, Intel Developer YouTube)

Placed between 'Project structure' and 'Troubleshooting' in both README.md (EN) and README.es.md (ES); TOCs updated symmetrically.
1 parent 501a538 commit f443ab9

2 files changed

Lines changed: 135 additions & 0 deletions


README.es.md

Lines changed: 68 additions & 0 deletions
@@ -54,6 +54,7 @@ Measured throughput: **~18-20 tokens/s** generating with Qwen3-8B INT4 on the iGP
- [OVMS as a Claude Code backend](#ovms-como-backend-de-claude-code)
- [Memory and performance](#memoria-y-rendimiento)
- [Structure](#estructura)
- [Intel resources worth knowing](#recursos-intel-a-tener-en-cuenta)
- [Troubleshooting](#troubleshooting)
- [License](#licencia)
- [Author](#autor)
@@ -505,6 +506,73 @@ Sweet spot: asking it to "explain this file", "generate a test for this function",
└── openwebui-data/ # persistent Open WebUI state
```

## Intel resources worth knowing

A short, opinionated list of upstream Intel-side resources behind this stack
— useful if you want to go deeper, swap components, or contribute.

### OpenVINO toolkit (the runtime underneath everything)

- 📖 [OpenVINO documentation](https://docs.openvino.ai/) — official docs,
the authoritative source for plugin behavior, supported ops and
device hints.
- 🐙 [`openvinotoolkit/openvino`](https://github.com/openvinotoolkit/openvino)
— the C++/Python runtime. Take a look at the [GPU plugin source code](https://github.com/openvinotoolkit/openvino/tree/master/src/plugins/intel_gpu)
when a GPU compile fails in a strange way (the `is_static()` error we
ran into comes from there).
- 🐙 [`openvinotoolkit/openvino.genai`](https://github.com/openvinotoolkit/openvino.genai)
— the LLM-specific runtime layer (continuous batching, KV cache,
chat templates). This is what OVMS uses underneath.
- 🐙 [`openvinotoolkit/nncf`](https://github.com/openvinotoolkit/nncf),
the Neural Network Compression Framework. Read the
[weight compression documentation](https://docs.openvino.ai/2024/openvino-workflow/model-optimization-guide/weight-compression.html)
to understand what `--weight-format int4 --group-size 64` actually does.

### OpenVINO Model Server (OVMS)

- 🐙 [`openvinotoolkit/model_server`](https://github.com/openvinotoolkit/model_server)
— the server we use. The [demos directory](https://github.com/openvinotoolkit/model_server/tree/main/demos)
has canonical `graph.pbtxt` examples for every task (text
generation, embeddings, rerank, image generation, VLMs). When in doubt,
copy from there.
- 🛠 [`optimum-intel`](https://github.com/huggingface/optimum-intel) — the
HuggingFace bridge that converts `Qwen/Qwen3-8B` into an OpenVINO IR.
Our `scripts/export-models.sh` is essentially a wrapper around
`optimum-cli export openvino`.

### Pre-converted models

- 🤗 [`OpenVINO` organization on HuggingFace](https://huggingface.co/OpenVINO)
— officially pre-converted IR models (Qwen, Llama, Phi, Mistral,
embeddings, etc.). If you don't want to wait for the local INT4 conversion,
download one from there and skip `scripts/export-models.sh`.

### Hardware

- 💻 [Intel Core Ultra processors (Series 2)](https://www.intel.com/content/www/us/en/products/details/processors/core-ultra.html)
— the family. Lunar Lake (Series 2) is what this stack is tuned for,
but the same compose file works on Meteor Lake and Arrow Lake H/HX
with the same iGPU/CPU placement logic.
- 💻 [Intel Arc Graphics](https://www.intel.com/content/www/us/en/products/details/discrete-gpus/arc.html)
— the integrated line (Arc 140V here) and the discrete line use the same
OpenVINO plugin. If you have a discrete Arc A770/B580, you can reuse this same
stack with much more headroom for large models.

### Cloud / remote (optional)

- ☁️ [Intel Tiber AI Cloud](https://www.intel.com/content/www/us/en/developer/tools/devcloud/services.html)
— Intel's developer cloud (formerly Intel Developer Cloud).
Useful if you want to try this same OVMS stack on a larger instance (Xeon
+ GPU) before buying hardware.

### Community

- 📰 [OpenVINO blog](https://blog.openvino.ai/) — release notes, performance
numbers and model-support announcements. Subscribe if you live in
this ecosystem.
- 🎥 [Intel Developer on YouTube](https://www.youtube.com/@IntelSoftware)
for technical talks on OpenVINO/OVMS and conference sessions.

## Troubleshooting

- **`/dev/dri` does not exist in WSL**: update Windows + Intel Arc drivers. Restart WSL: `wsl --shutdown`.

README.md

Lines changed: 67 additions & 0 deletions
@@ -55,6 +55,7 @@ Measured throughput: **~18-20 tokens/s** generating with Qwen3-8B INT4 on the Ar
- [OVMS as a Claude Code backend](#ovms-as-a-claude-code-backend)
- [Memory and performance](#memory-and-performance)
- [Project structure](#project-structure)
- [Intel resources worth knowing](#intel-resources-worth-knowing)
- [Troubleshooting](#troubleshooting)
- [License](#license)
- [Author](#author)
@@ -509,6 +510,72 @@ Sweet spot: "explain this file", "write a test for this function", "what does th
└── openwebui-data/ # persistent Open WebUI state (gitignored)
```

## Intel resources worth knowing

A short, opinionated list of upstream Intel-side resources behind this stack
— useful if you want to go deeper, swap components, or contribute.

### OpenVINO toolkit (the runtime under everything)

- 📖 [OpenVINO documentation](https://docs.openvino.ai/) — official docs, the
authoritative source for plugin behavior, supported ops and device hints.
- 🐙 [`openvinotoolkit/openvino`](https://github.com/openvinotoolkit/openvino)
— the core C++/Python runtime. Browse the [GPU plugin source](https://github.com/openvinotoolkit/openvino/tree/master/src/plugins/intel_gpu)
when GPU compilation fails in non-obvious ways (the `is_static()` error
we hit originates there).
- 🐙 [`openvinotoolkit/openvino.genai`](https://github.com/openvinotoolkit/openvino.genai)
— the LLM-specific runtime layer (continuous batching, KV cache, chat
templates). This is what OVMS uses under the hood.
- 🐙 [`openvinotoolkit/nncf`](https://github.com/openvinotoolkit/nncf),
the Neural Network Compression Framework. Read the
[weight compression docs](https://docs.openvino.ai/2024/openvino-workflow/model-optimization-guide/weight-compression.html)
to understand what `--weight-format int4 --group-size 64` actually does.

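As a rough intuition for what `--weight-format int4 --group-size 64` implies, the sketch below quantizes a flat list of weights in independent groups, each with its own scale. It is a simplified symmetric scheme written for illustration, not NNCF's actual algorithm; the function names and the 16-bit scale size are assumptions.

```python
# Simplified symmetric group-wise INT4 quantization (illustration only,
# NOT NNCF's implementation; names and the 16-bit scale are assumptions).


def quantize_group(weights):
    """Quantize one group of floats to signed 4-bit codes plus one scale."""
    max_abs = max(abs(w) for w in weights)
    # Signed 4-bit range is [-8, 7]; map the largest magnitude to 7.
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    codes = [max(-8, min(7, round(w / scale))) for w in weights]
    return codes, scale


def quantize(weights, group_size=64):
    """Split a flat weight list into groups and quantize each independently."""
    return [
        quantize_group(weights[start:start + group_size])
        for start in range(0, len(weights), group_size)
    ]


def dequantize(groups):
    """Reconstruct approximate float weights from (codes, scale) pairs."""
    return [code * scale for codes, scale in groups for code in codes]


def bits_per_weight(group_size=64, scale_bits=16):
    """Average storage cost: 4 bits per weight plus one scale per group."""
    return 4 + scale_bits / group_size
```

Under these assumptions a group size of 64 costs 4.25 bits per weight; smaller groups track the weights more closely but spend more bits on scales.
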
### OpenVINO Model Server (OVMS)

- 🐙 [`openvinotoolkit/model_server`](https://github.com/openvinotoolkit/model_server)
— the server we run. The [demos directory](https://github.com/openvinotoolkit/model_server/tree/main/demos)
has canonical `graph.pbtxt` examples for every task (text generation,
embeddings, rerank, image generation, VLMs). When in doubt, copy from
there.
- 🛠 [`optimum-intel`](https://github.com/huggingface/optimum-intel) — the
HuggingFace bridge that turns `Qwen/Qwen3-8B` into an OpenVINO IR.
`scripts/export-models.sh` is essentially a wrapper around its
`optimum-cli export openvino` command.

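A minimal sketch of the kind of command such a wrapper might assemble. The helper name and the output directory are hypothetical, and the real `scripts/export-models.sh` may pass different options; the code only builds the argument list and does not run the export.

```python
import shlex


def build_export_command(model_id, output_dir, weight_format="int4", group_size=64):
    """Return the argv list for an `optimum-cli export openvino` call.

    Hypothetical helper for illustration; it is not part of this repository.
    """
    return [
        "optimum-cli", "export", "openvino",
        "--model", model_id,
        "--weight-format", weight_format,
        "--group-size", str(group_size),
        output_dir,  # positional output directory for the OpenVINO IR
    ]


if __name__ == "__main__":
    # The output directory name below is an assumption, not the repo's layout.
    cmd = build_export_command("Qwen/Qwen3-8B", "models/Qwen3-8B-int4-ov")
    print(shlex.join(cmd))
```

Passing the resulting list to `subprocess.run(cmd, check=True)` would start the conversion, provided `optimum-intel` is installed with its OpenVINO extras.
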
### Pre-converted models

- 🤗 [`OpenVINO` organization on HuggingFace](https://huggingface.co/OpenVINO)
— official pre-converted OpenVINO IR models (Qwen, Llama, Phi, Mistral,
embedding models, etc.). If you don't want to wait for the local INT4
conversion, grab one of these directly and skip `scripts/export-models.sh`.

### Hardware

- 💻 [Intel Core Ultra processors (Series 2)](https://www.intel.com/content/www/us/en/products/details/processors/core-ultra.html)
— the family. Lunar Lake (Series 2) is what this stack was tuned for, but
the same compose file works on Meteor Lake and Arrow Lake H/HX with the
same iGPU/CPU placement logic.
- 💻 [Intel Arc Graphics](https://www.intel.com/content/www/us/en/products/details/discrete-gpus/arc.html)
— both the integrated (Arc 140V here) and discrete Arc lineups use the
same OpenVINO GPU plugin. If you have a discrete Arc A770/B580, you can
reuse this exact stack with much more headroom for big models.

### Cloud / remote (optional)

- ☁️ [Intel Tiber AI Cloud](https://www.intel.com/content/www/us/en/developer/tools/devcloud/services.html)
— Intel's developer cloud (formerly Intel Developer Cloud). Useful if you
want to try the same OVMS stack on a larger Xeon + GPU instance before
buying hardware.

### Community

- 📰 [OpenVINO blog](https://blog.openvino.ai/) — release notes, perf
numbers and model-support announcements. Subscribe if you live in this
ecosystem.
- 🎥 [Intel Developer YouTube](https://www.youtube.com/@IntelSoftware)
for OpenVINO/OVMS deep-dives and conference talks.

## Troubleshooting

- **`/dev/dri` doesn't exist in WSL**: update Windows + Intel Arc drivers. Restart WSL: `wsl --shutdown`.
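
One quick way to check for the missing-`/dev/dri` condition from inside WSL is to look for DRM render nodes. This is a sketch that assumes the GPU is exposed as `renderD*` entries under `/dev/dri`; the function name is made up for illustration.

```python
from pathlib import Path


def find_render_nodes(dri_dir="/dev/dri"):
    """Return sorted render-node paths (renderD*) under dri_dir, or [] if absent."""
    root = Path(dri_dir)
    if not root.is_dir():
        return []
    return sorted(str(p) for p in root.glob("renderD*"))


if __name__ == "__main__":
    nodes = find_render_nodes()
    if nodes:
        print("GPU render nodes:", ", ".join(nodes))
    else:
        print("No render nodes under /dev/dri; update drivers, then run `wsl --shutdown`.")
```

An empty result means containers that map `/dev/dri` have no GPU device to pass through, so the driver update and WSL restart come first.
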
