Skip to content

Commit 3c9b952

Browse files
localai-botmudler
andauthored
chore(model gallery): 🤖 add 1 new models via gallery agent (#10061)
chore(model gallery): 🤖 add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
1 parent fc2bd09 commit 3c9b952

1 file changed

Lines changed: 55 additions & 0 deletions

File tree

gallery/index.yaml

Lines changed: 55 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,59 @@
11
---
2+
- name: "lfm2.5-8b-a1b"
3+
url: "github:mudler/LocalAI/gallery/virtual.yaml@master"
4+
urls:
5+
- https://huggingface.co/LiquidAI/LFM2.5-8B-A1B-GGUF
6+
description: |
7+
Try LFM •
8+
Docs •
9+
LEAP •
10+
Discord
11+
12+
# LFM2.5-8B-A1B
13+
14+
LFM2.5 is a new family of hybrid models designed for on-device deployment. It builds on the LFM2 architecture with extended pre-training and reinforcement learning.
15+
16+
- **On-device personal assistant**: Designed to power real-life applications, chaining tool calls, and following complex instructions on all devices.
17+
- **Compressed performance**: Competitive with much larger dense and MoE models on instruction following and agentic tasks.
18+
- **Unmatched throughput**: Fastest in its size class on both CPU and GPU inference, with day-one support for llama.cpp, MLX, vLLM, and SGLang.
19+
20+
Find more information about LFM2.5-8B-A1B in our blog post.
21+
22+
**AA-Omniscience Index (higher is better) rewards correct answers and penalizes hallucinations. Scores range from -100 to 100. See more results on Artificial Analysis.*
23+
24+
## 🗒️ Model Details
25+
26+
LFM2.5-8B-A1B is a general-purpose text-only model with the following features:
27+
28+
...
29+
license: "other"
30+
tags:
31+
- llm
32+
- gguf
33+
icon: https://cdn-uploads.huggingface.co/production/uploads/61b8e2ba285851687028d395/qUZVGkns1bg3sZUShBbhv.png
34+
overrides:
35+
backend: llama-cpp
36+
function:
37+
automatic_tool_parsing_fallback: true
38+
grammar:
39+
disable: true
40+
known_usecases:
41+
- chat
42+
options:
43+
- use_jinja:true
44+
parameters:
45+
min_p: 0.15
46+
model: llama-cpp/models/LFM2.5-8B-A1B-GGUF/LFM2.5-8B-A1B-Q4_K_M.gguf
47+
repeat_penalty: 1.05
48+
temperature: 0.1
49+
top_k: 50
50+
top_p: 0.1
51+
template:
52+
use_tokenizer_template: true
53+
files:
54+
- filename: llama-cpp/models/LFM2.5-8B-A1B-GGUF/LFM2.5-8B-A1B-Q4_K_M.gguf
55+
sha256: 3ef9879143342a46cdd00b5f6f3919b77ee693945f9d73eea99344eadd6950b6
56+
uri: https://huggingface.co/LiquidAI/LFM2.5-8B-A1B-GGUF/resolve/main/LFM2.5-8B-A1B-Q4_K_M.gguf
257
- name: "qwopus3.5-9b-coder-mtp"
358
url: "github:mudler/LocalAI/gallery/virtual.yaml@master"
459
urls:

0 commit comments

Comments
 (0)