Skip to content

Commit a786e93

Browse files
authored
Merge pull request #1167 from devoxx/blog/devoxxgenie-plugin-analytics
docs(blog): add DevoxxGenie plugin analytics post
2 parents ab3fefb + 3e97840 commit a786e93

5 files changed

Lines changed: 98 additions & 0 deletions
Lines changed: 98 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,98 @@
1+
---
2+
slug: devoxxgenie-plugin-analytics
3+
title: "What 18,000 Events Tell Us About How You Use DevoxxGenie"
4+
authors: [stephanj]
5+
tags: [analytics, telemetry, local llm, ollama, qwen, agent mode, rag, mcp, IntelliJ IDEA, privacy]
6+
date: 2026-06-30
7+
description: "A look behind the curtain at DevoxxGenie's anonymous usage analytics: what it reveals about local-first AI coding, the rise of Qwen, and how developers actually use agent mode, RAG and MCP."
8+
keywords: [devoxxgenie, analytics, telemetry, local llm, ollama, lmstudio, qwen, agent mode, rag, mcp, intellij plugin, privacy]
9+
image: /img/devoxxgenie-providers-chart.png
10+
---
11+
12+
# What the Numbers Say About How You Use DevoxxGenie
13+
14+
DevoxxGenie just passed a milestone worth celebrating: **more than 72,000 downloads**, and in this month alone **29,330 active users** firing up the plugin inside IntelliJ IDEA. Thank you. Genuinely.
15+
16+
A smaller, opt-in slice of those users also share **anonymous usage analytics**. No prompts, no code, no file contents, no personal data, just coarse aggregated signals about which features get enabled, which providers get used, and which models get picked. The goal is simple: stop guessing about what matters and start building for how people actually work.
17+
18+
Even from that opted-in subset, **more than 18,000 telemetry events** over the past month paint a remarkably clear picture. A few of the patterns surprised me. Here is the story the data tells.
19+
20+
<!-- truncate -->
21+
22+
:::info A note on the charts
23+
The charts below are real, straight from the dashboard, but they show **proportions, not raw counts**. This post is about how the bars stack up, not exact figures. Everything DevoxxGenie collects is anonymous and aggregated by design.
24+
:::
25+
26+
## 1. Local-first isn't a slogan, it's the default
27+
28+
The single loudest signal in the entire dataset: **developers run their models locally.**
29+
30+
![Feature usage split between Local and Cloud providers](/img/devoxxgenie-local-vs-cloud.png)
31+
32+
Looking at *Feature Used by Provider Type*, **roughly 85% of all activity went to local providers** and only about 15% to cloud APIs. The provider leaderboard tells the same story from a different angle. The top three by prompts dispatched are all local:
33+
34+
1. **[Ollama](/docs/llm-providers/local-models)**, by a wide margin the most-used provider
35+
2. **[LMStudio](/docs/llm-providers/local-models)**
36+
3. **[CustomOpenAI](/docs/llm-providers/custom-providers)** (local OpenAI-compatible endpoints)
37+
38+
![LLM providers ranked by prompts dispatched](/img/devoxxgenie-providers-chart.png)
39+
40+
Cloud providers (OpenRouter, Google, DeepSeek, Anthropic, OpenAI) all appear, but they sit well below the local pack. In total, **13 different providers** showed up in the data, from the usual suspects all the way down to Groq and Kimi. The breadth of DevoxxGenie's [provider support](/docs/llm-providers/overview) isn't theoretical; people genuinely use the long tail.
41+
42+
The takeaway for the project is unambiguous: **local inference is the primary path**, and keeping that experience fast, private, and frictionless is the highest-leverage thing we can do.
43+
44+
## 2. Qwen ate the model leaderboard
45+
46+
If local is the *where*, **Qwen is the *what*.**
47+
48+
![Models ranked by prompts dispatched, dominated by Qwen variants](/img/devoxxgenie-models-chart.png)
49+
50+
Both the "models selected" (intent) and "prompts dispatched" (actual) charts are dominated by Qwen coder variants. **`qwen2.5-coder:7b` leads both lists.** It is the most-selected *and* the most-dispatched model. The rest of the top tier is more Qwen: `qwen3-coder:30b`, `qwen3.6:35b-mlx`, `qwen/qwen3.6-27b`, `qwen2.5-coder:14b`, and a parade of community quantizations.
51+
52+
Two things jump out:
53+
54+
- **Small, fast coder models win.** The 7B model out-uses everything heavier. People want a model that keeps up with their typing more than they want the biggest brain in the room.
55+
- **Apple Silicon is well represented.** MLX builds like `qwen3.6:35b-mlx` ranking near the top is a clear fingerprint of Mac developers running optimized local inference.
56+
57+
The cloud models that do appear lean toward the cheap-and-fast end too (`gemini-2.5-flash-lite`, `deepseek-reasoner`), reinforcing the same "fast feedback loop" preference.
58+
59+
## 3. People pick a model once, then hammer it
60+
61+
Comparing *intent* (model selected) against *actual* (prompts dispatched) reveals a nice behavioral ratio: **for every model selection, developers fire off roughly 3 to 4 prompts.** `qwen2.5-coder:7b`, for instance, was dispatched several times more often than it was explicitly selected.
62+
63+
In other words, model-switching is rare. Developers settle on a model and stay in flow. That argues for making the *default* model excellent and the *switching* experience cheap, but not for optimizing around constant model-hopping, which barely happens.
64+
65+
## 4. Agent mode is now the headline feature
66+
67+
![Features used per prompt: Agent mode leads](/img/devoxxgenie-features-used.png)
68+
69+
When you look at **features actually used per prompt**, **[Agent mode](/docs/features/agent-mode) comes out on top**, ahead of everything else. Right behind it: **[Project Context (Selected)](/docs/features/project-scanner)**, then **[Streaming](/docs/features/chat-interface)** and **[RAG](/docs/features/rag)**.
70+
71+
A couple of details worth highlighting:
72+
73+
- **Scoped context beats "everything."** *Project Context (Selected)* is used far more than *Project Context (Full)*. Developers are deliberate. They hand the model the files that matter rather than dumping the whole repo. Good instinct, and good for token bills.
74+
- **Streaming is the expected default.** A large share of prompts stream their response. Watching the answer arrive token by token clearly beats waiting for a wall of text.
75+
76+
On the *enabled per session* side, **[Custom Prompt](/docs/configuration/prompts)** is switched on in about half of sessions, followed by **Agent (about 24%)**, **Streaming (about 11%)**, **RAG (about 7%)**, **[MCP](/docs/features/mcp_expanded) (about 6%)**, and **[Web Search via Tavily](/docs/features/web-search) (about 3%)**. The headline features get broad adoption; the advanced ones have a smaller but committed following.
77+
78+
## 5. MCP and long memory are power-user territory
79+
80+
Two of the smaller panels are quietly the most interesting.
81+
82+
- **MCP servers per session:** most sessions run *zero* MCP servers, but a meaningful cluster runs **2 to 5**, and a handful push into the **6 to 10** range. MCP is a power-user feature, and the power users are clearly stacking servers.
83+
- **[Chat memory](/docs/features/chat-memory):** the vast majority of sessions sit in the **highest memory bucket (21+ messages)**. These aren't one-shot questions. They are long, sustained conversations where context accumulates. That validates the work on chat memory management and persistent memory.
84+
85+
Meanwhile, the **[Semantic Search](/docs/features/rag)** tool barely registers yet. It is the newest arrival, so low adoption is expected, but it is a flag to make the feature more discoverable.
86+
87+
## What this changes
88+
89+
None of this is collected to surveil anyone. It is collected so the roadmap reflects reality instead of my assumptions. A few decisions fall straight out of the data:
90+
91+
- **Double down on local.** Ollama, LMStudio, and CustomOpenAI are the main stage. Their setup, speed, and model management deserve the most polish.
92+
- **Treat Qwen coder models as first-class citizens**, including MLX builds for Apple Silicon.
93+
- **Keep investing in agent mode.** It is no longer a side feature, it is *the* feature.
94+
- **Make newer capabilities (Semantic Search, MCP) more discoverable**, because the people who find them clearly get value from them.
95+
96+
With **72,000+ downloads** and nearly **30,000 active users a month**, every one of these choices now lands in a lot of editors. If you'd rather not contribute any telemetry, that's completely fine. It is opt-in and easy to turn off in settings. But if you leave it on, know that it is directly shaping where DevoxxGenie goes next. Thank you for that. 🙏
97+
98+
*Curious about exactly what is and isn't collected? It is all defined by a closed-enum schema. Only known, non-identifying values are ever recorded, and anything outside the allowlist is dropped.*
97.5 KB
Loading
36.7 KB
Loading
65.9 KB
Loading
52.5 KB
Loading

0 commit comments

Comments
 (0)