|
| 1 | +--- |
| 2 | +slug: devoxxgenie-plugin-analytics |
| 3 | +title: "What 18,000 Events Tell Us About How You Use DevoxxGenie" |
| 4 | +authors: [stephanj] |
| 5 | +tags: [analytics, telemetry, local llm, ollama, qwen, agent mode, rag, mcp, IntelliJ IDEA, privacy] |
| 6 | +date: 2026-06-30 |
| 7 | +description: "A look behind the curtain at DevoxxGenie's anonymous usage analytics: what it reveals about local-first AI coding, the rise of Qwen, and how developers actually use agent mode, RAG and MCP." |
| 8 | +keywords: [devoxxgenie, analytics, telemetry, local llm, ollama, lmstudio, qwen, agent mode, rag, mcp, intellij plugin, privacy] |
| 9 | +image: /img/devoxxgenie-providers-chart.png |
| 10 | +--- |
| 11 | + |
| 12 | +# What the Numbers Say About How You Use DevoxxGenie |
| 13 | + |
| 14 | +DevoxxGenie just passed a milestone worth celebrating: **more than 72,000 downloads**, and in this month alone **29,330 active users** firing up the plugin inside IntelliJ IDEA. Thank you. Genuinely. |
| 15 | + |
| 16 | +A smaller, opt-in slice of those users also share **anonymous usage analytics**. No prompts, no code, no file contents, no personal data, just coarse aggregated signals about which features get enabled, which providers get used, and which models get picked. The goal is simple: stop guessing about what matters and start building for how people actually work. |
| 17 | + |
| 18 | +Even from that opted-in subset, **more than 18,000 telemetry events** over the past month paint a remarkably clear picture. A few of the patterns surprised me. Here is the story the data tells. |
| 19 | + |
| 20 | +<!-- truncate --> |
| 21 | + |
| 22 | +:::info A note on the charts |
| 23 | +The charts below are real, straight from the dashboard, but they show **proportions, not raw counts**. This post is about how the bars stack up, not exact figures. Everything DevoxxGenie collects is anonymous and aggregated by design. |
| 24 | +::: |
| 25 | + |
| 26 | +## 1. Local-first isn't a slogan, it's the default |
| 27 | + |
| 28 | +The single loudest signal in the entire dataset: **developers run their models locally.** |
| 29 | + |
| 30 | + |
| 31 | + |
| 32 | +Looking at *Feature Used by Provider Type*, **roughly 85% of all activity went to local providers** and only about 15% to cloud APIs. The provider leaderboard tells the same story from a different angle. The top three by prompts dispatched are all local: |
| 33 | + |
| 34 | +1. **[Ollama](/docs/llm-providers/local-models)**, by a wide margin the most-used provider |
| 35 | +2. **[LMStudio](/docs/llm-providers/local-models)** |
| 36 | +3. **[CustomOpenAI](/docs/llm-providers/custom-providers)** (local OpenAI-compatible endpoints) |
| 37 | + |
| 38 | + |
| 39 | + |
| 40 | +Cloud providers (OpenRouter, Google, DeepSeek, Anthropic, OpenAI) all appear, but they sit well below the local pack. In total, **13 different providers** showed up in the data, from the usual suspects all the way down to Groq and Kimi. The breadth of DevoxxGenie's [provider support](/docs/llm-providers/overview) isn't theoretical; people genuinely use the long tail. |
| 41 | + |
| 42 | +The takeaway for the project is unambiguous: **local inference is the primary path**, and keeping that experience fast, private, and frictionless is the highest-leverage thing we can do. |
| 43 | + |
| 44 | +## 2. Qwen ate the model leaderboard |
| 45 | + |
| 46 | +If local is the *where*, **Qwen is the *what*.** |
| 47 | + |
| 48 | + |
| 49 | + |
| 50 | +Both the "models selected" (intent) and "prompts dispatched" (actual) charts are dominated by Qwen coder variants. **`qwen2.5-coder:7b` leads both lists.** It is the most-selected *and* the most-dispatched model. The rest of the top tier is more Qwen: `qwen3-coder:30b`, `qwen3.6:35b-mlx`, `qwen/qwen3.6-27b`, `qwen2.5-coder:14b`, and a parade of community quantizations. |
| 51 | + |
| 52 | +Two things jump out: |
| 53 | + |
| 54 | +- **Small, fast coder models win.** The 7B model out-uses everything heavier. People want a model that keeps up with their typing more than they want the biggest brain in the room. |
| 55 | +- **Apple Silicon is well represented.** MLX builds like `qwen3.6:35b-mlx` ranking near the top is a clear fingerprint of Mac developers running optimized local inference. |
| 56 | + |
| 57 | +The cloud models that do appear lean toward the cheap-and-fast end too (`gemini-2.5-flash-lite`, `deepseek-reasoner`), reinforcing the same "fast feedback loop" preference. |
| 58 | + |
| 59 | +## 3. People pick a model once, then hammer it |
| 60 | + |
| 61 | +Comparing *intent* (model selected) against *actual* (prompts dispatched) reveals a nice behavioral ratio: **for every model selection, developers fire off roughly 3 to 4 prompts.** `qwen2.5-coder:7b`, for instance, was dispatched several times more often than it was explicitly selected. |
| 62 | + |
| 63 | +In other words, model-switching is rare. Developers settle on a model and stay in flow. That argues for making the *default* model excellent and the *switching* experience cheap, but not for optimizing around constant model-hopping, which barely happens. |
| 64 | + |
| 65 | +## 4. Agent mode is now the headline feature |
| 66 | + |
| 67 | + |
| 68 | + |
| 69 | +When you look at **features actually used per prompt**, **[Agent mode](/docs/features/agent-mode) comes out on top**, ahead of everything else. Right behind it: **[Project Context (Selected)](/docs/features/project-scanner)**, then **[Streaming](/docs/features/chat-interface)** and **[RAG](/docs/features/rag)**. |
| 70 | + |
| 71 | +A couple of details worth highlighting: |
| 72 | + |
| 73 | +- **Scoped context beats "everything."** *Project Context (Selected)* is used far more than *Project Context (Full)*. Developers are deliberate. They hand the model the files that matter rather than dumping the whole repo. Good instinct, and good for token bills. |
| 74 | +- **Streaming is the expected default.** A large share of prompts stream their response. Watching the answer arrive token by token clearly beats waiting for a wall of text. |
| 75 | + |
| 76 | +On the *enabled per session* side, **[Custom Prompt](/docs/configuration/prompts)** is switched on in about half of sessions, followed by **Agent (about 24%)**, **Streaming (about 11%)**, **RAG (about 7%)**, **[MCP](/docs/features/mcp_expanded) (about 6%)**, and **[Web Search via Tavily](/docs/features/web-search) (about 3%)**. The headline features get broad adoption; the advanced ones have a smaller but committed following. |
| 77 | + |
| 78 | +## 5. MCP and long memory are power-user territory |
| 79 | + |
| 80 | +Two of the smaller panels are quietly the most interesting. |
| 81 | + |
| 82 | +- **MCP servers per session:** most sessions run *zero* MCP servers, but a meaningful cluster runs **2 to 5**, and a handful push into the **6 to 10** range. MCP is a power-user feature, and the power users are clearly stacking servers. |
| 83 | +- **[Chat memory](/docs/features/chat-memory):** the vast majority of sessions sit in the **highest memory bucket (21+ messages)**. These aren't one-shot questions. They are long, sustained conversations where context accumulates. That validates the work on chat memory management and persistent memory. |
| 84 | + |
| 85 | +Meanwhile, the **[Semantic Search](/docs/features/rag)** tool barely registers yet. It is the newest arrival, so low adoption is expected, but it is a flag to make the feature more discoverable. |
| 86 | + |
| 87 | +## What this changes |
| 88 | + |
| 89 | +None of this is collected to surveil anyone. It is collected so the roadmap reflects reality instead of my assumptions. A few decisions fall straight out of the data: |
| 90 | + |
| 91 | +- **Double down on local.** Ollama, LMStudio, and CustomOpenAI are the main stage. Their setup, speed, and model management deserve the most polish. |
| 92 | +- **Treat Qwen coder models as first-class citizens**, including MLX builds for Apple Silicon. |
| 93 | +- **Keep investing in agent mode.** It is no longer a side feature, it is *the* feature. |
| 94 | +- **Make newer capabilities (Semantic Search, MCP) more discoverable**, because the people who find them clearly get value from them. |
| 95 | + |
| 96 | +With **72,000+ downloads** and nearly **30,000 active users a month**, every one of these choices now lands in a lot of editors. If you'd rather not contribute any telemetry, that's completely fine. It is opt-in and easy to turn off in settings. But if you leave it on, know that it is directly shaping where DevoxxGenie goes next. Thank you for that. 🙏 |
| 97 | + |
| 98 | +*Curious about exactly what is and isn't collected? It is all defined by a closed-enum schema. Only known, non-identifying values are ever recorded, and anything outside the allowlist is dropped.* |
0 commit comments