Bogdusik
diff --git a/‎.github/workflows/lint.yml‎
Lines changed: 26 additions & 0 deletions b/‎.github/workflows/lint.yml‎
Lines changed: 26 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 104 additions & 23 deletions b/‎README.md‎
Lines changed: 104 additions & 23 deletions
diff --git a/‎assistant/asr.py‎
Lines changed: 40 additions & 58 deletions b/‎assistant/asr.py‎
Lines changed: 40 additions & 58 deletions
diff --git a/‎assistant/core/__init__.py‎ b/‎assistant/core/__init__.py‎
diff --git a/‎assistant/core/exceptions.py‎
Lines changed: 21 additions & 0 deletions b/‎assistant/core/exceptions.py‎
Lines changed: 21 additions & 0 deletions
diff --git a/‎assistant/core/state.py‎
Lines changed: 12 additions & 0 deletions b/‎assistant/core/state.py‎
Lines changed: 12 additions & 0 deletions
@@ -0,0 +1,26 @@
+name: Lint & Type Check
+
+on:
+  push:
+    branches: ["**"]
+  pull_request:
+    branches: [main]
+
+jobs:
+  lint:
+    runs-on: windows-latest
+    steps:
+      - uses: actions/checkout@v4
+
+      - uses: actions/setup-python@v5
+        with:
+          python-version: "3.11"
+
+      - name: Install lint tools
+        run: pip install ruff mypy
+
+      - name: Ruff (lint + format check)
+        run: ruff check . --select E,F,W,I --ignore E501
+
+      - name: Mypy (type check)
+        run: mypy assistant --ignore-missing-imports --no-strict-optional
@@ -1,36 +1,117 @@
 # Personal PC Assistant
 
-A voice assistant for Windows that actually listens and executes commands. Built it because I got tired of clicking through menus and wanted my computer to feel like a conversation partner.
-
-## How it looks
+A local Windows voice assistant. Hold a hotkey → speak → the assistant executes your command. Everything runs offline: speech recognition via Faster Whisper, intent detection via rule engine + Ollama fallback.
 
 ![Control Panel Screenshot](assets/gui-screenshot.png)
 
-## Why this is cool
+## Features
+
+- Push-to-talk voice control (default: Right Shift)
+- 15+ built-in skills: open/close apps, volume, brightness, screenshots, Wi-Fi toggle, shutdown, web search, clipboard
+- Cyberpunk-themed PyQt6 GUI with animated waveform
+- Teach custom voice commands without code — saved to `config.json`
+- Fully local: Faster Whisper ASR + Ollama LLM, no cloud APIs
+
+## Architecture
+
+```
+main_gui.py / main_fast.py
+        │
+        ├─ assistant/
+        │   ├─ core/
+        │   │   ├─ exceptions.py   (AssistantError hierarchy)
+        │   │   └─ state.py        (AssistantState dataclass)
+        │   ├─ nlu/
+        │   │   ├─ normalizer.py   (text cleaning, lemmatization)
+        │   │   ├─ ollama_client.py (OllamaClient — injectable base_url)
+        │   │   └─ engine.py       (rules + Ollama fallback → intent)
+        │   ├─ skills/
+        │   │   ├─ app_control.py  (open/close/minimize apps)
+        │   │   ├─ system_control.py (volume, brightness, shutdown …)
+        │   │   ├─ browser.py      (search, open website)
+        │   │   ├─ clipboard.py    (copy/paste)
+        │   │   └─ registry.py     (SKILLS dict + Skill Protocol)
+        │   ├─ asr.py              (Faster Whisper transcription)
+        │   ├─ recorder.py         (push-to-talk audio capture)
+        │   └─ runner.py           (skill dispatch + confirmation flow)
+        └─ config.json             (hotkey, app aliases, custom commands)
+```
+
+## Entry points
+
+| File | Use when |
+|------|----------|
+| `main_gui.py` | Daily use — cyberpunk control panel, visual feedback |
+| `main_fast.py` | Debugging / scripting — console-only, lighter startup |
+
+## Setup
+
+### Requirements
+
+- Windows 10/11
+- Python 3.10+
+- [Ollama](https://ollama.ai) installed and on PATH
+- Microphone
+
+### Quick start
+
+```bat
+:: 1. Clone
+git clone https://github.com/Bogdusik/Personal-PC-Assistant.git
+cd Personal-PC-Assistant
+
+:: 2. Run setup (checks admin, copies config, installs deps)
+setup.bat
+
+:: 3. Pull an Ollama model (one-time)
+ollama pull gemma3:12b
+
+:: 4. Launch (as Administrator for hotkey support)
+python main_gui.py
+```
+
+### Manual setup
+
+```bat
+python -m venv venv
+venv\Scripts\activate
+pip install -r requirements.txt
+copy config.example.json config.json
+```
+
+Edit `config.json`:
+- `hotkey` — key to hold while speaking (default: `"right shift"`)
+- `mic_device` — microphone device index or `null` for default
+- `ollama_model` — model name (default: `"gemma3:12b"`)
+- `app_aliases` — map your app names to full `.exe` paths
+
+> **Run as Administrator** — required for keyboard hotkey hooks.
 
-• **Animation** - Custom sci-fi/cyberpunk GUI with animated waveform that pulses from your voice in real-time, smooth fade-in on launch, glassmorphism effects  
-• **Fallback** - Works even if Ollama crashes, graceful fallback guides you through setup  
-• **Local AI** - Everything runs locally (Faster Whisper + Ollama), nothing goes to the cloud  
-• **Ripple on buttons** - Interactive interface with ripple effects on hover, the whole UI "wakes up" when you launch it
+## Environment variables
 
-## How to run
+| Variable | Default | Description |
+|----------|---------|-------------|
+| `OLLAMA_BASE_URL` | `http://localhost:11434` | Override Ollama server URL |
+| `OLLAMA_MODEL` | `gemma3:12b` | Override model (also settable in config.json) |
 
-1. `git clone https://github.com/Bogdusik/Personal-PC-Assistant.git`
-2. `cd Personal-PC-Assistant`
-3. `python -m venv venv` (optional, but recommended)
-4. `venv\Scripts\activate` (Windows) or `source venv/bin/activate` (Linux/Mac)
-5. `pip install -r requirements.txt`
-6. `ollama pull gemma3:12b` (install Ollama first from ollama.ai)
-7. `python main_gui.py` (run as Administrator for hotkey support)
+## Teaching custom commands
 
-**Important:** Works only on Windows 10/11. Requires microphone access. Run as Administrator for hotkey functionality (default: Right Shift - hold to speak, release to process).
+Say *"новая команда"* (new command) or press **Ctrl+4** while the assistant is running to add a custom voice trigger:
+- Choose a match type: `equals`, `startswith`, `contains`, or `regex`
+- Pick an intent and its arguments
+- Commands are saved to `config.json` immediately
 
-## What I learned from this
+## Running tests
 
-• Mastered speech recognition in practice - Faster Whisper is incredible  
-• Got comfortable with PyQt6 and Windows API (PyCaw, win32gui, keyboard hooks)  
-• Finally built something I always wanted - talking to my computer feels natural now
+```bat
+pip install pytest
+pytest tests/ -v
+```
 
-## Want to use it?
+## Development
 
-Fork it, improve it. I won't be offended.
+```bat
+pip install ruff mypy
+ruff check .
+mypy assistant --ignore-missing-imports
+```
@@ -1,107 +1,89 @@
-import os
-import logging
+from __future__ import annotations
 import gc
+import logging
+import os
+
+from assistant.core.exceptions import AudioError
 
 os.environ["CUDA_VISIBLE_DEVICES"] = ""
 os.environ["CT2_VERBOSE"] = "0"
 
+logger = logging.getLogger(__name__)
+
 _cached_model = None
-_model_name = None
+_model_name: str | None = None
+
 
 def init_asr():
     global _cached_model, _model_name
-    
+
     if _cached_model is not None:
-        logging.info(f"Используем кэшированную модель: {_model_name}")
+        logger.info("Используем кэшированную модель: %s", _model_name)
         return _cached_model
-    
+
     print("Гружу ASR-модель (faster-whisper, CPU)...", flush=True)
-    logging.info("Начинаю загрузку ASR модели")
-    
-    models_to_try = [
+
+    try:
+        from faster_whisper import WhisperModel
+    except ImportError as exc:
+        raise AudioError(f"faster-whisper не установлен: {exc}") from exc
+
+    for model_name, device, compute_type in [
         ("tiny", "cpu", "int8"),
         ("tiny", "cpu", "float32"),
         ("base", "cpu", "int8"),
         ("base", "cpu", "float32"),
-    ]
-    
-    for model_name, device, compute_type in models_to_try:
+    ]:
         try:
             print(f"Пробую модель: {model_name} ({compute_type})...", flush=True)
-            logging.info(f"Попытка загрузки модели: {model_name} ({compute_type})")
-            
-            from faster_whisper import WhisperModel
             model = WhisperModel(model_name, device=device, compute_type=compute_type)
-            
             _cached_model = model
             _model_name = f"{model_name}_{compute_type}"
-            
-            print(f"✅ Модель {model_name} загружена успешно!", flush=True)
-            logging.info(f"ASR модель {model_name} загружена и кэширована")
+            print(f"✅ Модель {model_name} загружена!", flush=True)
+            logger.info("ASR модель %s загружена", _model_name)
             return model
-            
-        except ImportError as e:
-            logging.error(f"Ошибка импорта faster-whisper: {e}")
-            print(f"❌ Ошибка импорта: {e}", flush=True)
-            break
-        except Exception as e:
-            error_msg = str(e)[:100]
-            print(f"❌ Ошибка с {model_name}: {error_msg}...", flush=True)
-            logging.warning(f"Ошибка загрузки {model_name}: {e}")
+        except Exception as exc:
+            logger.warning("Ошибка загрузки %s/%s: %s", model_name, compute_type, exc)
             continue
-    
+
     try:
-        print("Пробую базовую загрузку...", flush=True)
-        logging.info("Попытка базовой загрузки модели")
-        from faster_whisper import WhisperModel
         model = WhisperModel("tiny")
-        
         _cached_model = model
         _model_name = "tiny_default"
-        
         print("✅ Базовая модель загружена!", flush=True)
-        logging.info("Базовая ASR модель загружена успешно")
+        logger.info("Базовая ASR модель загружена")
         return model
-        
-    except Exception as e:
-        error_msg = f"Критическая ошибка загрузки ASR: {e}"
-        print(f"❌ {error_msg}", flush=True)
-        logging.error(error_msg)
-        raise Exception(f"Не удалось загрузить ни одну модель Whisper: {e}")
+    except Exception as exc:
+        raise AudioError(f"Не удалось загрузить ни одну модель Whisper: {exc}") from exc
 
-def cleanup_asr():
+
+def cleanup_asr() -> None:
     global _cached_model, _model_name
     if _cached_model is not None:
-        logging.info("Очищаю память ASR модели")
+        logger.info("Очищаю память ASR модели")
         del _cached_model
         _cached_model = None
         _model_name = None
         gc.collect()
 
+
 def transcribe(model, path: str) -> str:
     try:
-        logging.info(f"Начинаю транскрипцию файла: {path}")
         segments, _ = model.transcribe(
             path,
             language="ru",
             vad_filter=True,
             vad_parameters=dict(min_silence_duration_ms=300),
-            # Максимально быстрый режим: жадный декодер без бима
             beam_size=1,
             best_of=1,
-            condition_on_previous_text=False
+            condition_on_previous_text=False,
         )
-        
         result = "".join(seg.text for seg in segments).strip()
-        logging.info(f"Транскрипция завершена: '{result}'")
+        logger.info("Транскрипция завершена: '%s'", result)
         return result
-        
-    except FileNotFoundError:
-        logging.error(f"Аудио файл не найден: {path}")
-        print(f"❌ Файл не найден: {path}")
-        return ""
-    except Exception as e:
-        error_msg = f"ASR ошибка транскрипции: {e}"
-        logging.error(error_msg)
-        print(f"❌ {error_msg}")
-        return ""
+    except FileNotFoundError as exc:
+        logger.error("Аудио файл не найден: %s", path)
+        raise AudioError(f"Файл не найден: {path}") from exc
+    except Exception as exc:
+        logger.error("ASR ошибка транскрипции: %s", exc)
+        raise AudioError(f"Ошибка транскрипции: {exc}") from exc
@@ -0,0 +1,21 @@
+from __future__ import annotations
+
+
+class AssistantError(Exception):
+    pass
+
+
+class OllamaError(AssistantError):
+    pass
+
+
+class ConfigError(AssistantError):
+    pass
+
+
+class SkillError(AssistantError):
+    pass
+
+
+class AudioError(AssistantError):
+    pass
@@ -0,0 +1,12 @@
+from __future__ import annotations
+import subprocess
+from dataclasses import dataclass, field
+
+
+@dataclass
+class AssistantState:
+    cfg: dict = field(default_factory=dict)
+    hotkey: str = "right shift"
+    mic_device: int | str | None = None
+    last_text: str = ""
+    ollama_process: subprocess.Popen | None = None