fix(bandit): skip reward updates for unknown arm names by GrigoryEvko · Pull Request #12 · FusionBrainLab/gigaevo-core

GrigoryEvko · 2026-05-15T06:46:31Z

on_mutation_outcome forwards program.get_metadata('mutation_model') directly to self._bandit.update_reward, which indexes self.arms[arm_name]. The normal flow keeps producer and router in step, but the metadata can also arrive from a Program loaded out of a Redis snapshot written by an older router, from a custom mutation operator, or from a hand-built test fixture. In those cases update_reward raises KeyError and aborts the agent callback.

Minimal reproducer against current main:

from unittest.mock import MagicMock
from gigaevo.llm.bandit import BanditModelRouter, MutationOutcome
from gigaevo.programs.program import Program

def _mock(name):
    m = MagicMock(); m.model_name = name
    m.with_structured_output = MagicMock(return_value=MagicMock())
    return m

router = BanditModelRouter([_mock('llama'), _mock('qwen')], [0.5, 0.5], fitness_key='score')
child = Program(code='x=1')
child.set_metadata('mutation_model', 'gpt-4-not-in-router')
child.metrics['score'] = 0.8
parent = Program(code='x=0'); parent.metrics['score'] = 0.5
router.on_mutation_outcome(child, [parent], outcome=MutationOutcome.ACCEPTED)
# KeyError: 'gpt-4-not-in-router'

The change adds a single membership check against self._bandit.arms at the top of on_mutation_outcome and skips with one debug note when the name is unknown. Three new tests cover the unknown-arm path on the ACCEPTED branch, on the REJECTED_ACCEPTOR branch, and combined with missing child fitness.

short id will generate based on full id when required

…d delta fitness

…regression feature weights

… and improve docstring clarity

The DedupDecision type is returned by dedup.process_incoming() but memory.py accesses its fields via attribute access without referencing the type by name. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor: memory system modular refactor + adversarial hardening

…cstrings Phase 1 cleanup (completed): - Move A_mem/, GAM_root/ → _vendor/ (vendored MIT libs) - Move contrib licenses → _vendor/ - Move 3 example scripts → examples/ - Fix 15 broken vendored library imports (A_mem/GAM_root bare imports) - Update 8 consumer import paths to _vendor/ - Add _vendor/__init__.py docstring (vendored libs notice) - Add examples/__init__.py docstring (not production code) - Update shared_memory/__init__.py docstring - Update pyproject.toml (ruff/mypy exclude paths for vendors) Tests: 770 passed Lint: clean Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor(memory): Phase 1 — directory reorg (vendor/examples/docstrings)

- Delete 5 duplicated usage-merge functions from memory_write_example.py (_to_float, _median_or_none, _extract_usage_task_deltas, _build_usage_payload_from_task_deltas, _merge_usage_payloads) → import from card_update_dedup.py (canonical home) - Delete duplicate dedupe_keep_order from card_update_dedup.py → import from shared_memory/utils.py - Remove deprecated _apply_update_actions() from memory.py (dead wrapper) - Make memory_to_card private (_memory_to_card) — only used internally - Simplify single-iteration loop in _extract_json_object Net: ~120 lines deleted, zero behavior change. Tests: 770 passed | Lint: clean Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor(memory): deduplicate code and delete dead paths

Replace hand-written RetrievalWeights.from_mapping() and CardUpdateDedupConfig.from_mapping() dict parsers (~87 lines) with Pydantic v2 @model_validator(mode="before") — same behavior, idiomatic. Also: add docstrings to all functions in card_update_dedup.py, fix stale test references to deleted _apply_update_actions wrapper, add test for flat config format. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor(memory): Pydantic-idiomatic config parsing

- memory_write_example.py → write_pipeline.py (it's production, not an example) - memory_write_config.py → write_pipeline_config.py - selected_ideas_6.py → origin_analysis.py (remove versioned filename) - Delete test_memory_write_example_extended.py (duplicate of test_write_pipeline.py) - Update all 8 import sites + 1 dynamic importlib.import_module call Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor(memory): rename write pipeline and analysis files

Extract from card_conversion.py (554 → 420 lines): - base.py: GigaEvoMemoryBase abstract class (20 lines) - card_search.py: format_search_results, search_cards_by_keyword, synthesize_search_results (115 lines) Update 4 import sites directly (no re-exports). card_conversion.py retains: normalization, conversion, GAM config, constants, protocols. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor(memory): split card_conversion into focused modules

Define MemoryError, MemoryRetrieverError, MemorySearchError, and MemoryStorageError in gigaevo/exceptions.py following the existing GigaEvoError hierarchy. Wire them into the memory subsystem: - gam_search.build() wraps all failures in MemoryRetrieverError - memory.py narrows two gam.build() catches from bare Exception - card_store._load() narrows to (json.JSONDecodeError, OSError) - card_dedup import block narrows to (ImportError, OSError) Resilience-critical catches (search fallback, merge loop, __exit__) remain broad by design. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor(memory): custom exception hierarchy and narrowed catches

@AbstractMethod

…t base to ABC - concept_api.py: all 5 RuntimeError raises → MemoryStorageError (matches gigaevo/database pattern of wrapping I/O errors) - base.py: GigaEvoMemoryBase now uses ABC + @AbstractMethod (matches MutationOperator, Stage, LangGraphAgent pattern) - card_dedup.py: narrow two broad catches: - JSONL read fallback: except Exception → (json.JSONDecodeError, OSError) - GAM store build: except Exception → (MemoryRetrieverError, OSError) - Update 6 test assertions from RuntimeError to MemoryStorageError Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ormity refactor(memory): exception conformity + ABC base class

When write_pipeline.py passes MemoryCard/ProgramCard Pydantic models to memory_platform.save_card(), the dict() call on a Pydantic model doesn't properly flatten nested Pydantic objects like ConnectedIdea. This caused TypeError in _persist_index() when json.dumps() tried to serialize. Root cause: write_pipeline returns list[AnyCard] (Pydantic models) and both backends (memory_platform and memory/shared_memory) consume these cards via save_card(). memory_platform's normalize_memory_card() must explicitly call .model_dump() on Pydantic inputs to flatten nested objects. Fix verified: all 788 memory + integration tests pass. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Tests the exact bug path: Pydantic MemoryCard/ProgramCard with nested ConnectedIdea and MemoryCardExplanation objects must be properly flattened to plain dicts before JSON serialization. 6 tests covering: ProgramCard with ConnectedIdea, MemoryCard with MemoryCardExplanation, plain dict passthrough, JSON round-trips, None. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Add gigaevo-memory Git dependency to pyproject.toml - Remove sys.path manipulation from memory_platform/memory.py and remote_gam_retriever.py (no longer needed with proper install) - Simplify test file to use direct imports instead of module mocking Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Expands from 6 to 11 tests covering the complete save_card → _persist_index flow with Pydantic inputs. Tests verify: - normalize_memory_card: ConnectedIdea/MemoryCardExplanation → dict - save_card: Pydantic ProgramCard/MemoryCard → JSON-serializable index - _card_to_backend_content: API payload is clean dict - persist/reload roundtrip: index file survives write→read cycle Uses _make_platform_memory() factory with mocked API client to test memory_platform in isolation without network dependencies. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Add docstrings to 15 public methods across 5 files (memory.py, concept_api.py, card_dedup.py, openai_inference.py, write_pipeline.py) - Add return type annotations to 4 functions in amem_gam_retriever.py - Fix 2 mypy errors: annotate retrievers dict, rename variable in api_sync.py - Extract magic numbers: _MAX_SUMMARY_CHARS, _MAX_DESCRIPTION_CHARS, _ENTITY_NAME_MAX_LENGTH, _MAX_CONNECTED_DESCRIPTIONS Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor(memory): type annotations, docstrings, constants, platform bug fix

on_mutation_outcome forwarded program.get_metadata('mutation_model') directly to self._bandit.update_reward, which raised KeyError when the metadata did not match a current arm (Redis snapshot replay, custom operator, hand-built test fixture). Membership check at the top of the function turns the crash into a debug-level skip.

…rite

PetrAnokhin and others added 30 commits April 1, 2026 16:00

gitignore

c00aeb3

feat: add changes extraction to mutation agent

52ef218

feat: add idea tracker

5ec4f06

feat: add logging for idea tracker

842746e

fix: circular import in logger

fdca9a7

fix: remove short id separate storage and generation

dca364c

short id will generate based on full id when required

feat: add best idea extraction based on top_k selection by fitness an…

2d2c1e6

…d delta fitness

feat: experimental ml pipeline for impact estimation based on linear …

1872d38

…regression feature weights

refactor: remove debug code

c94e78a

fix: changed cooccurrence threshold agressive scaling to fixed minimum

97ad639

feat: add idea description rewriting logic

c6c3421

chore: removed unused prompts

5170fe9

gitignore

3293d4d

fix: correct serialization of dict and lists in pd columns

7c7e3e8

feat: csv loading to IdeaTracker

db5d7c4

memory in config

ce2ca3d

fixed my cat stepping on keyboard probably

43def44

Update idea_tracker

9fd9cf1

feat: add extended record card dataclass

6ea7bd9

feat: add update logic for extended record card

bec4351

feat: support for extended record card

49a2a00

refactor: record card extended minor refactor

4830094

feat: task description loading

1662eed

fix: remove debug print

7c8d45a

feat: update main logic to work with extended record card

78584c7

chore: update docstrings

97f96d1

fix: wrong key name fix

3d1d740

fix: IncomingIdeas update logic fix

2402efb

refactor: replace ML impact pipeline with origin analysis computation…

ba95f51

… and improve docstring clarity

fix: add break condition for processing when no new ideas are present

4974f97

KhrulkovV and others added 26 commits April 5, 2026 23:37

chore: remove unused DedupDecision import from memory.py

4068841

The DedupDecision type is returned by dedup.process_incoming() but memory.py accesses its fields via attribute access without referencing the type by name. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merge pull request #166 from KhrulkovV/worktree-memory-optimize

8e74431

refactor: memory system modular refactor + adversarial hardening

Merge pull request #167 from KhrulkovV/worktree-memory-optimize

ca62aae

refactor(memory): Phase 1 — directory reorg (vendor/examples/docstrings)

Merge pull request #168 from KhrulkovV/refactor/memory-deduplicate

381e63b

refactor(memory): deduplicate code and delete dead paths

Merge pull request #170 from KhrulkovV/refactor/memory-pydantic-config

a6968cf

refactor(memory): Pydantic-idiomatic config parsing

Merge pull request #171 from KhrulkovV/refactor/memory-rename-relocate

1ae530a

refactor(memory): rename write pipeline and analysis files

Merge pull request #172 from KhrulkovV/refactor/memory-split-narrow

37a486e

refactor(memory): split card_conversion into focused modules

Merge pull request #173 from KhrulkovV/refactor/memory-exceptions

e8c60d6

refactor(memory): custom exception hierarchy and narrowed catches

Merge pull request #174 from KhrulkovV/refactor/memory-exception-conf…

407b757

…ormity refactor(memory): exception conformity + ABC base class

fix: lint errors in memory_platform test (unused import, sort)

b99e221

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merge pull request #175 from KhrulkovV/refactor/memory-mypy-docs

4f249d3

refactor(memory): type annotations, docstrings, constants, platform bug fix

Update pyproject.toml

98bd0ab

Update run.py

054df39

This was referenced May 17, 2026

feat(dataplane): typed Redis coordination plane with atomic Lua substrate, FSM migration, and Hydra hardening #20

Open

refactor(config): replace hydra/omegaconf with typed pydantic+tyro #21

Open

KhrulkovV force-pushed the main branch from 054df39 to 0f2b866 Compare May 26, 2026 09:37

chore: empty commit to refresh PR mergeability after main history rew…

794613b

…rite

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(bandit): skip reward updates for unknown arm names#12

fix(bandit): skip reward updates for unknown arm names#12
GrigoryEvko wants to merge 800 commits into
FusionBrainLab:mainfrom
GrigoryEvko:fix/bandit-canonicalize-mutation-model

GrigoryEvko commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

GrigoryEvko commented May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants