Optimize get_optimized_code_for_module

codeflash-ai[bot] · web-flow · commit ce1f8da41a1b · 2026-02-25T10:33:13.000Z
Runtime improvement (primary): the optimized version reduces end-to-end runtime from 48.6 ms to 43.5 ms — an 11% overall speedup. Many hot-call scenarios (repeated lookups, large mappings and bulk iterations) see much larger per-case gains (up to ~80% in repeated calls and ~50% on some large-map lookups in the annotated tests).

What changed (concrete optimizations)
- CodeStringsMarkdown.file_to_path:
  - Replaced a two-step .get(...) / indexing pattern with a single try/except KeyError around self._cache["file_to_path"]. This avoids multiple dict lookups and branches when the cache exists.
  - Builds and caches the mapping only on the KeyError path (so successful fast-path returns are a single dict access).

- get_optimized_code_for_module:
  - Compute str(relative_path) once (str_relative) and reuse it instead of calling str(...) repeatedly.
  - Avoid constructing full lists of keys and Path objects when searching for similar filenames:
    - Iterate file_to_code_context keys directly (no temporary available_files list unless needed).
    - Use os.path.basename(f) instead of Path(f).name to avoid allocating Path objects; os.path.basename is a thin C-level operation and much cheaper for simple basename extraction.
  - Defer construction of available_files (list(file_to_code_context.keys())) until actually needed for logging, avoiding unnecessary allocations in the common case.

Why this speeds things up (technical reasons)
- Less Python-level work and fewer allocations: the original code performed more dict lookups, created temporary lists, and built many Path objects inside a list comprehension — each Path(...) allocates a Python object and calls methods, which is expensive in hot loops. The optimized code reduces object construction and reduces interpreter-level branching.
- Fewer lookups: switching to try/except for the cached value reduces the number of dictionary key operations on the hot path (successful cache hit path becomes a single access).
- Cheaper basename extraction: os.path.basename is implemented in C and avoids constructing heavy Path objects for each candidate, which lowers per-iteration overhead when scanning many keys.
- Deferred work: only produce heavy values (available_files list) when we actually need them for a warning/debug path, so the common successful-case remains minimal.

How this affects existing workloads (based on tests and likely hot paths)
- Big wins when the function is called many times or the mapping is large:
  - Repeated calls to the same path (hot path) benefit heavily because file_to_path cache access and the simple get(...) are cheap.
  - Large mappings where we occasionally scan keys for similarity gain because we avoid Path allocations and unnecessary list construction.
- Minimal/zero impact for simple single-shot calls where no scanning occurs beyond the direct dict get.
- A few tests show micro-regressions (~0–2% slower in isolated cases). These are tiny and reasonable trade-offs for the improved aggregate runtime and much larger wins on hot workloads — e.g., a single extra function call or slightly different branching can explain sub-percent differences.

Behavioral/key-dependency notes
- Semantics preserved: fallback logic, similarity detection and logging behavior remain functionally the same. The only behavioral change is internal ordering of checks and how we detect basenames; that produces equivalent results for path strings.
- New import of os is local and trivial; no new external dependencies.

Which test cases benefit most (from annotated_tests)
- Repeated-calls and large-map iteration tests show the largest improvements (repeated_calls_use_cached_file_to_path, large_mapping_retrieve_multiple_entries, and the large-map loop).
- Tests that exercise the “scan for similar filename” logic also improve because os.path.basename avoids Path allocations across many keys (large_scale_many_entries_similar_filenames_detected_among_many).
- A few single-call tests show negligible change or very small regressions, which is an acceptable trade-off given the substantial wins on hot paths.

Summary
- Primary win: 11% overall runtime reduction (with much larger wins on hot paths).
- How: reduce dict lookups, avoid temporary lists, eliminate Path(...) allocations in tight loops, reuse computed strings, and defer expensive work.
- Trade-offs: minor micro-regressions in a couple of edge micro-benchmarks, but these are acceptable given the improved throughput and much larger gains where it matters (repeated and large-scale calls).
diff --git a/codeflash/languages/python/static_analysis/code_replacer.py b/codeflash/languages/python/static_analysis/code_replacer.py
@@ -21,6 +21,7 @@
 )
 from codeflash.languages.python.static_analysis.line_profile_utils import ImportAdder
 from codeflash.models.models import FunctionParent
+import os
 
 if TYPE_CHECKING:
     from codeflash.discovery.functions_to_optimize import FunctionToOptimize
@@ -557,25 +558,31 @@ def _extract_function_from_code(
 
 def get_optimized_code_for_module(relative_path: Path, optimized_code: CodeStringsMarkdown) -> str:
     file_to_code_context = optimized_code.file_to_path()
-    module_optimized_code = file_to_code_context.get(str(relative_path))
+    str_relative = str(relative_path)
+    module_optimized_code = file_to_code_context.get(str_relative)
     if module_optimized_code is None:
         # Fallback: if there's only one code block with None file path,
         # use it regardless of the expected path (the AI server doesn't always include file paths)
-        if "None" in file_to_code_context and len(file_to_code_context) == 1:
+        if len(file_to_code_context) == 1 and "None" in file_to_code_context:
             module_optimized_code = file_to_code_context["None"]
             logger.debug(f"Using code block with None file_path for {relative_path}")
         else:
-            available_files = list(file_to_code_context.keys())
             # Check if this looks like a path mismatch (same filename exists under a different path)
             # vs the AI simply not returning code for this module
             requested_name = relative_path.name
-            similar = [f for f in available_files if Path(f).name == requested_name]
+            similar: list[str] = []
+            # Iterate keys once and avoid constructing Path objects repeatedly
+            for f in file_to_code_context:
+                if os.path.basename(f) == requested_name:
+                    similar.append(f)
             if similar:
+                available_files = list(file_to_code_context.keys())
                 logger.warning(
                     f"Optimized code not found for '{relative_path}' but found similar path(s): {similar}. "
                     f"Re-check your markdown code structure. Available files: {available_files}"
                 )
             else:
+                available_files = list(file_to_code_context.keys())
                 logger.debug(
                     f"AI service did not return optimized code for '{relative_path}'. "
                     f"Available files in response: {available_files}"
diff --git a/codeflash/models/models.py b/codeflash/models/models.py
@@ -331,12 +331,13 @@ def file_to_path(self) -> dict[str, str]:
             dict[str, str]: Mapping from file path (as string) to code.
 
         """
-        if self._cache.get("file_to_path") is not None:
+        try:
             return self._cache["file_to_path"]
-        self._cache["file_to_path"] = {
-            str(code_string.file_path): code_string.code for code_string in self.code_strings
-        }
-        return self._cache["file_to_path"]
+        except KeyError:
+            # Build the mapping once and cache it
+            mapping = {str(code_string.file_path): code_string.code for code_string in self.code_strings}
+            self._cache["file_to_path"] = mapping
+            return mapping
 
     @staticmethod
     def parse_markdown_code(markdown_code: str, expected_language: str = "python") -> CodeStringsMarkdown: