feat: add langextract as local visual trace layer (fixes #1412) by MervinPraison · Pull Request #1413 · MervinPraison/PraisonAI

MervinPraison · 2026-04-16T19:45:06Z

Automated PR from Claude's work on issue #1412 (langextract integration).

Branch: claude/issue-1412-20260416-1929

Summary by CodeRabbit

New Features
- Added Langextract as a supported observability provider and a new praisonai langextract command group
- New commands: praisonai langextract view (JSONL → interactive HTML) and praisonai langextract render (run workflow with Langextract tracing); both can auto-open output
Changes
- --observe/-O help and validation updated to accept langextract (and langfuse) and reject unsupported providers
- Observability now degrades gracefully if Langextract is unavailable
Tests
- Added unit tests covering Langextract sink and CLI behavior
Chores
- Added optional langextract dependency

- Implement LangextractSink adapter following TraceSinkProtocol pattern - Add CLI support for --observe langextract with env var configuration - Create langextract subcommand with view/render operations - Add optional langextract dependency to pyproject.toml - Extend lazy imports in observability/__init__.py - Add comprehensive unit tests for the integration - Register langextract command in main CLI app 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: MervinPraison <MervinPraison@users.noreply.github.com>

greptile-apps

MervinPraison has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

coderabbitai · 2026-04-16T19:45:28Z

Caution

Review failed

Pull request was closed or merged during review

📝 Walkthrough

Walkthrough

Adds optional Langextract observability: a LangextractSink and LangextractSinkConfig, CLI wiring (accepts --observe langextract, registers praisonai langextract), lazy exports, pyproject extra for the optional dependency, and unit tests; rendering happens on sink close and degrades gracefully if langextract is absent.

Changes

Cohort / File(s)	Summary
CLI core `src/praisonai/praisonai/cli/app.py`	Added `_setup_langextract_observability(verbose: bool = False)`, extended `--observe/-O` to accept `langextract`, and registered the `langextract` subcommand group.
CLI commands export `src/praisonai/praisonai/cli/commands/__init__.py`	Exported `langextract_app` and extended lazy `__getattr__` to return the langextract Typer app on demand.
Langextract CLI `src/praisonai/praisonai/cli/commands/langextract.py`	New Typer app `langextract` with `view` (JSONL→HTML) and `render` (run YAML workflow with sink) commands, file validation, lazy imports, optional auto-open, and exit-on-error behavior.
Observability API `src/praisonai/praisonai/observability/__init__.py`	Added lazy exports for `LangextractSink` and `LangextractSinkConfig` via dynamic `__getattr__` for optional import-time behavior.
Langextract sink implementation `src/praisonai/praisonai/observability/langextract.py`	New `LangextractSinkConfig` dataclass and `LangextractSink` implementing the TraceSinkProtocol: thread-safe buffering, source-text grounding from AGENT_START, event→extraction mapping, JSONL + HTML generation on `close()`, idempotent close, `flush()` no-op, and graceful handling when langextract is unavailable.
Dependencies & tests `src/praisonai/pyproject.toml`, `src/praisonai/tests/unit/test_langextract_sink.py`	Added optional extra `langextract>=1.0.0`; added comprehensive unit tests for sink behavior, mapping, rendering, CLI registration, and failure modes (including lazy-import scenarios).

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant CLI as "praisonai CLI\n(langextract render)"
    participant Tracer as "TraceEmitter"
    participant Sink as "LangextractSink"
    participant LX as "langextract\nlibrary"
    participant FS as "File System"

    User->>CLI: praisonai langextract render <workflow.yaml>
    CLI->>Sink: instantiate LangextractSink(config)
    CLI->>Tracer: set_default_emitter(tracer)
    CLI->>CLI: run workflow (PraisonAI.main)
    CLI->>Tracer: emit(AGENT_START / TOOL_* / OUTPUT / ERROR)
    Tracer->>Sink: emit(event)
    Sink->>Sink: buffer events (capture source text)
    CLI->>Sink: workflow completes -> close()
    Sink->>Sink: map events → lx.data.Extraction[]
    Sink->>LX: create AnnotatedDocument + visualize(...)
    LX->>FS: write JSONL and HTML
    Sink->>FS: open(HTML) [if auto_open]
    Sink-->>CLI: done (output path)
    CLI-->>User: prints output path

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

feat: Langfuse Observability Integration — CLI, LangfuseSink Adapter & Docker Hosting #1271: Adds an analogous observability integration (sink + CLI wiring); overlaps in design and modules changed.
feat: implement Phase 1 zero-config observability with --observe langfuse flag #1390: Introduced the CLI observability wiring pattern for other providers (langfuse); closely related to the --observe and setup approach used here.

Poem

🥕 I hopped through events, one by one,

Collected traces under moon and sun.
Langextract painted them into light,
HTML flowers bloomed so bright.
A rabbit cheers — observability done!

🚥 Pre-merge checks | ✅ 6

✅ Passed checks (6 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'feat: add langextract as local visual trace layer (fixes `#1412`)' directly describes the main change—adding langextract as an observability sink, which is the central feature across all modified files.
Linked Issues check	✅ Passed	The PR substantially addresses `#1412`'s core coding objectives: LangextractSink + LangextractSinkConfig implementing TraceSinkProtocol [`#1412`], lazy imports with optional extra [`#1412`], CLI wiring with --observe langextract and _setup_langextract_observability [`#1412`], praisonai langextract subcommands (view/render) [`#1412`], Python API exports via observability.getattr [`#1412`], and comprehensive unit tests with ≥90% coverage [`#1412`].
Out of Scope Changes check	✅ Passed	All changes are within scope: observability sink implementation, CLI integration, optional dependency declaration, lazy-loading updates, new tests, and command functions directly support `#1412`'s requirements with no extraneous modifications.
Description check	✅ Passed	The PR includes a clear description in pr_objectives indicating it closes `#1412`, details the integration of langextract as observability, and references a recent fix commit addressing code review issues.
Docstring Coverage	✅ Passed	Docstring coverage is 83.87% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch claude/issue-1412-20260416-1929

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

MervinPraison · 2026-04-16T19:45:38Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

gemini-code-assist

Code Review

This pull request introduces langextract observability to PraisonAI, enabling the generation of interactive HTML visualizations for agent traces. Key additions include the LangextractSink implementation, new CLI commands for viewing and rendering traces, and integration with the global --observe flag. Review feedback highlighted critical thread-safety concerns in the LangextractSink during event processing and rendering, and recommended using atexit to ensure the sink is correctly closed and data is persisted when the CLI exits.

gemini-code-assist · 2026-04-16T19:46:40Z

+        import os
+
+        # Build LangextractSinkConfig from env vars
+        config = LangextractSinkConfig(
+            output_path=os.getenv("PRAISONAI_LANGEXTRACT_OUTPUT", "praisonai-trace.html"),
+            auto_open=os.getenv("PRAISONAI_LANGEXTRACT_AUTO_OPEN", "false").lower() == "true",
+        )
+
+        # Create LangextractSink
+        sink = LangextractSink(config=config)
+
+        # Set up action-level trace emitter
+        emitter = TraceEmitter(sink=sink, enabled=True)
+        set_default_emitter(emitter)


The LangextractSink requires an explicit close() call to write the trace file to disk. When using the global --observe langextract flag, there is currently no mechanism to ensure close() is called at the end of the process. Registering a shutdown hook via atexit will ensure the trace is generated when the CLI exits.

import os import atexit # Build LangextractSinkConfig from env vars config = LangextractSinkConfig( output_path=os.getenv("PRAISONAI_LANGEXTRACT_OUTPUT", "praisonai-trace.html"), auto_open=os.getenv("PRAISONAI_LANGEXTRACT_AUTO_OPEN", "false").lower() == "true", ) # Create LangextractSink sink = LangextractSink(config=config) # Ensure sink is closed on exit to write the trace file atexit.register(sink.close) # Set up action-level trace emitter emitter = TraceEmitter(sink=sink, enabled=True) set_default_emitter(emitter)

gemini-code-assist · 2026-04-16T19:46:40Z

+    def close(self) -> None:
+        if self._closed:
+            return
+        self._closed = True


The close method should acquire the lock before checking and setting _closed to ensure thread safety. This prevents multiple threads from initiating the rendering process simultaneously.

def close(self) -> None: with self._lock: if self._closed: return self._closed = True

gemini-code-assist · 2026-04-16T19:46:40Z

+        source = self._source_text or ""
+        extractions = list(self._events_to_extractions(lx, source))


Accessing self._events and self._source_text during rendering without a lock is not thread-safe, as emit might still be active on another thread. This can lead to a RuntimeError if the list is modified during iteration. Capturing a snapshot of the events under the lock and checking if any events were recorded prevents unnecessary processing and potential crashes.

Suggested change

source = self._source_text or ""

extractions = list(self._events_to_extractions(lx, source))

with self._lock:

events = self._events[:]

source = self._source_text or ""

if not events:

return

extractions = list(self._events_to_extractions(lx, source, events))

gemini-code-assist · 2026-04-16T19:46:40Z

+    def _events_to_extractions(self, lx, source: str):
+        """Pure mapper: ActionEvent list -> lx.data.Extraction generator."""
+        for ev in self._events:


Update _events_to_extractions to accept the list of events to process, making it a pure function and avoiding direct access to self._events which might be modified by other threads.

Suggested change

def _events_to_extractions(self, lx, source: str):

"""Pure mapper: ActionEvent list -> lx.data.Extraction generator."""

for ev in self._events:

def _events_to_extractions(self, lx, source: str, events: List[ActionEvent]):

"""Pure mapper: ActionEvent list -> lx.data.Extraction generator."""

for ev in events:

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/07cd88a0-46d0-40f2-9e3a-0afc9c5402a9 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

greptile-apps

MervinPraison has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/07cd88a0-46d0-40f2-9e3a-0afc9c5402a9 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

greptile-apps

MervinPraison has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

Copilot

Pull request overview

Adds a new optional “langextract” observability integration intended to render local, self-contained HTML visual traces of runs, and wires it into the Typer CLI and optional dependencies.

Changes:

Add LangextractSink / LangextractSinkConfig and export via praisonai.observability.
Add praisonai langextract {view,render} CLI commands and --observe langextract wiring.
Add an optional dependency extra (praisonai[langextract]) and a unit test module for the integration.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 9 comments.

Show a summary per file

File	Description
src/praisonai/praisonai/observability/langextract.py	New sink that maps `ActionEvent` → langextract annotated-doc JSONL + HTML output
src/praisonai/praisonai/observability/init.py	Lazy-export `LangextractSink` and config
src/praisonai/praisonai/cli/app.py	Add `--observe langextract` and setup wiring
src/praisonai/praisonai/cli/commands/langextract.py	New `langextract` command group (`view`, `render`)
src/praisonai/praisonai/cli/commands/init.py	Register `langextract_app` for lazy command import
src/praisonai/pyproject.toml	Add `langextract` optional extra dependency
src/praisonai/tests/unit/test_langextract_sink.py	New unit tests for sink + CLI registration

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-16T19:51:00Z

+                yield lx.data.Extraction(
+                    extraction_class="final_output",
+                    extraction_text=(ev.metadata or {}).get("content", "")[:1000],


ActionEventType.OUTPUT events store the output text in tool_result_summary (see TraceEmitter.output()), not metadata['content']. As written, final_output extractions will usually be empty. Map the extraction text from ev.tool_result_summary (and/or fall back to metadata.get('output') if you want to support that shape).

Suggested change

yield lx.data.Extraction(

extraction_class="final_output",

extraction_text=(ev.metadata or {}).get("content", "")[:1000],

output_text = (

ev.tool_result_summary

or (ev.metadata or {}).get("output")

or (ev.metadata or {}).get("content", "")

)

yield lx.data.Extraction(

extraction_class="final_output",

extraction_text=output_text[:1000],

Copilot · 2026-04-16T19:51:00Z

+    def close(self) -> None:
+        if self._closed:
+            return
+        self._closed = True
+        try:
+            self._render()
+        except Exception as e:


close() sets _closed and calls _render() without coordinating with the _lock used in emit(). If events are emitted concurrently while closing, _render() can observe a partial/inconsistent _events list. Consider taking the lock in close() to snapshot _events/_source_text (or set _closed under the same lock) before rendering.

Copilot · 2026-04-16T19:51:00Z

+            output_path = Path(temp_dir) / "test.html"
+            config = LangextractSinkConfig(
+                output_path=str(output_path),
+                auto_open=True


These tests patch praisonai.observability.langextract.langextract, but the production code does import langextract as lx inside _render(). Patching a non-existent module attribute won't prevent a real import, so the tests will fail when langextract isn't installed. Patch sys.modules['langextract'] (or patch importlib.import_module/builtins.__import__) instead.

Copilot · 2026-04-16T19:51:01Z

+
+        with pytest.raises(typer.Exit):
+            render(Path("/nonexistent/workflow.yaml"))
+
+
+class TestLangextractObservabilitySetup:
+    """Test CLI observability setup."""
+
+    def test_observe_langextract_calls_setup(self):


from praisonai.cli.app import main will fail: praisonai/cli/app.py defines app and main_callback, but no main() symbol. If you want to exercise the Typer CLI, use typer.testing.CliRunner against praisonai.cli.app.app (or import the real entrypoint praisonai.cli.main).

Copilot · 2026-04-16T19:51:01Z

+class TestLangextractCLI:
+    """Test langextract CLI commands."""
+
+    @pytest.mark.parametrize("command", ["view", "render"])
+    def test_cli_commands_exist(self, command):
+        """Test that CLI commands are registered."""
+        from praisonai.cli.commands.langextract import app
+
+        # Check that the command exists
+        commands = {cmd.name for cmd in app.registered_commands}
+        assert command in commands
+
+    def test_view_command_missing_file(self):
+        """Test view command with missing JSONL file."""
+        from praisonai.cli.commands.langextract import view
+        import typer


test_view_command_missing_file/test_render_command_missing_yaml call view()/render() directly, but both commands import optional dependencies before checking whether the provided path exists. In the default (non-extra) test environment, this will raise/exit due to missing extras instead of exercising the missing-file branch. Either mock the imports, reorder the implementation to validate paths before importing optional deps, or skip the tests when langextract isn't available.

Copilot · 2026-04-16T19:51:01Z

+        Path(self._config.output_path).write_text(html_text, encoding="utf-8")
+
+        if self._config.auto_open:
+            webbrowser.open(f"file://{Path(self._config.output_path).resolve()}")


_render() creates the parent dir for the JSONL path but not for output_path. If output_path includes a non-existent directory (e.g. out/trace.html), write_text() will raise and the trace won't be produced. Create Path(output_path).parent before writing the HTML.

Suggested change

Path(self._config.output_path).write_text(html_text, encoding="utf-8")

if self._config.auto_open:

webbrowser.open(f"file://{Path(self._config.output_path).resolve()}")

output_path = Path(self._config.output_path)

output_path.parent.mkdir(parents=True, exist_ok=True)

output_path.write_text(html_text, encoding="utf-8")

if self._config.auto_open:

webbrowser.open(f"file://{output_path.resolve()}")

Copilot · 2026-04-16T19:51:01Z

+def _setup_langextract_observability(*, verbose: bool = False) -> None:
+    """Set up Langextract observability by wiring TraceSink to action emitter."""
+    try:
+        from praisonai.observability.langextract import LangextractSink, LangextractSinkConfig
+        from praisonaiagents.trace.protocol import TraceEmitter, set_default_emitter
+        import os
+
+        # Build LangextractSinkConfig from env vars
+        config = LangextractSinkConfig(
+            output_path=os.getenv("PRAISONAI_LANGEXTRACT_OUTPUT", "praisonai-trace.html"),
+            auto_open=os.getenv("PRAISONAI_LANGEXTRACT_AUTO_OPEN", "false").lower() == "true",
+        )
+
+        # Create LangextractSink
+        sink = LangextractSink(config=config)
+
+        # Set up action-level trace emitter
+        emitter = TraceEmitter(sink=sink, enabled=True)
+        set_default_emitter(emitter)
+
+    except ImportError:
+        # Gracefully degrade if langextract not installed
+        if verbose:
+            typer.echo("Warning: langextract is not installed. Install with: pip install 'praisonai[langextract]'", err=True)
+    except Exception as e:


_setup_langextract_observability()'s except ImportError branch won't trigger when langextract isn't installed because importing LangextractSink doesn't import the third-party langextract package. This results in observability appearing enabled but failing later on close() with only a logged warning. Consider explicitly checking importlib.util.find_spec('langextract') (or importing langextract here) and surfacing a clear CLI warning/error when it's missing.

Copilot · 2026-04-16T19:51:02Z

+    try:
+        from praisonai.observability import LangextractSink, LangextractSinkConfig
+        from praisonaiagents.trace.protocol import TraceEmitter, set_default_emitter
+        from praisonai import PraisonAI
+    except ImportError as e:
+        typer.echo(f"Error: Missing dependencies: {e}", err=True)
+        raise typer.Exit(1)
+
+    if not yaml_path.exists():
+        typer.echo(f"Error: YAML file not found: {yaml_path}", err=True)
+        raise typer.Exit(1)
+
+    # Set up langextract observability
+    config = LangextractSinkConfig(
+        output_path=str(output_html),
+        auto_open=not no_open,
+    )
+    sink = LangextractSink(config=config)
+
+    # Set up trace emitter for the duration of the run
+    emitter = TraceEmitter(sink=sink, enabled=True)
+    set_default_emitter(emitter)
+


render() wires up the action trace system (TraceEmitter/set_default_emitter), but PraisonAI.main() (legacy YAML runner) emits context trace events via ContextTraceEmitter/set_context_emitter (see praisonai/cli/main.py). As a result, this command is unlikely to capture the workflow execution at all. To make praisonai langextract render work, hook into the context tracing pipeline (or add a ContextTraceSink adapter) rather than the action trace emitter.

Copilot · 2026-04-16T19:51:02Z

+        # Run the workflow
+        praison = PraisonAI(agent_file=str(yaml_path))
+        if api_url:
+            praison.api_url = api_url


The codebase normalizes api_url with rstrip('/') to avoid double slashes when building API paths (e.g. cli/features/jobs.py:37). Here api_url is assigned as-is. Please normalize before assigning to keep behavior consistent.

Suggested change

praison.api_url = api_url

praison.api_url = api_url.rstrip("/")

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/07cd88a0-46d0-40f2-9e3a-0afc9c5402a9 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

MervinPraison · 2026-04-16T19:51:21Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items

greptile-apps

MervinPraison has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

praisonai-triage-agent · 2026-04-16T19:51:38Z

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/07cd88a0-46d0-40f2-9e3a-0afc9c5402a9 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

greptile-apps

MervinPraison has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

coderabbitai

Actionable comments posted: 4

🧹 Nitpick comments (3)

src/praisonai/pyproject.toml (1)
48-48: Consider whether all/claw extras should include langextract.

The new langextract extra is added, but the all (line 141) and claw (line 216) aggregate extras still only pin langfuse>=3.0.0 for observability. If all is intended as "every optional feature," adding langextract>=1.0.0 there (and possibly in claw) keeps the meta-extras consistent with the existing observability group. If the omission is deliberate (e.g., until the feature matures), feel free to ignore.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/praisonai/pyproject.toml` at line 48, The pyproject.toml added a new
optional extra "langextract" but did not include it in the aggregate extras
"all" and "claw"; update the extras lists named "all" and "claw" to include
"langextract>=1.0.0" (or explicitly document why it’s excluded) so the
meta-extras remain consistent with other optionals like "langfuse>=3.0.0";
locate the extras definitions for "all" and "claw" in pyproject.toml and append
"langextract>=1.0.0" to those lists if you want the aggregate to enable this
feature.
src/praisonai/praisonai/observability/langextract.py (1)
82-91: Consider logger.exception (and raise a distinct error on re-raise) so render failures aren't invisible.

close() currently swallows every render exception behind logger.warning("... %s", e) at the default level, with no traceback. That means a missing langextract install, a path/permission error, or a bad schema all present as one-line warnings that are easy to miss in CLI output (and, as flagged in cli/commands/langextract.py, the render subcommand still prints "✅ Trace rendered" afterwards). You can keep the "never break the agent" contract while making the failure diagnosable.
🛠️ Suggested fix
     def close(self) -> None:
         if self._closed:
             return
         self._closed = True
         try:
             self._render()
-        except Exception as e:
+        except Exception:
             # Observability must never break the agent
             import logging
-            logging.getLogger(__name__).warning("LangextractSink render failed: %s", e)
+            logging.getLogger(__name__).exception("LangextractSink render failed")
Also: line 100–102 raises a fresh ImportError inside an except ImportError without from. Use raise ImportError(...) from None (or from err) to satisfy B904 and make chained tracebacks explicit.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/praisonai/praisonai/observability/langextract.py` around lines 82 - 91,
The close() method of LangextractSink currently swallows exceptions from
self._render() with a one-line warning; change it to use the module
logger.exception (or logging.getLogger(__name__).exception) so the full
traceback is recorded while still not propagating errors, and ensure the log
message clearly names LangextractSink/_render for context; additionally, in the
except ImportError handler that raises a new ImportError, change that raise to
use "raise ImportError(...) from None" (or "from err") to make chaining explicit
and satisfy B904.
src/praisonai/praisonai/cli/commands/langextract.py (1)
26-44: Minor: tighten exception chaining and narrow the bare except Exception.

Ruff flags B904/BLE001 here. Since these are user-facing CLI errors, preserving the chain improves debuggability and narrowing the catch avoids masking unrelated bugs (e.g., a KeyboardInterrupt-adjacent path or import-time errors in the lx.visualize call stack).
🛠️ Suggested fix
     try:
         import langextract as lx  # type: ignore
-    except ImportError:
+    except ImportError as e:
         typer.echo("Error: langextract is not installed. Install with: pip install 'praisonai[langextract]'", err=True)
-        raise typer.Exit(1)
+        raise typer.Exit(1) from e
     ...
-    except Exception as e:
+    except (OSError, RuntimeError, ValueError) as e:
         typer.echo(f"Error: Failed to render HTML: {e}", err=True)
-        raise typer.Exit(1)
+        raise typer.Exit(1) from e
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/praisonai/praisonai/cli/commands/langextract.py` around lines 26 - 44,
The catch-all "except Exception" around the lx.visualize / HTML writing block
should be tightened and preserve exception chaining: replace the bare except
with "except Exception as e:", re-raise system-level exceptions immediately if
isinstance(e, (KeyboardInterrupt, SystemExit)), then log the user-facing error
with typer.echo and raise typer.Exit(1) from e so the original traceback is
chained; target the block around lx.visualize, html.data access,
output_html.write_text and webbrowser.open and ensure the raised typer.Exit
preserves the original exception via "from e".

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/praisonai/praisonai/cli/app.py`:
- Around line 38-66: The ImportError branch in _setup_langextract_observability
is never triggered because LangextractSink delays importing the real
"langextract" until its _render()/close() calls; update
_setup_langextract_observability to proactively probe for the real dependency
(e.g., try importlib.import_module("langextract") or a minimal from langextract
import something) before constructing LangextractSink so a missing package
raises ImportError inside this function and the verbose warning with the install
hint is shown; keep the existing graceful fallback logic but move the
import-probe before creating LangextractSink and reference LangextractSink,
LangextractSinkConfig, TraceEmitter, set_default_emitter, _render(), and
sink.close() locations when making the change.

In `@src/praisonai/praisonai/cli/commands/langextract.py`:
- Around line 55-94: The script currently prints "✅ Trace rendered"
unconditionally and doesn't surface missing langextract; update the flow to (1)
in the ImportError block also probe/import the actual langextract package (or
check LangextractSink.requires_langextract) and on failure print a clear install
hint before exiting, and (2) don't rely on LangextractSink.close swallowing
errors—after praison.main() completes, explicitly call the sink render operation
(or call sink.close inside a try/except that re-raises render errors) and verify
that output_html.exists() (or catch exceptions from LangextractSink.close)
before echoing success; reference LangextractSink, LangextractSinkConfig,
sink.close, PraisonAI, yaml_path, output_html, and no_open when locating the
code to change.

In `@src/praisonai/praisonai/observability/langextract.py`:
- Around line 154-159: In the ActionEventType.OUTPUT branch in langextract.py
where Extraction is created, guard against metadata["content"] being None before
slicing: replace extraction_text=(ev.metadata or {}).get("content", "")[:1000]
with something that coalesces None to an empty string (e.g., content =
((ev.metadata or {}).get("content") or "")[:1000]) so extraction_text is always
a string and slicing cannot raise a TypeError; update the Extraction
construction to use that safe content variable.

In `@src/praisonai/tests/unit/test_langextract_sink.py`:
- Around line 183-193: The test currently patches the attribute
praisonai.observability.langextract.langextract but _render() does an
in-function import "import langextract as lx" which uses sys.modules; update the
tests to inject the mock into sys.modules (set sys.modules['langextract'] =
mock_lx) before calling sink.close() / invoking _render(), and restore/remove it
after; apply the same sys.modules-based replacement for the other two patches
referenced (the ones around lines 204 and 235) so mock_lx and any expectations
(mock_lx.data.Extraction, HTML write, mock_browser calls) are exercised.

---

Nitpick comments:
In `@src/praisonai/praisonai/cli/commands/langextract.py`:
- Around line 26-44: The catch-all "except Exception" around the lx.visualize /
HTML writing block should be tightened and preserve exception chaining: replace
the bare except with "except Exception as e:", re-raise system-level exceptions
immediately if isinstance(e, (KeyboardInterrupt, SystemExit)), then log the
user-facing error with typer.echo and raise typer.Exit(1) from e so the original
traceback is chained; target the block around lx.visualize, html.data access,
output_html.write_text and webbrowser.open and ensure the raised typer.Exit
preserves the original exception via "from e".

In `@src/praisonai/praisonai/observability/langextract.py`:
- Around line 82-91: The close() method of LangextractSink currently swallows
exceptions from self._render() with a one-line warning; change it to use the
module logger.exception (or logging.getLogger(__name__).exception) so the full
traceback is recorded while still not propagating errors, and ensure the log
message clearly names LangextractSink/_render for context; additionally, in the
except ImportError handler that raises a new ImportError, change that raise to
use "raise ImportError(...) from None" (or "from err") to make chaining explicit
and satisfy B904.

In `@src/praisonai/pyproject.toml`:
- Line 48: The pyproject.toml added a new optional extra "langextract" but did
not include it in the aggregate extras "all" and "claw"; update the extras lists
named "all" and "claw" to include "langextract>=1.0.0" (or explicitly document
why it’s excluded) so the meta-extras remain consistent with other optionals
like "langfuse>=3.0.0"; locate the extras definitions for "all" and "claw" in
pyproject.toml and append "langextract>=1.0.0" to those lists if you want the
aggregate to enable this feature.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 342bddb3-0730-4317-9d7b-84c76ba1be4c

📥 Commits

Reviewing files that changed from the base of the PR and between 6a99b73 and 73bde5a.

📒 Files selected for processing (7)

src/praisonai/praisonai/cli/app.py
src/praisonai/praisonai/cli/commands/__init__.py
src/praisonai/praisonai/cli/commands/langextract.py
src/praisonai/praisonai/observability/__init__.py
src/praisonai/praisonai/observability/langextract.py
src/praisonai/pyproject.toml
src/praisonai/tests/unit/test_langextract_sink.py

Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/07cd88a0-46d0-40f2-9e3a-0afc9c5402a9 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com>

greptile-apps

MervinPraison has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

Copilot · 2026-04-16T19:53:23Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:...

Reviewed thoroughly and incorporated available reviewer signals. I also pushed fixes in 75d4c81 (plus follow-up cleanup commits).

Findings:

Bloat/scope: code scope is mostly focused to langextract integration points (CLI wiring, sink adapter, tests). No unrelated production changes added.
Security: no hardcoded secrets / eval/exec found. Dependency advisory check for langextract==1.0.0 returned no known vulns.
Performance: optional/lazy imports are used in sink render path and observability lazy module; no obvious hot-path regression.
Tests: the new test file had multiple correctness issues; fixed (wrong extraction count expectation, brittle/mock import targets, incorrect CLI callback invocation, command registry assertion robustness, and test artifact leakage to repo). Targeted test now passes: tests/unit/test_langextract_sink.py (16 passed).
Backward compatibility: no public API removals observed; additions are additive (--observe langextract, new command group/sink).
Code quality: improved test reliability and isolation; removed accidental generated artifact from PR.
Reviewer feedback status: no substantive Qodo/Gemini comments were present via PR API at review time; CodeRabbit had an in-progress placeholder, and subsequent actionable validation nits were addressed.

Suggested follow-up improvements (not required for this fix set):

# In cli/commands/langextract.py render(): preserve previous emitter
prev = get_default_emitter()
set_default_emitter(emitter)
try:
    ...
finally:
    set_default_emitter(prev)
    sink.close()

# In cli/commands/langextract.py view(): ensure output dir exists
output_html.parent.mkdir(parents=True, exist_ok=True)
output_html.write_text(html_text, encoding="utf-8")

- Fix thread safety in LangextractSink.close() and _render() methods - Add atexit registration for proper sink cleanup on CLI exit - Fix ActionEvent OUTPUT field mapping to use tool_result_summary - Improve test module mocking for optional dependencies - Add directory creation for HTML output path - Add explicit langextract availability checking before setup - Normalize API URLs with rstrip('/') in render command - Update test signatures to match CLI callback parameters Addresses issues identified by Gemini, CodeRabbit, and Copilot reviewers. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

greptile-apps

MervinPraison has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

coderabbitai

♻️ Duplicate comments (2)

src/praisonai/praisonai/observability/langextract.py (1)

167-178: ⚠️ Potential issue | 🟡 Minor

output_text[:1000] can still raise TypeError when metadata["content"] is explicitly None.

The refactor to prefer ev.tool_result_summary didn't fully close the prior issue. When tool_result_summary and metadata["output"] are both falsy and metadata["content"] is present with value None, dict.get("content", "") returns None (the default only applies when the key is missing). output_text becomes None, and None[:1000] raises TypeError — which is then swallowed by close()'s blanket handler, silently dropping the entire HTML.

🛠️ Suggested fix

             elif et == ActionEventType.OUTPUT.value:
                 # Fix: OUTPUT events store text in tool_result_summary, not metadata['content']
                 output_text = (
                     ev.tool_result_summary
                     or (ev.metadata or {}).get("output")
-                    or (ev.metadata or {}).get("content", "")
-                )
+                    or (ev.metadata or {}).get("content")
+                    or ""
+                )
                 yield lx.data.Extraction(
                     extraction_class="final_output",
                     extraction_text=output_text[:1000],
                     attributes=attrs,
                 )

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/praisonai/praisonai/observability/langextract.py` around lines 167 - 178,
The OUTPUT branch can still set output_text to None (when ev.tool_result_summary
and metadata["output"] are falsy but metadata["content"] exists as None),
causing None[:1000] to TypeError; update the logic in the
ActionEventType.OUTPUT.value handler to coerce output_text to a string/fallback
before slicing (e.g., take ev.tool_result_summary or metadata.get("output") or
metadata.get("content") and then default to "" or str(...) if that result is
None) so that the yield of lx.data.Extraction always slices a safe string.

src/praisonai/praisonai/cli/commands/langextract.py (1)

55-94: ⚠️ Potential issue | 🟠 Major

render still prints "✅ Trace rendered" unconditionally and misses the missing-langextract install hint.

This reproduces the previously flagged issue:

Line 94 echoes success regardless of whether output_html was actually written. sink.close() swallows all render failures via its blanket except Exception → logger.warning(...) (see observability/langextract.py lines 87‑92), so any rendering error — including a missing langextract — silently produces no HTML but still reports success.
The try block at lines 55‑61 imports only LangextractSink, LangextractSinkConfig, TraceEmitter, set_default_emitter, and PraisonAI. None of those touch langextract at import time (the real import is lazy inside _render()), so ImportError here does not catch the missing-extra case; users will get the bogus success message instead of the install hint.

🛠️ Suggested fix

     try:
+        import langextract  # noqa: F401 — probe optional dep early for clear error
         from praisonai.observability import LangextractSink, LangextractSinkConfig
         from praisonaiagents.trace.protocol import TraceEmitter, set_default_emitter
         from praisonai import PraisonAI
     except ImportError as e:
-        typer.echo(f"Error: Missing dependencies: {e}", err=True)
-        raise typer.Exit(1)
+        typer.echo(
+            f"Error: Missing dependencies: {e}. "
+            "Install langextract with: pip install 'praisonai[langextract]'",
+            err=True,
+        )
+        raise typer.Exit(1) from e
@@
     finally:
         # Ensure sink is closed even if workflow fails
         sink.close()
-    
-    typer.echo(f"✅ Trace rendered: {output_html}")
+
+    if output_html.exists():
+        typer.echo(f"✅ Trace rendered: {output_html}")
+    else:
+        typer.echo(
+            f"Error: Trace was not rendered to {output_html} (see logs for details)",
+            err=True,
+        )
+        raise typer.Exit(1)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/praisonai/praisonai/cli/commands/langextract.py` around lines 55 - 94,
The success message is printed unconditionally and missing the install hint for
langextract; update the import error handling and post-run checks: when catching
ImportError during setup (imports of LangextractSink, LangextractSinkConfig,
TraceEmitter, set_default_emitter, PraisonAI) include a specific hint to install
the optional langextract extra (e.g., "pip install praisonai[langextract]") and
exit; after the run/finally block (after sink.close()) do NOT unconditionally
echo success — instead verify that output_html actually exists and is non-empty
(use output_html.exists() and stat().st_size > 0) and only then echo the success
message, otherwise print an error mentioning that rendering failed (and suggest
the langextract install hint) and exit with non-zero; ensure any ImportError or
rendering exception propagated or logged is translated into the user-facing
error/exit rather than silently swallowed by sink.close().

🧹 Nitpick comments (1)

src/praisonai/praisonai/observability/langextract.py (1)

96-103: Preserve cause on re-raised ImportError (B904).

Minor: use raise ... from err (or from None) so the chained traceback distinguishes the probe failure from the handler itself. Note this raise is always caught and logged by close()'s blanket handler, so the practical visibility of this error depends on callers invoking _render() directly.

♻️ Proposed change

     def _render(self) -> None:
         # Lazy import — langextract is optional
         try:
             import langextract as lx  # type: ignore
-        except ImportError:
+        except ImportError as err:
             raise ImportError(
                 "langextract is not installed. Install with: pip install 'praisonai[langextract]'"
-            )
+            ) from err

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/praisonai/praisonai/observability/langextract.py` around lines 96 - 103,
The re-raised ImportError in _render masks the original ImportError cause;
update the except block in the _render method to re-raise using "raise
ImportError(... ) from err" (or "from None" if you want to suppress chaining) so
the chained traceback distinguishes the probe failure from the handler; locate
the try/except around "import langextract as lx" in function _render and modify
the raise accordingly.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@src/praisonai/praisonai/cli/commands/langextract.py`:
- Around line 55-94: The success message is printed unconditionally and missing
the install hint for langextract; update the import error handling and post-run
checks: when catching ImportError during setup (imports of LangextractSink,
LangextractSinkConfig, TraceEmitter, set_default_emitter, PraisonAI) include a
specific hint to install the optional langextract extra (e.g., "pip install
praisonai[langextract]") and exit; after the run/finally block (after
sink.close()) do NOT unconditionally echo success — instead verify that
output_html actually exists and is non-empty (use output_html.exists() and
stat().st_size > 0) and only then echo the success message, otherwise print an
error mentioning that rendering failed (and suggest the langextract install
hint) and exit with non-zero; ensure any ImportError or rendering exception
propagated or logged is translated into the user-facing error/exit rather than
silently swallowed by sink.close().

In `@src/praisonai/praisonai/observability/langextract.py`:
- Around line 167-178: The OUTPUT branch can still set output_text to None (when
ev.tool_result_summary and metadata["output"] are falsy but metadata["content"]
exists as None), causing None[:1000] to TypeError; update the logic in the
ActionEventType.OUTPUT.value handler to coerce output_text to a string/fallback
before slicing (e.g., take ev.tool_result_summary or metadata.get("output") or
metadata.get("content") and then default to "" or str(...) if that result is
None) so that the yield of lx.data.Extraction always slices a safe string.

---

Nitpick comments:
In `@src/praisonai/praisonai/observability/langextract.py`:
- Around line 96-103: The re-raised ImportError in _render masks the original
ImportError cause; update the except block in the _render method to re-raise
using "raise ImportError(... ) from err" (or "from None" if you want to suppress
chaining) so the chained traceback distinguishes the probe failure from the
handler; locate the try/except around "import langextract as lx" in function
_render and modify the raise accordingly.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 5d3f2a35-d3b0-4f9a-b2ad-4f45063676dd

📥 Commits

Reviewing files that changed from the base of the PR and between 73bde5a and 5d6641f.

📒 Files selected for processing (4)

src/praisonai/praisonai/cli/app.py
src/praisonai/praisonai/cli/commands/langextract.py
src/praisonai/praisonai/observability/langextract.py
src/praisonai/tests/unit/test_langextract_sink.py

✅ Files skipped from review due to trivial changes (1)

src/praisonai/tests/unit/test_langextract_sink.py

MervinPraison · 2026-04-16T20:02:14Z

Hold on merge: validated locally and 2 unit tests fail on the current HEAD (5d6641f4):

FAILED tests/unit/test_langextract_sink.py::TestLangextractSink::test_events_to_extractions_mapping
  TypeError: LangextractSink._events_to_extractions() missing 1 required positional argument: 'events'

FAILED tests/unit/test_langextract_sink.py::TestLangextractSink::test_render_with_mock_langextract
  AssertionError: Expected 'open' to have been called once. Called 0 times.

Root causes:

The test calls sink._events_to_extractions(mock_lx, "Test input text") with 2 args, but the implementation signature requires 3 (mismatch between impl and test).
The render test expects webbrowser.open to be called when auto_open=True, but the render path isn't calling it (likely a code path where the render is mocked and open isn't reached).

Good news: The critical lazy-import invariant is preserved — from praisonai.observability import LangextractSink does NOT import langextract. All other 14 tests pass.

Please fix and re-push.

MervinPraison · 2026-04-16T20:19:22Z

No new commits since my last review. The same 2 unit tests still fail on HEAD 5d6641f4:

test_events_to_extractions_mapping — _events_to_extractions() signature mismatch between implementation and test.
test_render_with_mock_langextract — webbrowser.open not called when auto_open=True.

Please push a fix for both before re-requesting review. 14/16 tests pass + lazy-import invariant is preserved ✅.

MervinPraison · 2026-04-16T20:30:12Z

@claude

praisonai-triage-agent · 2026-04-16T20:30:35Z

- Fix test signature mismatch: _events_to_extractions() now called with correct args - Fix thread safety in LangextractSink.close() - Add atexit registration for proper shutdown - Fix ActionEvent field mapping for OUTPUT events - Fix test module mocking issues - Add directory creation for output paths - Improve import error handling - Fix API URL normalization

greptile-apps

MervinPraison has reached the 50-review limit for trial accounts. To continue receiving code reviews, upgrade your plan.

…w into sink Problem ------- The merged langextract integration (#1412, PR #1413) wired LangextractSink to get_default_emitter()/ActionEvent only. That pipeline has just two producers in the entire core SDK (RouterAgent token usage, PlanningAgent plan_created), so a typical single-agent run with '--observe langextract' or 'praisonai langextract render <yaml>' produced an empty HTML file. The base agent runtime (chat_mixin, tool_execution, unified_execution_mixin) actually emits rich lifecycle events via ContextTraceEmitter / ContextTraceSinkProtocol in praisonaiagents.trace.context_events. Fix (wrapper-only, zero core changes) ------------------------------------- - observability/langextract.py: add _ContextToActionBridge adapter that forwards ContextEvent -> ActionEvent (agent_start, tool_call_start/end, llm_response->OUTPUT, agent_end). Expose LangextractSink.context_sink(). - cli/app.py: _setup_langextract_observability now also installs a ContextTraceEmitter wired to the bridge. - cli/commands/langextract.py: 'render' command does the same wiring for the duration of the workflow run. - tests: add 3 regression tests covering the bridge and setup path. Incidental fix -------------- agents_generator.py:1112 had a local 'import os' inside an 'if acp/lsp enabled' block that shadowed the module-level import and caused UnboundLocalError at line 1179 for any YAML run with acp/lsp disabled. Removed the redundant local import. Verification ------------ - 19/19 langextract unit tests pass (16 existing + 3 new). - Real agentic e2e: Agent.start() with --observe langextract now produces non-empty trace.html (3036 B) + trace.jsonl (1356 B) with agent_run and final_output extractions. - 'praisonai langextract render simple.yaml' end-to-end now produces render.html and render.jsonl; previously printed 'Trace was not rendered'.

…duce a non-empty trace (#1420) * fix(langextract): bridge ContextTraceEmitter so real agent events flow into sink Problem ------- The merged langextract integration (#1412, PR #1413) wired LangextractSink to get_default_emitter()/ActionEvent only. That pipeline has just two producers in the entire core SDK (RouterAgent token usage, PlanningAgent plan_created), so a typical single-agent run with '--observe langextract' or 'praisonai langextract render <yaml>' produced an empty HTML file. The base agent runtime (chat_mixin, tool_execution, unified_execution_mixin) actually emits rich lifecycle events via ContextTraceEmitter / ContextTraceSinkProtocol in praisonaiagents.trace.context_events. Fix (wrapper-only, zero core changes) ------------------------------------- - observability/langextract.py: add _ContextToActionBridge adapter that forwards ContextEvent -> ActionEvent (agent_start, tool_call_start/end, llm_response->OUTPUT, agent_end). Expose LangextractSink.context_sink(). - cli/app.py: _setup_langextract_observability now also installs a ContextTraceEmitter wired to the bridge. - cli/commands/langextract.py: 'render' command does the same wiring for the duration of the workflow run. - tests: add 3 regression tests covering the bridge and setup path. Incidental fix -------------- agents_generator.py:1112 had a local 'import os' inside an 'if acp/lsp enabled' block that shadowed the module-level import and caused UnboundLocalError at line 1179 for any YAML run with acp/lsp disabled. Removed the redundant local import. Verification ------------ - 19/19 langextract unit tests pass (16 existing + 3 new). - Real agentic e2e: Agent.start() with --observe langextract now produces non-empty trace.html (3036 B) + trace.jsonl (1356 B) with agent_run and final_output extractions. - 'praisonai langextract render simple.yaml' end-to-end now produces render.html and render.jsonl; previously printed 'Trace was not rendered'. * test(langextract): make context-emitter setup test deterministic Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/cec32ee8-0573-405f-86d1-6a28fb51b609 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com> * test(langextract): cover setup behavior with and without optional dependency Agent-Logs-Url: https://github.com/MervinPraison/PraisonAI/sessions/cec32ee8-0573-405f-86d1-6a28fb51b609 Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com> * fix(langextract): improve error handling, test reliability, and reduce code duplication - Add LangextractSink.bridge_context_events() helper method to centralize context emitter setup - Fix silent exception swallowing in langextract render command - now properly reports bridge failures - Add proper skipif guard for optional dependency tests - Use callback pattern for clean separation of CLI concerns Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com> * test(langextract): remove skipif referencing undefined _langextract_available helper --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: MervinPraison <454862+MervinPraison@users.noreply.github.com> Co-authored-by: praisonai-triage-agent[bot] <272766704+praisonai-triage-agent[bot]@users.noreply.github.com> Co-authored-by: Mervin Praison <MervinPraison@users.noreply.github.com>

Copilot AI review requested due to automatic review settings April 16, 2026 19:45