
BPSA - Beyond Python SmolAgents

How to Install BPSA

BPSA (Beyond Python SmolAgents) is a fork of smolagents that extends the original library's abilities:

  • 💻 Interactive CLI (bpsa): Multi-turn REPL with slash commands, command history, tab completion, session stats, and auto-approve mode.
  • 🔄 Infinite runtime CLI (ad-infinitum): Allows agents to run ad infinitum via autonomous looping.
  • 🗜️ Context compression: Biologically inspired automatic LLM-based summarization of older memory steps to manage context window size during long-running tasks.
  • 🌐 Browser integration: Control a headed Chromium browser from agent code blocks via Playwright (--browser flag).
  • 🖥️ GUI interaction: Launch, screenshot, click, type, and send keys to native GUI applications on X11 via xdotool/ImageMagick (--gui-x11 flag).
  • 🔌 MCP server integration: Connect any Model Context Protocol server as a tool source via the --mcp CLI flag. Supports both HTTP (Streamable HTTP) and stdio-based servers.
  • 👁️ Image loading: Agents can load and visually inspect image files (plots, screenshots, diagrams) via the built-in load_image tool (always available, no flags needed).
  • 🎨 Image tools: Visual image diffing (diff_images), OCR text extraction from images (screen_ocr), and a canvas for drawing shapes, text, and annotations (canvas_create, canvas_draw). Always available.
  • 🎤 Dictation input: Dictate prompts via microphone using Whisper or ElevenLabs transcription (/dictation command, requires BPSA_DICTATION_TRANSCRIBER env var).
  • ⚡ Native Python execution: Execute Python code natively via exec for unrestricted processing.
  • 🌍 Multi-language support: Code in multiple languages beyond Python (Pascal, PHP, C++, Java and more).
  • 🛠️ Developer tools: Lots of new tools that help agents to compile, test, and debug source code in various computing languages.
  • 👥 Multi-agent collaboration: Collaborate across multiple agents to solve complex problems.
  • 🔍 Research tools: Tools that help agents to research and write technical documentation.
  • 📚 Documentation generation: Generate and update documentation including READMEs for existing codebases.
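
To make the context compression item in the list above more concrete, here is a minimal sketch of the general idea: once the recorded steps grow past a size budget, the older ones are folded into a single summary while the most recent ones are kept verbatim. This is not BPSA's actual implementation. The names `Step`, `compress_memory`, and `fake_summarize` are hypothetical, the budget is measured in characters rather than tokens for simplicity, and the summarizer is a stand-in for a real LLM call.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    """One memory entry (hypothetical); `text` stands in for what the agent recorded."""
    text: str
    is_summary: bool = False


def compress_memory(
    steps: List[Step],
    summarize: Callable[[List[str]], str],
    max_chars: int,
    keep_recent: int = 2,
) -> List[Step]:
    """Fold older steps into one summary step once the total size exceeds max_chars."""
    total = sum(len(s.text) for s in steps)
    # Nothing to do while under budget, or when there are no older steps to fold.
    if total <= max_chars or len(steps) <= keep_recent:
        return list(steps)
    split = len(steps) - keep_recent
    older, recent = steps[:split], steps[split:]
    summary = summarize([s.text for s in older])
    return [Step(text=summary, is_summary=True)] + list(recent)


def fake_summarize(texts: List[str]) -> str:
    """Placeholder for an LLM call; it only reports how many steps were folded."""
    return f"[summary of {len(texts)} earlier steps]"


if __name__ == "__main__":
    history = [Step("a" * 50), Step("b" * 50), Step("c" * 50), Step("d" * 50)]
    for step in compress_memory(history, fake_summarize, max_chars=100):
        print(step.is_summary, step.text[:40])
```

Keeping the latest steps untouched is a common design choice in sketches like this, since the agent usually needs the exact wording of its most recent observations, while older ones can tolerate lossy summaries.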

Installation

Install the project together with the CLIs and the optional extras for dictation, browser integration, the OpenAI protocol, and LiteLLM.

$ pip install "bpsa[dictation,browser,openai,litellm]"
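
After installing, one quick way to confirm that the package is visible to the current interpreter is to query its metadata. The sketch below uses only the Python standard library. It assumes the distribution name is `bpsa`, as in the pip command above, and `installed_version` is a helper name made up for this example.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "bpsa" is taken from the pip command; adjust if the distribution is named differently.
    version = installed_version("bpsa")
    if version:
        print(f"bpsa {version}")
    else:
        print("bpsa is not installed in this environment")
```

This only checks that the base distribution is present; it does not verify that each optional extra was installed.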

Find out more at the BPSA GitHub repository.


  1. joaopauloschuler/beyond-python-smolagents (Public)

    Forked from huggingface/smolagents

    Beyond Python Smolagents is a fork of the original smolagents that extends its abilities to code in Pascal, PHP, and other languages.

    Python 31 71