The system consists of a frontend (operator machine) and a backend (machine connected to the robot). Both must be running simultaneously.
| Component | Machine | OS |
|---|---|---|
| Frontend — GUI | Operator machine | Windows 11 |
| Backend — Mock / Universal Robots UR10e | Same machine as frontend, or a separate Windows PC | Windows 11 |
| Backend — Franka Emika Panda | Dedicated Linux PC | Ubuntu 20.04 (RT kernel) |
Frontend and backend communicate over ZeroMQ on port 5555. For Franka, the Linux PC must be reachable from the operator machine on the network.
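Before launching the full stack, the frontend-to-backend link can be smoke-tested at the TCP level. The sketch below is illustrative only — `backend_reachable` is a hypothetical helper, not part of the application; it assumes nothing beyond the backend binding port 5555. A successful connect proves the port is open, not that the ZeroMQ REQ/REP protocol on top of it works.

```python
import socket

def backend_reachable(host: str, port: int = 5555, timeout: float = 1.0) -> bool:
    """Return True if a plain TCP connection to the backend port succeeds.

    This only confirms the port is open and reachable from this machine;
    the application itself speaks ZeroMQ over the same port.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

if __name__ == "__main__":
    # For a backend on the same machine; substitute the Linux PC's IP for Franka.
    print(backend_reachable("localhost"))
```

If this returns `False` while the backend is running, suspect firewall rules or IP configuration rather than the application.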
LM Studio must be running before the application starts:
- Download from lmstudio.ai and install it.
- Download a `meta-llama-3.1-8b-instruct` model in the **Model Search** tab on the left.
- Start the local server on port `1234`, using the **Developer** tab on the left.
Note: LM Studio is required for all adapters, including the mock adapter. The mock adapter requires no robot hardware, but LM Studio must still be running for LLM parsing.
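LM Studio serves an OpenAI-compatible chat-completions endpoint, which is how the application's LLM parsing reaches the local model. The sketch below shows what such a request looks like with the defaults used in this guide (server at `localhost:1234`, model `meta-llama-3.1-8b-instruct`); the helper name and payload details are our illustration, not the application's actual code.

```python
import json
import urllib.request

LLM_API_BASE = "http://localhost:1234/v1"
LLM_MODEL_NAME = "meta-llama-3.1-8b-instruct"

def build_chat_request(prompt: str) -> urllib.request.Request:
    """Build an OpenAI-style chat-completions request for the local LM Studio server."""
    payload = {
        "model": LLM_MODEL_NAME,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.1,
    }
    return urllib.request.Request(
        f"{LLM_API_BASE}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Actually sending the request requires LM Studio to be running:
# with urllib.request.urlopen(build_chat_request("move left"), timeout=60) as resp:
#     reply = json.load(resp)["choices"][0]["message"]["content"]
```

If this request fails with a connection error, LM Studio's local server is not running on port 1234.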
Use PowerShell as administrator to run the frontend setup script:
```powershell
irm https://raw.githubusercontent.com/Meisdy/Speech-to-Code-Generation-for-Collaborative-Robots/main/Setup/setup_frontend.ps1 | iex
```

The script installs ffmpeg and uv via winget if missing, downloads the application to `C:\Program Files\Speech-to-Cobot`, creates a Desktop shortcut, and pre-downloads the Whisper small model (~466 MB). Full script details are in `setup_frontend.ps1`.
To uninstall, see Uninstall.
Use PowerShell as administrator to run the backend setup script:
```powershell
irm https://raw.githubusercontent.com/Meisdy/Speech-to-Code-Generation-for-Collaborative-Robots/main/Setup/setup_backend.ps1 | iex
```

The script installs uv via winget if missing, downloads the application to `C:\Program Files\Speech-to-Cobot-Backend`, and creates a Desktop shortcut. Full script details are in `setup_backend.ps1`.
Before running with a real UR robot, see Before Each Session for robot preparation steps and Network Setup for IP configuration. The mock adapter works out of the box without any robot hardware or network configuration.
This setup is time-consuming and not recommended as a first test. It uses an older robot and a full Linux FCI stack that must be configured manually. If you do not need to use a Franka Emika Panda robot, skip this section.
The Franka adapter requires a dedicated Linux PC with a real-time kernel. This cannot be automated easily — the ROS stack must be set up manually first.
- Ubuntu 20.04 with PREEMPT_RT real-time kernel
- ROS Noetic
- MoveIt (`panda_moveit_config`, `franka_control`)
- libfranka (compatible with your robot firmware)
- A working MoveIt workspace at `~/ws_moveit`
See the official docs: ROS Noetic, MoveIt, libfranka.
Ensure the backend source code is on the Linux PC before launching.
```shell
python3 -m Backend.main
```

The backend connects to the robot at `192.168.1.100` by default. Verify `ROBOT_IP` in `Backend/robot_controllers/franka_controller.py` before launching.
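To double-check the configured address without opening an editor, the constant can be read straight out of the controller file. The regex and helper below are our own illustration (not application code) and assume the assignment keeps its simple `ROBOT_IP = "x.x.x.x"` form:

```python
import re
from typing import Optional

def extract_robot_ip(source: str) -> Optional[str]:
    """Pull a ROBOT_IP = "x.x.x.x" assignment out of Python source text."""
    match = re.search(r'ROBOT_IP\s*=\s*["\']([\d.]+)["\']', source)
    return match.group(1) if match else None

# Example usage on the Linux PC:
# with open("Backend/robot_controllers/franka_controller.py") as f:
#     print(extract_robot_ip(f.read()))

print(extract_robot_ip('ROBOT_IP = "192.168.1.100"'))  # → 192.168.1.100
```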
These steps are required before every session, separate from the one-time software installation above.
- Power on the UR10e via the Teach Pendant Start Button.
- Set the robot mode from Local to Remote Mode via Teach Pendant.
- Plug in the Ethernet cable between the robot and the backend machine.
- (First-time only) Ensure the backend machine and robot are on the same network and IPs are configured — see Network Setup.
- Launch frontend and backend from their Desktop shortcuts. Use Ping Backend in the GUI to test the backend connection.
Always use Remote Mode. In Local Mode, motion commands are silently ignored — the robot accepts them but does not move. Gripper and save-position still work, which can make the root cause difficult to identify.
- Turn on the Franka controller.
- Start the Linux desktop PC, selecting the RT kernel at boot.
- (First-time only) Configure the network adapters on the Linux PC — one toward the robot, one toward the operator machine. Follow the Franka documentation for initial network setup.
- Check that the network adapters are active and plug in the Ethernet cable between the Linux PC and the operator machine. Run a test ping to confirm the connection.
- Open the Franka Desk in a browser at `https://<franka-robot-ip>`. Accept the self-signed certificate warning — most browsers will show this.
- Unlock the axes in the web Desk, using the open-lock symbol.
- Engage the hand switch for the robot, then release it.
- Enable FCI in Desk.
- Start the backend on the Linux PC: `python3 -m Backend.main`
- Start the frontend on the operator machine. The backend must be reachable before any commands can be sent.
The Franka must show blue (Ready) before issuing motion commands. In hand-guide mode (white), commands are accepted but not executed.
The setup below describes the configuration used during development and evaluation: a direct Ethernet connection between the operator machine and the robot or backend PC. Using a router or switch is also possible if available, but was not tested.
1. Find the robot IP. Consult your robot's interface or documentation to locate the current IP address.

2. On the backend machine, set a static IP on the Ethernet adapter connected to the robot, in the same subnet as the robot. On Windows 11: Settings → Network & Internet → Advanced network settings → select the correct adapter → Edit → click IPv4 in the list → Properties → set to Manual. No gateway is needed. This setting persists per adapter.

   Subnet example: if the robot IP is `192.168.1.100`, set the adapter IP to any address in `192.168.1.x` that is not already in use (e.g. `192.168.1.10`). Subnet mask changes are not needed.

3. Close the Ethernet Properties window. Disable and re-enable the adapter for the IP change to take effect.

4. Verify the connection by opening a Command Prompt and running:

   ```shell
   ping <robot-ip>
   ```

   A successful reply confirms the adapter and IP are configured correctly.

5. Verify that the robot IP in `C:\Program Files\Speech-to-Cobot-Backend\Backend\robot_controllers\<your_robot_controller>.py` (your chosen Robot Adapter) matches the actual robot IP. If you are using the UR robot, also set `PC_IP` in `C:\Program Files\Speech-to-Cobot-Backend\Backend\config_backend.py` to the backend machine's adapter IP you chose in step 2 — this is required for UR motion callbacks. All source files are Python and can be opened with any text editor, including Notepad. No Python installation is needed to edit them.

6. If the frontend and backend run on different machines, both need to be on the same network. Configure a static IP on each machine's Ethernet adapter facing the other (same process as step 2). Then update `BACKEND_IPS` in `Frontend/config_frontend.py` with the backend machine's IP. If both run on the same machine, `localhost` is correct and no network configuration is needed.
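The same-subnet rule above can be sanity-checked with Python's standard `ipaddress` module. This small sketch (the helper is ours, and it assumes the common /24 mask used in the example) tells you whether two addresses can talk directly:

```python
import ipaddress

def same_subnet(ip_a: str, ip_b: str, prefix: int = 24) -> bool:
    """True if both addresses fall inside the same /prefix network."""
    network = ipaddress.ip_network(f"{ip_a}/{prefix}", strict=False)
    return ipaddress.ip_address(ip_b) in network

print(same_subnet("192.168.1.100", "192.168.1.10"))  # robot and adapter IP: True
print(same_subnet("192.168.1.100", "192.168.2.20"))  # different subnet: False
```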
Frontend/config_frontend.py backend address reference:
```python
BACKEND_IPS = {
    "franka": "tcp://192.168.2.20:5555",
    "ur": "tcp://localhost:5555",
    "mock": "tcp://localhost:5555",
}
```

Use Ping Backend in the GUI to verify. A green indicator confirms the connection immediately. If it does not turn green, check that the backend is running, IPs are correct, and port 5555 is not blocked by a firewall.
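Because a wrong scheme or port in `BACKEND_IPS` produces exactly the "never turns green" symptom, the entries can also be validated programmatically. The checker below is a hypothetical sketch using only the standard library and assumes the default port 5555:

```python
from urllib.parse import urlparse

BACKEND_IPS = {
    "franka": "tcp://192.168.2.20:5555",
    "ur": "tcp://localhost:5555",
    "mock": "tcp://localhost:5555",
}

def check_backend_addresses(addresses: dict) -> list:
    """Return human-readable problems; an empty list means all entries look sane."""
    problems = []
    for robot, address in addresses.items():
        parsed = urlparse(address)
        if parsed.scheme != "tcp":
            problems.append(f"{robot}: scheme should be tcp, got {parsed.scheme!r}")
        if parsed.port != 5555:
            problems.append(f"{robot}: expected port 5555, got {parsed.port}")
    return problems

print(check_backend_addresses(BACKEND_IPS))  # → []
```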
After the initial setup, the backend and robot should be reachable without reconfiguring IPs. Continue with the session setup steps now.
Frontend — Frontend/config_frontend.py
| Parameter | Default | Description |
|---|---|---|
| `ASR_MODEL_SIZE` | `small` | Whisper model: `tiny`, `base`, `small`, `medium`, `large` |
| `ASR_LANGUAGE` | `en` | ISO language code for ASR |
| `ASR_SAMPLE_RATE` | `16000` | Audio sample rate in Hz |
| `ASR_CONFIDENCE_THRESHOLD` | `0.7` | Warn on transcripts below this threshold (0.0–1.0) |
| `ASR_FP16` | `False` | Requires CUDA (not yet implemented) |
| `LLM_API_BASE` | `http://localhost:1234/v1` | LM Studio server URL |
| `LLM_MODEL_NAME` | `meta-llama-3.1-8b-instruct` | Model name as shown in LM Studio |
| `LLM_TEMPERATURE` | `0.1` | Low = more deterministic output |
| `LLM_MAX_TOKENS` | `2048` | Maximum tokens in the LLM response |
| `LLM_TIMEOUT` | `60` | LLM timeout in seconds |
| `BACKEND_IPS` | `{"franka": "tcp://192.168.2.20:5555", "ur": "tcp://localhost:5555", "mock": "tcp://localhost:5555"}` | ZeroMQ addresses per robot backend |
| `ROBOT_TYPE_KEYS` | `{"Franka Emika": "franka", "Universal Robot": "ur", "Mock Adapter": "mock"}` | Maps GUI display names to backend robot types |
| `MAX_ATTEMPTS` | `2` | Retry attempts when the backend is unreachable |
| `LOGGING_LEVEL` | `INFO` | Console log verbosity |
| `LOGGING_LEVEL_FILE` | `DEBUG` | File log verbosity (can differ from console) |
| `LOGGING_SAVE_AUDIO` | `False` | Save microphone input as `.wav` for debugging |
| `LOGGING_SAVE_PARSE` | `False` | Save parsed JSON for parser debugging |
Backend — Backend/config_backend.py
| Parameter | Default | Description |
|---|---|---|
| `BINDING_ADDRESS` | `tcp://*:5555` | ZeroMQ binding address |
| `ZMQ_TIMEOUT_MS` | `1000` | ZeroMQ socket timeout in milliseconds |
| `AVAILABLE_ROBOTS` | `["mock", "franka", "ur"]` | Adapters to load |
| `ALLOWED_COMMANDS` | `["ping", "execute_sequence", "save_script", "run_script", "stop_script", "get_script_status", "delete_script"]` | Main-level commands the backend accepts |
| `PC_IP` | `192.168.1.101` | Backend machine IP as seen by the UR robot (UR motion callback only) |
| `LOGGING_LEVEL` | `INFO` | Console log verbosity |
| `LOGGING_LEVEL_FILE` | `DEBUG` | File log verbosity |
```powershell
irm https://raw.githubusercontent.com/Meisdy/Speech-to-Code-Generation-for-Collaborative-Robots/main/Setup/uninstall_frontend.ps1 | iex
```

Removes the installation directory, Desktop shortcut, and Whisper cache. Prompts offer to optionally remove ffmpeg and uv. LM Studio must be uninstalled manually via Settings → Apps.
```powershell
irm https://raw.githubusercontent.com/Meisdy/Speech-to-Code-Generation-for-Collaborative-Robots/main/Setup/uninstall_backend.ps1 | iex
```