bug: IORails ModelEngine double /v1 in URL when base_url includes /v1

### Did you check docs and existing issues?

- [x] I have read all the NeMo-Guardrails docs
- [x] I have updated the package to the latest version before submitting this issue
- [ ] (optional) I have used the develop branch
- [x] I have searched the existing issues of NeMo-Guardrails

### Python version (python --version)

3.12

### Operating system/version

26.4

### NeMo-Guardrails version (if you must use a specific version and not the latest)

_No response_

### Describe the bug

`ModelEngine._resolve_base_url()` returns the user-provided `base_url` as-is, and `_prepare_request()` appends `/v1/chat/completions` to it. When the user sets `base_url` to an OpenAI-compatible endpoint that already includes `/v1` (the standard convention for vLLM, LiteLLM, etc.), the constructed URL becomes `/v1/v1/chat/completions`, resulting in a 404.

This only affects IORails — LLMRails uses the LangChain OpenAI client which appends only `/chat/completions` to the base URL, so the same config works fine under LLMRails.

The root cause is a convention mismatch: `_CHAT_COMPLETIONS_ENDPOINT` is `"/v1/chat/completions"`, and the default `_ENGINE_BASE_URLS` values intentionally omit `/v1` to match it. User-provided URLs that follow the LLMRails / OpenAI SDK convention of including `/v1` therefore end up with the prefix twice.
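
To make the mismatch concrete, here is a minimal sketch of the concatenation described above. `naive_join` is a hypothetical stand-in written for this issue, not the actual `ModelEngine` code; only the endpoint value is taken from the source.

```python
# Illustrative stand-in only; this is NOT the NeMo-Guardrails implementation.
# The endpoint value is the one quoted in this issue.
CHAT_COMPLETIONS_ENDPOINT = "/v1/chat/completions"


def naive_join(base_url: str) -> str:
    """Mimic the reported behavior: append the endpoint to base_url as-is."""
    return base_url + CHAT_COMPLETIONS_ENDPOINT


# Base URL without /v1 (the style the built-in defaults are said to use).
default_style = naive_join("https://my-openai-compatible-server.com")

# Base URL with /v1 (the OpenAI SDK / LangChain convention).
user_style = naive_join("https://my-openai-compatible-server.com/v1")

print(default_style)
print(user_style)
```

Under these assumptions, the second URL contains `/v1/v1/chat/completions`, which matches the path seen in the logs.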

### Steps To Reproduce

1. Create a config with `base_url` ending in `/v1` and flows that trigger IORails.
2. Use the Guardrails Python API:

```python
import asyncio
from nemoguardrails import RailsConfig
from nemoguardrails.guardrails.guardrails import Guardrails

async def main():
    config = RailsConfig.from_path("path/to/config")
    g = Guardrails(config=config)
    print(f"Engine: {type(g.rails_engine).__name__}")  # IORails
    await g.startup()
    result = await g.generate_async(
        messages=[{"role": "user", "content": "Hello"}]
    )
    print(result)

asyncio.run(main())
```

3. Observe the log output showing the doubled path:

```
HTTP POST https://my-openai-compatible-server.com/v1/v1/chat/completions model='my-model'
HTTP 404 from model 'my-model': {"detail":"Not Found"}
```

### Expected Behavior

The URL should be `https://my-openai-compatible-server.com/v1/chat/completions` — a single `/v1` prefix — regardless of whether the user includes `/v1` in `base_url` or not.
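
One possible way to get this behavior is to strip a trailing `/v1` from the base URL before appending the endpoint. The sketch below only illustrates that idea. `build_chat_completions_url` is a hypothetical helper, not an existing NeMo-Guardrails function, and the maintainers may prefer a different fix (for example, changing the endpoint constant and the defaults instead).

```python
from urllib.parse import urlsplit, urlunsplit

# Endpoint value as quoted in this issue.
CHAT_COMPLETIONS_ENDPOINT = "/v1/chat/completions"


def build_chat_completions_url(base_url: str) -> str:
    """Hypothetical helper: return a URL with exactly one /v1 before
    /chat/completions, whether or not base_url already ends in /v1."""
    parts = urlsplit(base_url)
    path = parts.path.rstrip("/")  # tolerate a trailing slash
    if path.endswith("/v1"):
        path = path[: -len("/v1")]  # drop the user-supplied /v1 segment
    return urlunsplit(
        (
            parts.scheme,
            parts.netloc,
            path + CHAT_COMPLETIONS_ENDPOINT,
            parts.query,
            parts.fragment,
        )
    )


for url in (
    "https://my-openai-compatible-server.com",
    "https://my-openai-compatible-server.com/v1",
    "https://my-openai-compatible-server.com/v1/",
):
    print(build_chat_completions_url(url))
```

All three inputs in the loop resolve to the same URL. A base URL with a path prefix, such as `https://host/proxy/v1`, keeps the prefix and still gets a single `/v1`.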

### Actual Behavior

IORails constructs a URL with a doubled `/v1`, `https://my-openai-compatible-server.com/v1/v1/chat/completions`, which returns a 404 from the upstream server.

The same config works correctly under LLMRails because the LangChain OpenAI client only appends `/chat/completions`.
