Commit 3daa5eb

Add OCI LangChain support for hosted Nemotron workflows
Signed-off-by: Federico Kamelhar <federico.kamelhar@oracle.com>
1 parent f51c41c commit 3daa5eb

File tree

19 files changed, +1521 −818 lines


docs/source/build-workflows/llms/index.md

Lines changed: 29 additions & 0 deletions

@@ -28,6 +28,7 @@ NVIDIA NeMo Agent Toolkit supports the following LLM providers:
 | [OpenAI](https://openai.com) | `openai` | OpenAI API |
 | [AWS Bedrock](https://aws.amazon.com/bedrock/) | `aws_bedrock` | AWS Bedrock API |
 | [Azure OpenAI](https://learn.microsoft.com/en-us/azure/ai-foundry/openai/quickstart) | `azure_openai` | Azure OpenAI API |
+| [OCI Hosted OpenAI-Compatible](https://docs.oracle.com/en-us/iaas/Content/generative-ai/home.htm) | `oci` | OCI-hosted OpenAI-compatible API, including OCI Generative AI or OKE-hosted gateways |
 | [LiteLLM](https://github.com/BerriAI/litellm) | `litellm` | LiteLLM API |
 | [Hugging Face](https://huggingface.co) | `huggingface` | Hugging Face API |
 | [Hugging Face Inference](https://huggingface.co/docs/api-inference) | `huggingface_inference` | Hugging Face Inference API, Endpoints, and TGI |
@@ -52,6 +53,10 @@ llms:
   azure_openai_llm:
     _type: azure_openai
     azure_deployment: gpt-4o-mini
+  oci_llm:
+    _type: oci
+    model_name: nvidia/Llama-3.1-Nemotron-Nano-8B-v1
+    endpoint: https://inference.generativeai.us-chicago-1.oci.oraclecloud.com/20231130/actions/chat/completions/openai/v1
   litellm_llm:
     _type: litellm
     model_name: gpt-4o
@@ -118,6 +123,30 @@ The AWS Bedrock LLM provider is defined by the {py:class}`~nat.llm.aws_bedrock_l
 * `credentials_profile_name` - The credentials profile name to use for the model
 * `max_retries` - The maximum number of retries for the request

+### OCI Hosted OpenAI-Compatible
+
+You can use the following environment variables to configure the OCI Generative AI LLM provider:
+
+* `OCI_GENAI_API_KEY` - The API key or bearer token to access the OCI-hosted endpoint
+* `OCI_GENAI_BASE_URL` - The OCI OpenAI-compatible endpoint base URL
+* `OCI_GENAI_ENDPOINT` - Alternate OCI Generative AI endpoint variable
+
+The OCI OpenAI-compatible LLM provider is defined by the {py:class}`~nat.llm.oci_llm.OCIModelConfig` class.
+
+* `model_name` - The name of the model to use
+* `endpoint` - The OCI OpenAI-compatible endpoint base URL
+* `temperature` - The temperature to use for the model
+* `top_p` - The top-p value to use for the model
+* `max_tokens` - The maximum number of tokens to generate
+* `seed` - The seed to use for the model
+* `api_key` - The API key to use for the model
+* `max_retries` - The maximum number of retries for the request
+* `request_timeout` - HTTP request timeout in seconds
+
+:::{note}
+This provider targets OCI-hosted OpenAI-compatible chat-completions endpoints and does not enable the Responses API.
+:::
 ### Azure OpenAI

 You can use the following environment variables to configure the Azure OpenAI LLM provider:

docs/source/components/integrations/index.md

Lines changed: 2 additions & 1 deletion

@@ -23,4 +23,5 @@ limitations under the License.
 ./frameworks.md
 ./a2a.md
 AWS Bedrock <./integrating-aws-bedrock-models.md>
-```
+OCI Generative AI <./integrating-oci-generative-ai-models.md>
+```
docs/source/components/integrations/integrating-oci-generative-ai-models.md (new file)

Lines changed: 98 additions & 0 deletions

@@ -0,0 +1,98 @@
<!--
SPDX-FileCopyrightText: Copyright (c) 2025-2026, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
SPDX-License-Identifier: Apache-2.0

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->

# NVIDIA NeMo Agent Toolkit OCI Integration

The NeMo Agent Toolkit supports integration with multiple [LLM](../../build-workflows/llms/index.md) providers, including OCI Generative AI. The `oci` provider uses OCI SDK authentication and is designed for OCI Generative AI model and endpoint access. For workflow parity with the AWS Bedrock path, the toolkit also includes a LangChain wrapper built on `langchain-oci`.

To view the full list of supported LLM providers, run `nat info components -t llm_provider`.

## Configuration

### Prerequisites
Before integrating OCI, ensure you have:

- access to OCI Generative AI in the target region
- a valid OCI auth method such as `API_KEY`, `SECURITY_TOKEN`, `INSTANCE_PRINCIPAL`, or `RESOURCE_PRINCIPAL`
- the target compartment OCID
- the Generative AI service endpoint for the region or a custom endpoint URL

Common deployment patterns include:

- OCI Generative AI regional endpoints
- custom OCI Generative AI endpoints
- OCI-hosted inference for NVIDIA Nemotron used as a live integration target

### Example Configuration
Add the OCI LLM configuration to your workflow config file:

```yaml
llms:
  oci_llm:
    _type: oci
    model_name: nvidia/Llama-3.1-Nemotron-Nano-8B-v1
    endpoint: https://inference.generativeai.us-chicago-1.oci.oraclecloud.com
    compartment_id: ocid1.compartment.oc1..example
    auth_type: API_KEY
    auth_profile: API_KEY_AUTH
    temperature: 0.0
    max_tokens: 1024
    top_p: 1.0
    request_timeout: 60
```
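The `auth_type` values listed in the prerequisites split into file-backed and principal-based methods. A hypothetical pre-flight helper (not part of the toolkit) illustrating which types need `auth_file_location` and `auth_profile`:

```python
# Hypothetical helper, not part of the toolkit: file-backed auth types read
# the OCI config file and profile, while principal-based types authenticate
# via the runtime environment (instance or resource principal).
FILE_BACKED = {"API_KEY", "SECURITY_TOKEN"}
PRINCIPAL_BASED = {"INSTANCE_PRINCIPAL", "RESOURCE_PRINCIPAL"}


def needs_config_file(auth_type: str) -> bool:
    if auth_type not in FILE_BACKED | PRINCIPAL_BASED:
        raise ValueError(f"unsupported auth_type: {auth_type}")
    return auth_type in FILE_BACKED


print(needs_config_file("API_KEY"))             # True: reads ~/.oci/config
print(needs_config_file("INSTANCE_PRINCIPAL"))  # False: no config file needed
```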
### Configurable Options

* `model_name`: The name of the OCI-hosted model to use (required)
* `endpoint`: The OCI Generative AI service endpoint or custom endpoint URL
* `compartment_id`: OCI compartment OCID
* `auth_type`: OCI SDK auth type
* `auth_profile`: OCI profile name for file-backed auth
* `auth_file_location`: Path to the OCI config file
* `provider`: Optional OCI provider override such as `meta`, `google`, `cohere`, or `openai`
* `temperature`: Controls randomness in the output (0.0 to 1.0)
* `max_tokens`: Maximum number of tokens to generate
* `top_p`: Top-p sampling parameter (0.0 to 1.0)
* `seed`: Optional random seed
* `max_retries`: Maximum number of retries for the request
* `request_timeout`: HTTP request timeout in seconds
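A hedged sketch of the kind of retry loop that a `max_retries` setting drives; the toolkit's actual retry behavior comes from `RetryMixin` and may use different backoff, jitter, and exception types:

```python
import time


# Generic retry-loop sketch; illustrative only, not the toolkit's RetryMixin.
def call_with_retries(send, max_retries: int = 10, base_delay: float = 0.0):
    last_err = None
    for attempt in range(max_retries + 1):
        try:
            return send()
        except ConnectionError as err:  # stand-in for transient HTTP errors
            last_err = err
            time.sleep(base_delay * (2 ** attempt))  # exponential backoff
    raise last_err


calls = {"n": 0}


def flaky():
    # Fails twice, then succeeds, to exercise the retry path.
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("transient")
    return "ok"


print(call_with_retries(flaky))  # succeeds on the third attempt
```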
### Limitations
* This provider targets OCI Generative AI through the OCI SDK-backed `langchain-oci` path.
* The Responses API is not enabled for this provider in the current release.

## Nemotron on OCI

A common OCI deployment pattern is NVIDIA Nemotron hosted on OCI and exposed through an OpenAI-compatible route. In that setup, the toolkit can validate live integration behavior against the OCI-hosted Nemotron endpoint while the official provider and LangChain wrapper cover the OCI Generative AI path.

## Usage
Reference the OCI LLM in your configuration:

```yaml
llms:
  oci_llm:
    _type: oci
    model_name: nvidia/Llama-3.1-Nemotron-Nano-8B-v1
    endpoint: https://inference.generativeai.us-chicago-1.oci.oraclecloud.com
    compartment_id: ocid1.compartment.oc1..example
    auth_profile: API_KEY_AUTH
```

## Troubleshooting
* `401 Unauthorized`: verify the OCI profile, signer, and IAM permissions for Generative AI.
* `404 Not Found`: confirm the regional endpoint or custom endpoint URL is correct.
* Connection errors: verify OCI networking and regional endpoint reachability.
* Tool calling issues: verify the served model supports tool calling and that the serving stack is configured for it.
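The status-code entries in the troubleshooting list above can be condensed into a lookup table; a hypothetical helper, not part of the toolkit:

```python
# Hypothetical mapping from HTTP status code to the remediation hints above.
HINTS = {
    401: "verify the OCI profile, signer, and IAM permissions for Generative AI",
    404: "confirm the regional endpoint or custom endpoint URL is correct",
}


def hint_for(status: int) -> str:
    # Fall back to the generic connectivity advice for unlisted codes.
    return HINTS.get(status, "verify OCI networking and regional endpoint reachability")


print(hint_for(404))
```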

docs/source/conf.py

Lines changed: 2 additions & 0 deletions

@@ -379,6 +379,8 @@ def _build_api_tree() -> Path:
     '/extend/custom-components/gated-fields.html',
     'extend/integrating-aws-bedrock-models':
     '/components/integrations/integrating-aws-bedrock-models.html',
+    'extend/integrating-oci-generative-ai-models':
+    '/components/integrations/integrating-oci-generative-ai-models.html',
     'extend/memory':
     '/extend/custom-components/memory.html',
     'extend/object-store':

docs/source/get-started/installation.md

Lines changed: 1 addition & 0 deletions

@@ -27,6 +27,7 @@ The following [LLM](../build-workflows/llms/index.md) API providers are supporte
 - OpenAI
 - AWS Bedrock
 - Azure OpenAI
+- OCI Generative AI

 ## Packages

examples/frameworks/agno_personal_finance/pyproject.toml

Lines changed: 1 addition & 1 deletion

@@ -34,7 +34,7 @@ classifiers = ["Programming Language :: Python"]
 [tool.setuptools_dynamic_dependencies]
 dependencies = [
     "nvidia-nat[agno,test] == {version}",
-    "openai~=1.106",
+    "openai>=1.106,<3.0.0",
 ]

 [tool.uv.sources]

examples/frameworks/multi_frameworks/pyproject.toml

Lines changed: 1 addition & 1 deletion

@@ -38,7 +38,7 @@ dependencies = [
     "beautifulsoup4~=4.13",
     "markdown-it-py~=3.0",
     "nvidia-haystack~=0.3.0",
-    "openai~=1.106",
+    "openai>=1.106,<3.0.0",
 ]

 [tool.uv.sources]

packages/nvidia_nat_agno/pyproject.toml

Lines changed: 1 addition & 1 deletion

@@ -57,7 +57,7 @@ dependencies = [
     "agno>=1.2.3,<2.0.0",
     "google-search-results>=2.4.2,<3.0.0",
     "litellm~=1.74",
-    "openai~=1.106",
+    "openai>=1.106,<3.0.0",
 ]

 [tool.setuptools_dynamic_dependencies.optional-dependencies]
packages/nvidia_nat_core/src/nat/llm/oci_llm.py (new file)

Lines changed: 78 additions & 0 deletions

@@ -0,0 +1,78 @@
# SPDX-FileCopyrightText: Copyright (c) 2024-2026, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
# SPDX-License-Identifier: Apache-2.0
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

from pydantic import AliasChoices
from pydantic import ConfigDict
from pydantic import Field

from nat.builder.builder import Builder
from nat.builder.llm import LLMProviderInfo
from nat.cli.register_workflow import register_llm_provider
from nat.data_models.llm import LLMBaseConfig
from nat.data_models.optimizable import OptimizableField
from nat.data_models.optimizable import OptimizableMixin
from nat.data_models.optimizable import SearchSpace
from nat.data_models.retry_mixin import RetryMixin
from nat.data_models.ssl_verification_mixin import SSLVerificationMixin
from nat.data_models.thinking_mixin import ThinkingMixin


class OCIModelConfig(LLMBaseConfig, RetryMixin, OptimizableMixin, ThinkingMixin, SSLVerificationMixin, name="oci"):
    """OCI Generative AI LLM provider."""

    model_config = ConfigDict(protected_namespaces=(), extra="allow")

    endpoint: str | None = Field(
        default=None,
        validation_alias=AliasChoices("endpoint", "service_endpoint", "base_url"),
        description="OCI Generative AI service endpoint URL.",
    )
    compartment_id: str | None = Field(default=None, description="OCI compartment OCID for Generative AI requests.")
    auth_type: str = Field(default="API_KEY",
                           description="OCI SDK authentication type: API_KEY, SECURITY_TOKEN, INSTANCE_PRINCIPAL, "
                           "or RESOURCE_PRINCIPAL.")
    auth_profile: str = Field(default="DEFAULT",
                              description="OCI config profile to use for API_KEY or SECURITY_TOKEN auth.")
    auth_file_location: str = Field(default="~/.oci/config",
                                    description="Path to the OCI config file used for SDK authentication.")
    model_name: str = OptimizableField(validation_alias=AliasChoices("model_name", "model"),
                                       serialization_alias="model",
                                       description="The OCI Generative AI model ID.")
    provider: str | None = Field(default=None,
                                 description="Optional OCI provider override such as cohere, google, meta, or openai.")
    context_size: int | None = Field(
        default=1024,
        gt=0,
        description="The maximum number of tokens available for input.",
    )
    seed: int | None = Field(default=None, description="Random seed to set for generation.")
    max_retries: int = Field(default=10, description="The max number of retries for the request.")
    max_tokens: int | None = Field(default=None, gt=0, description="Maximum number of output tokens.")
    temperature: float | None = OptimizableField(
        default=None,
        ge=0.0,
        description="Sampling temperature to control randomness in the output.",
        space=SearchSpace(high=0.9, low=0.1, step=0.2))
    top_p: float | None = OptimizableField(default=None,
                                           ge=0.0,
                                           le=1.0,
                                           description="Top-p for distribution sampling.",
                                           space=SearchSpace(high=1.0, low=0.5, step=0.1))
    request_timeout: float | None = Field(default=None, gt=0.0, description="HTTP request timeout in seconds.")


@register_llm_provider(config_type=OCIModelConfig)
async def oci_llm(config: OCIModelConfig, _builder: Builder):
    yield LLMProviderInfo(config=config, description="An OCI Generative AI model for use with an LLM client.")
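The `AliasChoices` on `endpoint` and `model_name` mean a workflow config may spell those keys several ways (`model`, `service_endpoint`, `base_url`, and so on). A stdlib-only sketch of the first-alias-wins normalization, a simplification of what pydantic performs, not the toolkit's own code:

```python
# Simplified stand-in for pydantic's AliasChoices resolution on
# OCIModelConfig: the first alias present in the raw config wins.
ALIAS_CHOICES = {
    "endpoint": ("endpoint", "service_endpoint", "base_url"),
    "model_name": ("model_name", "model"),
}


def normalize(raw: dict) -> dict:
    out = dict(raw)
    for field, choices in ALIAS_CHOICES.items():
        for alias in choices:
            if alias in out:
                if alias != field:
                    out[field] = out.pop(alias)
                break  # first alias present wins, as with AliasChoices
    return out


print(normalize({"model": "nvidia/Llama-3.1-Nemotron-Nano-8B-v1", "base_url": "https://example"}))
```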

packages/nvidia_nat_core/src/nat/llm/register.py

Lines changed: 1 addition & 0 deletions

@@ -27,4 +27,5 @@
 from . import huggingface_llm
 from . import litellm_llm
 from . import nim_llm
+from . import oci_llm
 from . import openai_llm
