Skip to content

Commit 99206c2

Browse files
third pass, including removing docker steps
1 parent 9a1ccdc commit 99206c2

6 files changed

Lines changed: 18 additions & 53 deletions

File tree

content/learning-paths/servers-and-cloud-computing/llamaindex-rag-axion/_index.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
2-
title: Build RAG applications with LlamaIndex on a Google Cloud C4A Axion virtual machine
2+
title: Build RAG applications with LlamaIndex on a Google Cloud C4A virtual machine
33

4-
description: Set up LlamaIndex on Google Cloud C4A Axion Arm VMs running SUSE Linux to build browser-based Retrieval-Augmented Generation (RAG) applications using local LLMs, vector databases, and FastAPI.
4+
description: Set up LlamaIndex on Google Axion-based C4A Arm64 VMs running SUSE Linux to build browser-based Retrieval-Augmented Generation (RAG) applications using local LLMs, vector databases, and FastAPI.
55

66
minutes_to_complete: 30
77

content/learning-paths/servers-and-cloud-computing/llamaindex-rag-axion/background.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
title: Learn about LlamaIndex and Google Axion C4A for RAG applications
3-
description: Learn how LlamaIndex supports browser-based RAG applications on Google Axion-based C4A Arm instances.
2+
title: Learn about LlamaIndex and Google Cloud C4A for RAG applications
3+
description: Learn how LlamaIndex supports browser-based RAG applications on Google Axion-based C4A instances.
44
weight: 2
55

66
layout: "learningpathall"
@@ -24,12 +24,12 @@ LlamaIndex provides a unified framework with components such as:
2424
- Vector store integrations for scalable embedding search
2525
- LLM integrations for generating grounded responses
2626

27-
Running LlamaIndex on Google Axion C4A Arm-based infrastructure enables efficient execution of AI and RAG workloads by using multi-core Arm CPUs and optimized memory performance. This results in improved performance per watt, reduced infrastructure costs, and better scalability for browser-based AI applications and local inference pipelines.
27+
Running LlamaIndex on Google Cloud C4A Arm-based infrastructure enables efficient execution of AI and RAG workloads by using multi-core Arm CPUs and optimized memory performance. This results in improved performance per watt, reduced infrastructure costs, and better scalability for browser-based AI applications and local inference pipelines.
2828

2929
In this Learning Path, you'll use these components to build a browser-based RAG application that answers questions from custom documents.
3030

3131
## What you've learned and what's next
3232

33-
You've now learned about Google Cloud C4A Arm-based VMs and their performance advantages for AI and RAG workloads. You were also introduced to core LlamaIndex components including document ingestion, indexing pipelines, query engines, vector stores, and LLM integrations.
33+
You've now learned about Arm-based Google Cloud C4A VMs and their performance advantages for AI and RAG workloads. You were also introduced to core LlamaIndex components including document ingestion, indexing pipelines, query engines, vector stores, and LLM integrations.
3434

3535
Next, you'll create a firewall rule in Google Cloud Console to enable remote access to the browser-based LlamaIndex RAG application that you'll create in this Learning Path.

content/learning-paths/servers-and-cloud-computing/llamaindex-rag-axion/build-browser-rag-app.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -31,9 +31,9 @@ FastAPI
3131
3232
LlamaIndex
3333
34-
ChromaDB Vector Store
34+
ChromaDB vector store
3535
36-
Ollama Local LLM
36+
Ollama local LLM
3737
3838
Documents
3939
```
@@ -383,7 +383,7 @@ What is Google Cloud Axion?
383383

384384
The answers will appear directly in the browser interface.
385385

386-
## Add your own documents
386+
## (Optional) Add your own documents
387387

388388
After confirming that the application works, you can try adding your own documents.
389389

content/learning-paths/servers-and-cloud-computing/llamaindex-rag-axion/firewall.md

Lines changed: 4 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -9,9 +9,9 @@ layout: learningpathall
99

1010
## Allow inbound access to the LlamaIndex browser application
1111

12-
Create a firewall rule in Google Cloud Console to expose port 8000 for the browser-based LlamaIndex RAG application.
12+
Create a firewall rule in Google Cloud console to expose port 8000 for the browser-based LlamaIndex RAG application.
1313

14-
### Configure the firewall rule in Google Cloud Console
14+
### Configure the firewall rule in the Google Cloud console
1515

1616
To configure a firewall rule:
1717

@@ -26,11 +26,7 @@ To configure a firewall rule:
2626
![Google Cloud console Create firewall rule form with Name set to allow-llamaindex-port and Direction of traffic set to Ingress#center](images/network-rule.png "Configuring the allow-llamaindex-port firewall rule")
2727

2828
5. Under **Protocols and ports**, select **Specified protocols and ports**.
29-
6. Select the **TCP** checkbox. Port `8000` is used by the FastAPI server that backs the browser-based LlamaIndex RAG application. Enter:
30-
31-
```text
32-
8000
33-
```
29+
6. Select the **TCP** checkbox. For **Ports**, enter `8000`. Port `8000` is used by the FastAPI server that backs the browser-based LlamaIndex RAG application.
3430

3531
![Google Cloud console Protocols and ports section with TCP selected and port 8000 entered#center](images/network-port.png "Setting the LlamaIndex browser application port in the firewall rule")
3632

@@ -39,6 +35,6 @@ To configure a firewall rule:
3935

4036
## What you've accomplished and what's next
4137

42-
You've now created a firewall rule that exposes port 8000 for the browser-based LlamaIndex RAG application and port 22 for SSH. The firewall rule uses the network tag `allow-llamaindex-port`, which you'll attach to your virtual machine in the next section.
38+
You've now created a firewall rule that exposes port 8000 for the browser-based LlamaIndex RAG application and port 22 for SSH. You'll attach this firewall rule to your virtual machine in the next section.
4339

4440
Next, you'll create a Google Cloud C4A virtual machine and connect to it using SSH.

content/learning-paths/servers-and-cloud-computing/llamaindex-rag-axion/instance.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -11,16 +11,16 @@ layout: learningpathall
1111

1212
In this section, you'll create a Google Cloud C4A Arm-based virtual machine (VM). You'll use the `c4a-standard-4` machine type, which provides four vCPUs and 16 GB of memory. This VM will host your browser-based LlamaIndex RAG application.
1313

14-
### Configure the C4A virtual machine in Google Cloud Console
14+
### Configure the C4A virtual machine in the Google Cloud console
1515

1616
To create a virtual machine based on the C4A instance type in the console:
1717

18-
1. Navigate to the [Google Cloud Console](https://console.cloud.google.com/).
18+
1. Navigate to the [Google Cloud console](https://console.cloud.google.com/).
1919
2. Go to **Compute Engine** > **VM instances** and select **Create instance**.
2020
3. Under **Machine configuration**, populate fields such as **Instance name**, **Region**, and **Zone**.
2121
4. Set **Series** to `C4A`, then select `c4a-standard-4` for **Machine type**.
2222

23-
![Screenshot of the Google Cloud Console showing the Machine configuration section. The Series dropdown is set to C4A and the machine type c4a-standard-4 is selected.#center](images/gcp-vm.png "Configuring machine type to C4A in Google Cloud Console")
23+
![Screenshot of the Google Cloud console showing the Machine configuration section. The Series dropdown is set to C4A and the machine type c4a-standard-4 is selected.#center](images/gcp-vm.png "Configuring machine type to C4A in Google Cloud Console")
2424

2525
5. Under **OS and storage**, select **Change** and then choose an Arm64-based operating system image. For this Learning Path, select **SUSE Linux Enterprise Server**.
2626
6. For the license type, choose **Pay as you go**.
@@ -31,7 +31,7 @@ To create a virtual machine based on the C4A instance type in the console:
3131

3232
After the instance starts, select **SSH** next to the VM in the instance list to open a browser-based terminal session.
3333

34-
![Google Cloud Console VM instances page displaying running instance with green checkmark and SSH button in the Connect column#center](images/gcp-pubip-ssh.png "Connecting to a running C4A VM using SSH")
34+
![Google Cloud console VM instances page displaying running instance with green checkmark and SSH button in the Connect column#center](images/gcp-pubip-ssh.png "Connecting to a running C4A VM using SSH")
3535

3636
A new browser window opens with a terminal connected to your VM.
3737

content/learning-paths/servers-and-cloud-computing/llamaindex-rag-axion/setup-llamaindex-rag.md

Lines changed: 1 addition & 32 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ weight: 5
77
layout: learningpathall
88
---
99

10-
## Prepare the environment
10+
## Prepare the environment for a LLamaIndex RAG application
1111

1212
In this section, you'll prepare a Google Cloud Axion Arm64 VM for running a browser-based RAG application using LlamaIndex.
1313

@@ -60,37 +60,6 @@ Python 3.11.10
6060
pip 22.3.1 from /usr/lib/python3.11/site-packages/pip (python 3.11)
6161
```
6262

63-
<!-- ### (Optional) Install Docker
64-
65-
For this Learning Path, ChromaDB and Ollama run natively. For extended use, you can install Docker so that you can run containerized workloads alongside the RAG pipeline if needed:
66-
67-
```bash
68-
sudo zypper install -y docker
69-
sudo systemctl enable docker
70-
sudo systemctl start docker
71-
```
72-
73-
Verify Docker is running and add your user to the `docker` group so you don't need `sudo` for Docker commands:
74-
75-
```bash
76-
sudo systemctl status docker
77-
sudo usermod -aG docker $USER
78-
newgrp docker
79-
```
80-
81-
Test Docker:
82-
83-
```bash
84-
docker run hello-world
85-
```
86-
87-
The output is similar to:
88-
89-
```output
90-
Hello from Docker!
91-
This message shows that your installation appears to be working correctly.
92-
``` -->
93-
9463
### Create project directory
9564

9665
Create a project directory and a Python virtual environment. The virtual environment isolates the Python packages for this project from your system packages:

0 commit comments

Comments
 (0)