Commit b747030

Merge pull request #10 from Sydney-Informatics-Hub/dgx-workshop-updates
edit the guide in preparation for the workshop
2 parents 812f3b5 + d1afc82 commit b747030

14 files changed

Lines changed: 99 additions & 50 deletions

_quarto.yml

Lines changed: 13 additions & 8 deletions
@@ -18,28 +18,33 @@ website:
  contents:
  - text: "Introduction to the SIH GPU Cluster"
  href: notebooks/dgx.html
+ - text: "Accessing the SIH GPU Cluster"
+ href: notebooks/access.html
  - section: "Getting Started with Run:ai"
  contents:
  - text: "Login"
  href: notebooks/login.html
  - text: "Navigating the User Interface"
  href: notebooks/user_interface.html
- - section: "How-to Guides"
+ - section: "Run:ai Features"
  contents:
- - text: "How to Use the Dashboards"
+ - text: "Dashboards"
  href: notebooks/dashboards.html
- - text: "How to Manage Projects"
+ - text: "Projects"
  href: notebooks/projects.html
- - text: "How to Configure Environments"
+ - text: "Environments"
  href: notebooks/environments.html
+ - text: "Data Sources"
+ href: notebooks/data_sources.html
+
+ - section: "Tutorials"
+ contents:
+ - text: "Creating a basic Jupyter Lab workload"
+ href: notebooks/jupyter_tutorial.html
  - text: "How to transfer data to/from the SIH GPU Cluster"
  href: notebooks/data_transfer.html
  - text: "How to run a terminal using the Command Line Interface (CLI)"
  href: notebooks/CLI.html
- - section: "Tutorials"
- contents:
- - text: "Creating a Jupyter Lab Workload"
- href: notebooks/jupyter_tutorial.html
  cookie-consent:
  type: express
  style: simple
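
When reviewing a sidebar change like the one to `_quarto.yml`, it can help to list every `href` and the source file it is expected to be rendered from, so that new entries such as `notebooks/access.html` and `notebooks/data_sources.html` can be checked against the files added in the commit. The following is a minimal sketch, not part of the commit: the `SIDEBAR` structure is transcribed by hand, its nesting is an assumption (the diff view does not preserve YAML indentation), and the helper names `iter_hrefs` and `source_candidates` as well as the candidate extensions are hypothetical.

```python
from typing import Iterator

# Sidebar entries as they read after this commit. The nesting is an
# assumption: the diff view does not preserve YAML indentation.
SIDEBAR = [
    {"text": "Introduction to the SIH GPU Cluster", "href": "notebooks/dgx.html"},
    {"text": "Accessing the SIH GPU Cluster", "href": "notebooks/access.html"},
    {"section": "Getting Started with Run:ai", "contents": [
        {"text": "Login", "href": "notebooks/login.html"},
        {"text": "Navigating the User Interface", "href": "notebooks/user_interface.html"},
    ]},
    {"section": "Run:ai Features", "contents": [
        {"text": "Dashboards", "href": "notebooks/dashboards.html"},
        {"text": "Projects", "href": "notebooks/projects.html"},
        {"text": "Environments", "href": "notebooks/environments.html"},
        {"text": "Data Sources", "href": "notebooks/data_sources.html"},
    ]},
    {"section": "Tutorials", "contents": [
        {"text": "Creating a basic Jupyter Lab workload", "href": "notebooks/jupyter_tutorial.html"},
        {"text": "How to transfer data to/from the SIH GPU Cluster", "href": "notebooks/data_transfer.html"},
        {"text": "How to run a terminal using the Command Line Interface (CLI)", "href": "notebooks/CLI.html"},
    ]},
]


def iter_hrefs(entries: list) -> Iterator[str]:
    """Yield every href in document order, descending into nested sections."""
    for entry in entries:
        if "href" in entry:
            yield entry["href"]
        yield from iter_hrefs(entry.get("contents", []))


def source_candidates(href: str) -> list:
    """Possible source files for a rendered page (the extensions are an assumption)."""
    stem = href.rsplit(".", 1)[0]
    return [stem + ext for ext in (".md", ".qmd", ".ipynb")]


if __name__ == "__main__":
    for href in iter_hrefs(SIDEBAR):
        print(href, "<-", source_candidates(href))
```

Comparing the printed candidates with the changed-file list makes it easy to spot a sidebar entry whose source page was not added.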
16.3 KB

fig/environment_predefined.png

41.6 KB

fig/project_example.png

156 KB

fig/project_hackathon.png

-151 KB
Binary file not shown.

index.qmd

Lines changed: 5 additions & 4 deletions
@@ -9,11 +9,12 @@ output:
  :::{.callout-note}
  The SIH GPU Cluster is managed by the [Sydney Informatics Hub](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub.html) at the University of Sydney. Contact <sih.info@sydney.edu.au> for more questions.
  :::
-
+ This Onboarding Guide is structured as follows:

  | | |
  | --- | :--- |
  | **Section** | **Content** |
- | Getting Started with the SIH GPU Cluster | Cluster fundamentals |
- | Getting Started with Run:ai | Run:ai fundamentals |
- | How-to Guides | Provide step-by-step instructions |
+ | Getting Started with the SIH GPU Cluster | Overview and basic information about the GPU Cluster |
+ | Getting Started with Run:ai | Introduction to Run:ai concepts and fundamentals |
+ | Run:ai Features | Step-by-step instructions for using various Run:ai features. More details can be found in the [official Run:ai user guide (version 2.18)](https://docs.run.ai/v2.18/Researcher/overview-researcher/) |
+ | Tutorials | Practical examples and demonstrations of different use cases |

notebooks/access.md

Lines changed: 10 additions & 0 deletions
@@ -0,0 +1,10 @@
+ # Accessing the SIH GPU Cluster
+
+ For early access to the SIH GPU Cluster, please directly contact the [Sydney Informatics Hub](https://www.sydney.edu.au/research/facilities/sydney-informatics-hub.html) with the following information:
+
+ - **DashR Research Project Shortcode**: Please log in to the [DashR portal](https://dashr.sydney.edu.au/projects), click on the project you intend to work on, and locate the *Project Short Code*. This is essential for linking your compute usage to the correct research allocation.
+ - **Billing Code**: This should be the internal billing code associated with your project or department. If you are unsure, please consult your supervisor or finance administrator.
+
+ :::{.callout-note}
+ **Work in Progress**: Long term project provisioning and access to the SIH GPU Cluster will be streamlined and directly managed on the university's [Researcher Dashboard (DashR)](https://dashr.sydney.edu.au/).
+ :::

notebooks/dashboards.md

Lines changed: 16 additions & 9 deletions
@@ -1,29 +1,35 @@
- # How to Use the Dashboards
+ # Run:ai Dashboards
  ## Overview
  This dashboard view provides holistic infrastructure information useful for both researchers and system administrators in managing and planning the resources.

  ### Indicators
  This section presents high-level statistics of the GPU computing resources
- ![](../fig/dashboards_overview_indicators.png)
+
+ ![The "Indicators" panel under "Overview"](../fig/dashboards_overview_indicators.png)

  ### Cluster Load
  Real-time monitoring of the cluster status in terms of GPU and CPU utilisation.
- ![](../fig/dashboards_overview_clusterload.png)
+
+ ![Monitoring system-wide workload](../fig/dashboards_overview_clusterload.png)

  ### Queueing
- Inspecting all queueing jobs. Possible reasons why jobs are queueing include:
+ Inspecting queueing jobs. Possible reasons why your jobs are queueing include:

- - The number of GPUs requested to be allocated to the job has exceeded the remaining GPUs in the project.
+ - The number of GPUs requested to be allocated to the job has exceeded the remaining GPU quota in the project.
+ - The GPU cluster is currently at full capacity and therefore has no available resources to schedule the job.
  - The job is waiting for other jobs to finish before it can be scheduled.
- ![](../fig/dashboards_overview_queueing.png)
+
+ ![Queueing jobs](../fig/dashboards_overview_queueing.png)

  ### Idle GPUs
  Displaying the number of idle GPUs currently allocated to running workloads.
- ![](../fig/dashboards_overview_idleGPUs.png)
+
+ ![Idle GPUs](../fig/dashboards_overview_idleGPUs.png)

  ### Running Workloads
  Summary of the list of running workloads.
- ![](../fig/dashboards_overview_workloads.png)
+
+ ![Running workloads](../fig/dashboards_overview_workloads.png)

  ## Analytics
  This dashboard provides more detailed breakdowns of the SIH GPU running status. Key
@@ -33,4 +39,5 @@ statistics that are reported at separate levels:
  - Project
  - Workloads
  - Nodes
- ![](../fig/dashboards_analytics.png)
+
+ ![System analytics](../fig/dashboards_analytics.png)
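
The three queueing reasons listed in the "Queueing" section of `notebooks/dashboards.md` can be read as a simple ordered check. The sketch below only restates that list in code for illustration; it is not Run:ai's scheduler logic and does not call any Run:ai API. The function name `queueing_reason` and all of its parameters are hypothetical.

```python
from typing import Optional


def queueing_reason(
    requested_gpus: int,
    project_quota_left: int,
    cluster_free_gpus: int,
    jobs_ahead: int = 0,
) -> Optional[str]:
    """Return the first matching reason from the guide's list, or None.

    Illustrative only: the inputs are assumed to be known to the caller,
    and the order of the checks is an assumption, not documented behaviour.
    """
    # Reason 1: the request exceeds the remaining GPU quota in the project.
    if requested_gpus > project_quota_left:
        return "request exceeds the remaining GPU quota in the project"
    # Reason 2: the cluster has too few free GPUs to schedule the job.
    if requested_gpus > cluster_free_gpus:
        return "cluster has no available resources to schedule the job"
    # Reason 3: other jobs must finish before this one can be scheduled.
    if jobs_ahead > 0:
        return "waiting for other jobs to finish"
    return None


if __name__ == "__main__":
    print(queueing_reason(requested_gpus=4, project_quota_left=2, cluster_free_gpus=8))
    print(queueing_reason(requested_gpus=2, project_quota_left=4, cluster_free_gpus=0))
    print(queueing_reason(requested_gpus=1, project_quota_left=4, cluster_free_gpus=8))
```

A return value of `None` means that none of the listed reasons applies to the given inputs.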

notebooks/data_sources.md

Lines changed: 1 addition & 0 deletions
@@ -0,0 +1 @@
+ # Data Sources
