Skip to content

Commit 1f62f2c

Browse files
authored
Merge pull request #113330 from dfitzmau/OSDOCS-16988-2
OSDOCS-16988-2: CQA PR 2 for INSTALL-1: Core Installation Concepts and Customizations
2 parents f4bd7b9 + 8040393 commit 1f62f2c

7 files changed

Lines changed: 68 additions & 62 deletions

installing/validation_and_troubleshooting/installing-troubleshooting.adoc

Lines changed: 13 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,11 +6,14 @@ include::_attributes/common-attributes.adoc[]
66

77
toc::[]
88

9-
To assist in troubleshooting a failed {product-title} installation, you can gather logs from the bootstrap and control plane machines. You can also get debug information from the installation program. If you are unable to resolve the issue using the logs and debug information, see xref:../../support/troubleshooting/troubleshooting-installations.adoc#determining-where-installation-issues-occur_troubleshooting-installations[Determining where installation issues occur] for component-specific troubleshooting.
9+
[role="_abstract"]
10+
To assist in troubleshooting a failed {product-title} installation, you can gather logs from the bootstrap and control plane machines. You can also get debug information from the installation program.
11+
12+
If you are unable to resolve the issue by using the logs and debug information, see "Determining where installation issues occur" in the _Additional resources_ section.
1013

1114
[NOTE]
1215
====
13-
If your {product-title} installation fails and the debug output or logs contain network timeouts or other connectivity errors, review the guidelines for xref:../../installing/install_config/configuring-firewall.adoc#configuring-firewall[configuring your firewall]. Gathering logs from your firewall and load balancer can help you diagnose network-related errors.
16+
If your {product-title} installation fails and the debug output or logs contain network timeouts or other connectivity errors, review the guidelines "Configuring your firewall" in the _Additional resources_ section. By gathering logs from your firewall and load balancer, you can diagnose network-related errors.
1417
====
1518

1619
== Prerequisites
@@ -23,10 +26,16 @@ include::modules/manually-gathering-logs-with-ssh.adoc[leveloffset=+1]
2326

2427
include::modules/manually-gathering-logs-without-ssh.adoc[leveloffset=+1]
2528

26-
include::modules/installation-getting-debug-information.adoc[leveloffset=+1]
29+
include::modules/installing-getting-debug-information.adoc[leveloffset=+1]
2730

2831
include::modules/restarting-installation.adoc[leveloffset=+1]
2932

3033
[role="_additional-resources"]
31-
.Additional resources
34+
[id="additional-resources_installing-troubleshooting"]
35+
== Additional resources
36+
37+
* xref:../../support/troubleshooting/troubleshooting-installations.adoc#determining-where-installation-issues-occur_troubleshooting-installations[Determining where installation issues occur]
38+
39+
* xref:../../installing/install_config/configuring-firewall.adoc#configuring-firewall[Configuring your firewall]
40+
3241
* xref:../../installing/overview/index.adoc#ocp-installation-overview[Installing an {product-title} cluster]

modules/installation-bootstrap-gather.adoc

Lines changed: 6 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -8,14 +8,11 @@
88
= Gathering logs from a failed installation
99

1010
[role="_abstract"]
11-
If you gave an SSH key to your installation program, you can gather data about
12-
your failed installation.
11+
If you provided an SSH key to your installation program, you can gather data about your failed installation.
1312

1413
[NOTE]
1514
====
16-
You use a different command to gather logs about an unsuccessful installation
17-
than to gather logs from a running cluster. If you must gather logs from a
18-
running cluster, use the `oc adm must-gather` command.
15+
You use a different command to gather logs about an unsuccessful installation than to gather logs from a running cluster. If you must gather logs from a running cluster, use the `oc adm must-gather` command.
1916
====
2017

2118
.Prerequisites
@@ -26,8 +23,7 @@ running cluster, use the `oc adm must-gather` command.
2623
2724
.Procedure
2825

29-
. Generate the commands that are required to obtain the installation logs from
30-
the bootstrap and control plane machines:
26+
. Generate the commands that are required to obtain the installation logs from the bootstrap and control plane machines:
3127
+
3228
** If you used installer-provisioned infrastructure, change to the directory that contains the installation program and run the following command:
3329
+
@@ -54,8 +50,8 @@ $ ./openshift-install gather bootstrap --dir <installation_directory> \
5450
where:
5551
+
5652
--
57-
* `installation_directory`:: Specify the same directory you specified when you ran `./openshift-install create cluster`. This directory contains the {product-title} definition files that the installation program creates.
58-
* `<bootstrap_address>`:: Specify the fully qualified domain name or IP address of the cluster's bootstrap machine.
53+
* `installation_directory`:: Specifies the same directory you specified when you ran `./openshift-install create cluster`. This directory contains the {product-title} definition files that the installation program creates.
54+
* `<bootstrap_address>`:: Specifies the fully qualified domain name or IP address of the cluster's bootstrap machine.
5955
* `<master_*_address>`:: For each control plane, or master, machine in your cluster, replace this placeholder with its fully qualified domain name or IP address.
6056
+
6157
[NOTE]
@@ -71,5 +67,4 @@ INFO Pulling debug logs from the bootstrap machine
7167
INFO Bootstrap gather logs captured here "<installation_directory>/log-bundle-<timestamp>.tar.gz"
7268
----
7369
+
74-
If you open a Red Hat support case about your installation failure, include
75-
the compressed logs in the case.
70+
If you open a Red Hat support case about your installation failure, include the compressed logs when opening a Red Hat support case.

modules/installation-getting-debug-information.adoc

Lines changed: 0 additions & 24 deletions
This file was deleted.
Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
// Module included in the following assemblies:
2+
//
3+
// *installing/installing-troubleshooting.adoc
4+
5+
:_mod-docs-content-type: PROCEDURE
6+
[id="installing-getting-debug-information_{context}"]
7+
= Getting debug information from the installation program
8+
9+
[role="_abstract"]
10+
You can choose between two methods to get debug information from the installation program.
11+
12+
.Procedure
13+
14+
* Look at debug messages from a past installation in the hidden `.openshift_install.log` file. To do this task, enter a command similar to the following example:
15+
+
16+
[source,terminal]
17+
----
18+
$ cat ~/<installation_directory>/.openshift_install.log
19+
----
20+
+
21+
For `<installation_directory>`, specify the same directory you specified when you ran `./openshift-install create cluster`.
22+
23+
* Change to the directory that contains the installation program and re-run the command with the `--log-level=debug` argument:
24+
+
25+
[source,terminal]
26+
----
27+
$ ./openshift-install create cluster --dir <installation_directory> --log-level debug
28+
----
29+
+
30+
For `<installation_directory>`, specify the same directory you specified when you ran `./openshift-install create cluster`.

modules/manually-gathering-logs-with-ssh.adoc

Lines changed: 9 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -4,10 +4,10 @@
44

55
:_mod-docs-content-type: PROCEDURE
66
[id="installation-manually-gathering-logs-with-SSH_{context}"]
7-
= Manually gathering logs with SSH access to your host(s)
7+
= Manually gathering logs with SSH access to your hosts
88

9-
Manually gather logs in situations where `must-gather` or automated collection
10-
methods do not work.
9+
[role="_abstract"]
10+
Manually gather logs in situations where `must-gather` or automated collection methods do not work.
1111

1212
[IMPORTANT]
1313
====
@@ -16,44 +16,39 @@ By default, SSH access to the {product-title} nodes is disabled on the {rh-opens
1616

1717
.Prerequisites
1818

19-
* You must have SSH access to your host(s).
19+
* You must have SSH access to your hosts.
2020
2121
.Procedure
2222

23-
. Collect the `bootkube.service` service logs from the bootstrap host using the
24-
`journalctl` command by running:
23+
. Collect the `bootkube.service` service logs from the bootstrap host by entering the `journalctl` command:
2524
+
2625
[source,terminal]
2726
----
2827
$ journalctl -b -f -u bootkube.service
2928
----
3029

31-
. Collect the bootstrap host's container logs using the podman logs. This is shown
32-
as a loop to get all of the container logs from the host:
30+
. Collect the container logs of the bootstrap host by using the podman logs. Podman logs are shown as a loop to get all of the container logs from the host.
3331
+
3432
[source,terminal]
3533
----
3634
$ for pod in $(sudo podman ps -a -q); do sudo podman logs $pod; done
3735
----
3836

39-
. Alternatively, collect the host's container logs using the `tail` command by
40-
running:
37+
. Alternatively, collect the container logs of the host by entering the `tail` command:
4138
+
4239
[source,terminal]
4340
----
4441
# tail -f /var/lib/containers/storage/overlay-containers/*/userdata/ctr.log
4542
----
4643
47-
. Collect the `kubelet.service` and `crio.service` service logs from the master
48-
and worker hosts using the `journalctl` command by running:
44+
. Collect the `kubelet.service` and `crio.service` service logs from the control plane and compute hosts using the `journalctl` command by running:
4945
+
5046
[source,terminal]
5147
----
5248
$ journalctl -b -f -u kubelet.service -u crio.service
5349
----
5450
55-
. Collect the master and worker host container logs using the `tail` command by
56-
running:
51+
. Collect the control plane and compute host container logs by entering the `tail` command:
5752
+
5853
[source,terminal]
5954
----

modules/manually-gathering-logs-without-ssh.adoc

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,11 +6,10 @@
66
[id="installation-manually-gathering-logs-without-SSH_{context}"]
77
= Manually gathering logs without SSH access to your host(s)
88

9-
Manually gather logs in situations where `must-gather` or automated collection
10-
methods do not work.
9+
[role="_abstract"]
10+
Manually gather logs in situations where `must-gather` or automated collection methods do not work.
1111

12-
If you do not have SSH access to your node, you can access the systems journal
13-
to investigate what is happening on your host.
12+
If you do not have SSH access to your node, you can access the systems journal to investigate what is happening on your host.
1413

1514
.Prerequisites
1615

modules/restarting-installation.adoc

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -6,21 +6,23 @@
66
[id="restarting-installation_{context}"]
77
= Reinstalling the {product-title} cluster
88

9+
[role="_abstract"]
910
If you are unable to debug and resolve issues in the failed {product-title} installation, consider installing a new {product-title} cluster. Before starting the installation process again, you must complete thorough cleanup.
10-
For a user-provisioned infrastructure (UPI) installation, you must manually destroy the cluster and delete all associated resources. The following procedure is for an installer-provisioned infrastructure (IPI) installation.
11+
12+
For a user-provisioned infrastructure installation, you must manually destroy the cluster and delete all associated resources. The following procedure is for an installer-provisioned infrastructure installation.
1113

1214
.Procedure
1315

1416
. Destroy the cluster and remove all the resources associated with the cluster, including the hidden installer state files in the installation directory:
1517
+
1618
[source,terminal]
1719
----
18-
$ ./openshift-install destroy cluster --dir <installation_directory> <1>
20+
$ ./openshift-install destroy cluster --dir <installation_directory>
1921
----
20-
<1> `installation_directory` is the directory you specified when you ran `./openshift-install create cluster`. This directory contains the {product-title}
21-
definition files that the installation program creates.
22+
+
23+
Where `<installation_directory>` is the directory you specified when you ran `./openshift-install create cluster`. This directory contains the {product-title} definition files that the installation program creates.
2224

23-
. Before reinstalling the cluster, delete the installation directory:
25+
. Before reinstalling the cluster, delete the installation directory by running a command similar to the following command:
2426
+
2527
[source,terminal]
2628
----

0 commit comments

Comments
 (0)