K8SPG-740 fix error message by nmarukovich · Pull Request #1411 · percona/percona-postgresql-operator

nmarukovich · 2026-01-19T20:49:27Z

CHANGE DESCRIPTION

Problem:
Short explanation of the problem.
2025-03-04T11:01:27.724Z ERROR failed to cleanup outdated backups

Cause:
Short explanation of the root cause of the issue if applicable.
We encountered an error when attempting to delete backups while the repohost pod was not ready.
We've fixed this by checking the repohost pod status before attempting backup cleanup.

Solution:
Short explanation of the solution we are providing with this PR.

CHECKLIST

Jira

Is the Jira ticket created and referenced properly?
Does the Jira ticket have the proper statuses for documentation (Needs Doc) and QA (Needs QA)?
Does the Jira ticket link to the proper milestone (Fix Version field)?

Tests

Is an E2E test/test case added for the new feature/change?
Are unit tests added where appropriate?

Config/Logging/Testability

Are all needed new/changed options added to default YAML files?
Are all needed new/changed options added to the Helm Chart?
Did we add proper logging messages for operator actions?
Did we ensure compatibility with the previous version or cluster upgrade process?
Does the change support oldest and newest supported PG version?
Does the change support oldest and newest supported Kubernetes version?

egegunes · 2026-01-20T06:05:42Z

+	if repoCondition == nil || repoCondition.Status != metav1.ConditionTrue {
+		log.Info("pgBackRest repo host not ready, skipping backup cleanup")
+		return nil
+


unnecessary empty line

…rator into K8SPG-740

pooknull · 2026-01-26T09:30:46Z

 		return errors.Wrap(err, "reconcile backup jobs")
 	}

+	repoCondition := meta.FindStatusCondition(cr.Status.Conditions, postgrescluster.ConditionRepoHostReady)


I think we should move this check to cleanupOutdatedBackups method

github-actions

Remaining comments which cannot be posted as a review comment to avoid GitHub Rate Limit

shfmt

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1581 in 4c08f48

[ $? -eq 0 ] || return 1

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1583 to 1584 in 4c08f48

    
           echo "MachineConfig created" 
        
           echo "Waiting for worker pool to update (~10 minutes)..."

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1586 to 1589 in 4c08f48

    
           kubectl wait --for=condition=Updated mcp/worker --timeout=900s 2>/dev/null || { 
        
               echo "Update taking longer than expected" 
        
               return 1 
        
           }

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1591 in 4c08f48

echo "Worker pool updated"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1593 to 1594 in 4c08f48

    
           sleep 10 
        
           verify_hugepages_on_nodes

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1598 in 4c08f48

echo "Verifying hugepages on nodes"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1600 to 1605 in 4c08f48

    
               # Get first worker node, fallback to first non-master, fallback to any node 
        
               local node_name=$( 
        
                   kubectl get nodes -l node-role.kubernetes.io/worker -o jsonpath='{.items[0].metadata.name}' 2>/dev/null || \ 
        
                   kubectl get nodes -l '!node-role.kubernetes.io/master,!node-role.kubernetes.io/control-plane' -o jsonpath='{.items[0].metadata.name}' 2>/dev/null || \ 
        
                   kubectl get nodes -o jsonpath='{.items[0].metadata.name}' 
        
               )

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1607 to 1610 in 4c08f48

    
           if [ -z "${node_name}" ]; then 
        
               echo "No nodes found" 
        
               return 1 
        
           fi

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1612 in 4c08f48

echo "Checking node: ${node_name}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1614 to 1615 in 4c08f48

    
               local hugepages_capacity=$(kubectl get node ${node_name} \ 
        
                   -o jsonpath='{.status.capacity.hugepages-2Mi}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1617 to 1623 in 4c08f48

    
           if [ -n "${hugepages_capacity}" ] && [ "${hugepages_capacity}" != "0" ]; then 
        
               echo "Node has hugepages capacity: ${hugepages_capacity}" 
        
               return 0 
        
           else 
        
               echo "No hugepages capacity found on node ${node_name}" 
        
               return 1 
        
           fi

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1627 to 1629 in 4c08f48

    
           local pod_name=$1 
        
           local namespace=$2 
        
           local container=${3:-postgres}

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1631 in 4c08f48

echo "Verifying hugepages in pod ${pod_name}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1633 to 1635 in 4c08f48

    
               # Check /proc/meminfo 
        
               local hugepages_total=$(kubectl exec ${pod_name} -n ${namespace} -c ${container} -- \ 
        
                   grep HugePages_Total /proc/meminfo | awk '{print $2}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1637 to 1638 in 4c08f48

    
               local hugepages_free=$(kubectl exec ${pod_name} -n ${namespace} -c ${container} -- \ 
        
                   grep HugePages_Free /proc/meminfo | awk '{print $2}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1640 to 1641 in 4c08f48

    
           echo "HugePages_Total: ${hugepages_total}" 
        
           echo "HugePages_Free: ${hugepages_free}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1643 to 1649 in 4c08f48

    
           if [ "${hugepages_total}" -gt 0 ]; then 
        
               echo "Hugepages are available in pod" 
        
               return 0 
        
           else 
        
               echo "No hugepages in pod" 
        
               return 1 
        
           fi

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1653 to 1654 in 4c08f48

    
           local cluster_name=$1 
        
           local expected_value=${2:-try}

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1656 in 4c08f48

echo "Verifying PostgreSQL huge_pages setting..."

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1658 to 1660 in 4c08f48

    
               local huge_pages=$(run_psql_local \ 
        
                   "SHOW huge_pages;" \ 
        
                   "postgres:$(get_psql_user_pass ${cluster_name}-pguser-postgres)@$(get_psql_user_host ${cluster_name}-pguser-postgres)")

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1662 in 4c08f48

echo "huge_pages: ${huge_pages}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1664 to 1670 in 4c08f48

    
           if [[ "${huge_pages}" == *"${expected_value}"* ]]; then 
        
               echo "PostgreSQL huge_pages is set to '${expected_value}'" 
        
               return 0 
        
           else 
        
               echo "PostgreSQL huge_pages not set to '${expected_value}' (value: ${huge_pages})" 
        
               return 1 
        
           fi

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1674 to 1676 in 4c08f48

    
           local pod_name=$1 
        
           local namespace=$2 
        
           local container=${3:-database}

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1678 in 4c08f48

echo "Checking hugepages usage..."

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1680 to 1681 in 4c08f48

    
           kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \ 
        
               grep HugePages /proc/meminfo

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1683 to 1684 in 4c08f48

    
               local hugepages_total=$(kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \ 
        
                   grep HugePages_Total /proc/meminfo | awk '{print $2}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1686 to 1687 in 4c08f48

    
               local hugepages_free=$(kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \ 
        
                   grep HugePages_Free /proc/meminfo | awk '{print $2}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1689 in 4c08f48

local hugepages_used=$((hugepages_total - hugepages_free))

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1691 to 1694 in 4c08f48

    
           echo "" 
        
           echo "HugePages usage:" 
        
           echo "  Total: ${hugepages_total}" 
        
           echo "  Used:  ${hugepages_used}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1696 to 1703 in 4c08f48

    
               if [ "${hugepages_used}" -gt 0 ]; then 
        
                   echo "PostgreSQL is using hugepages" 
        
                   return 0 
        
               else 
        
                   echo "Hugepages available but NOT being used by PostgreSQL" 
        
                   return 1 
        
               fi 
        
           }

github-actions

Remaining comments which cannot be posted as a review comment to avoid GitHub Rate Limit

shfmt

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1581 in 4c08f48

[ $? -eq 0 ] || return 1

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1583 to 1584 in 4c08f48

    
           echo "MachineConfig created" 
        
           echo "Waiting for worker pool to update (~10 minutes)..."

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1586 to 1589 in 4c08f48

    
           kubectl wait --for=condition=Updated mcp/worker --timeout=900s 2>/dev/null || { 
        
               echo "Update taking longer than expected" 
        
               return 1 
        
           }

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1591 in 4c08f48

echo "Worker pool updated"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1593 to 1594 in 4c08f48

    
           sleep 10 
        
           verify_hugepages_on_nodes

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1598 in 4c08f48

echo "Verifying hugepages on nodes"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1600 to 1605 in 4c08f48

    
               # Get first worker node, fallback to first non-master, fallback to any node 
        
               local node_name=$( 
        
                   kubectl get nodes -l node-role.kubernetes.io/worker -o jsonpath='{.items[0].metadata.name}' 2>/dev/null || \ 
        
                   kubectl get nodes -l '!node-role.kubernetes.io/master,!node-role.kubernetes.io/control-plane' -o jsonpath='{.items[0].metadata.name}' 2>/dev/null || \ 
        
                   kubectl get nodes -o jsonpath='{.items[0].metadata.name}' 
        
               )

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1607 to 1610 in 4c08f48

    
           if [ -z "${node_name}" ]; then 
        
               echo "No nodes found" 
        
               return 1 
        
           fi

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1612 in 4c08f48

echo "Checking node: ${node_name}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1614 to 1615 in 4c08f48

    
               local hugepages_capacity=$(kubectl get node ${node_name} \ 
        
                   -o jsonpath='{.status.capacity.hugepages-2Mi}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1617 to 1623 in 4c08f48

    
           if [ -n "${hugepages_capacity}" ] && [ "${hugepages_capacity}" != "0" ]; then 
        
               echo "Node has hugepages capacity: ${hugepages_capacity}" 
        
               return 0 
        
           else 
        
               echo "No hugepages capacity found on node ${node_name}" 
        
               return 1 
        
           fi

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1627 to 1629 in 4c08f48

    
           local pod_name=$1 
        
           local namespace=$2 
        
           local container=${3:-postgres}

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1631 in 4c08f48

echo "Verifying hugepages in pod ${pod_name}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1633 to 1635 in 4c08f48

    
               # Check /proc/meminfo 
        
               local hugepages_total=$(kubectl exec ${pod_name} -n ${namespace} -c ${container} -- \ 
        
                   grep HugePages_Total /proc/meminfo | awk '{print $2}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1637 to 1638 in 4c08f48

    
               local hugepages_free=$(kubectl exec ${pod_name} -n ${namespace} -c ${container} -- \ 
        
                   grep HugePages_Free /proc/meminfo | awk '{print $2}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1640 to 1641 in 4c08f48

    
           echo "HugePages_Total: ${hugepages_total}" 
        
           echo "HugePages_Free: ${hugepages_free}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1643 to 1649 in 4c08f48

    
           if [ "${hugepages_total}" -gt 0 ]; then 
        
               echo "Hugepages are available in pod" 
        
               return 0 
        
           else 
        
               echo "No hugepages in pod" 
        
               return 1 
        
           fi

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1653 to 1654 in 4c08f48

    
           local cluster_name=$1 
        
           local expected_value=${2:-try}

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1656 in 4c08f48

echo "Verifying PostgreSQL huge_pages setting..."

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1658 to 1660 in 4c08f48

    
               local huge_pages=$(run_psql_local \ 
        
                   "SHOW huge_pages;" \ 
        
                   "postgres:$(get_psql_user_pass ${cluster_name}-pguser-postgres)@$(get_psql_user_host ${cluster_name}-pguser-postgres)")

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1662 in 4c08f48

echo "huge_pages: ${huge_pages}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1664 to 1670 in 4c08f48

    
           if [[ "${huge_pages}" == *"${expected_value}"* ]]; then 
        
               echo "PostgreSQL huge_pages is set to '${expected_value}'" 
        
               return 0 
        
           else 
        
               echo "PostgreSQL huge_pages not set to '${expected_value}' (value: ${huge_pages})" 
        
               return 1 
        
           fi

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1674 to 1676 in 4c08f48

    
           local pod_name=$1 
        
           local namespace=$2 
        
           local container=${3:-database}

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1678 in 4c08f48

echo "Checking hugepages usage..."

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1680 to 1681 in 4c08f48

    
           kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \ 
        
               grep HugePages /proc/meminfo

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1683 to 1684 in 4c08f48

    
               local hugepages_total=$(kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \ 
        
                   grep HugePages_Total /proc/meminfo | awk '{print $2}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1686 to 1687 in 4c08f48

    
               local hugepages_free=$(kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \ 
        
                   grep HugePages_Free /proc/meminfo | awk '{print $2}')

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Line 1689 in 4c08f48

local hugepages_used=$((hugepages_total - hugepages_free))

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1691 to 1694 in 4c08f48

    
           echo "" 
        
           echo "HugePages usage:" 
        
           echo "  Total: ${hugepages_total}" 
        
           echo "  Used:  ${hugepages_used}"

[shfmt] _{reported by reviewdog 🐶}

percona-postgresql-operator/e2e-tests/functions

Lines 1696 to 1703 in 4c08f48

    
               if [ "${hugepages_used}" -gt 0 ]; then 
        
                   echo "PostgreSQL is using hugepages" 
        
                   return 0 
        
               else 
        
                   echo "Hugepages available but NOT being used by PostgreSQL" 
        
                   return 1 
        
               fi 
        
           }

egegunes · 2026-02-03T08:50:29Z

+	repoCondition := meta.FindStatusCondition(cr.Status.Conditions, postgrescluster.ConditionRepoHostReady)
+	if repoCondition == nil || repoCondition.Status != metav1.ConditionTrue {
+		log.Info("pgBackRest repo host not ready, skipping backup cleanup")
+		return nil
+	}


since this function is not using repo host, i am confused how this fixes the issue

Original error full stacktrace here :

ERROR failed to cleanup outdated backups {"controller": "perconapgcluster", "controllerGroup": "pgv2.percona.com", "controllerKind": "PerconaPGCluster", "PerconaPGCluster": {"name":"cl uster1","namespace":"pg2502"}, "namespace": "pg2502", "name": "cluster1", "reconcileID": "bf3f58e4-3d58-4112-b6ec-6af38241dcb7", "error": "get pgBackRest info: pgBackRest info command failed with code 99: other", "errorVerb ose": "pgBackRest info command failed with code 99:

We have

info, err = pgbackrest.GetInfo(ctx, readyPod, repo.Name)

we try to do

which executes pgbackrest info --repo=repo1

If repohost is not ready, we got an error in this case.

does this assume that repo1 is a PVC and stored in repo host? what happens if repo1 is s3?

yes, we assume that repo1 pvc,
well we can rewrite check a bit if repo1 is pvc and repohost is not ready, let me know if it will be ok for you.
if repo1 is s3, yee, you right, we will wait until repohost is ready (we don't need to wait it) and delete backups on the next iteration.

I've verified that RepoHost is only used for pvc. When using S3/Azure repos, pgBackRest connects directly to cloud storage.
Therefore, we should only wait for RepoHost to be ready when the repository type is volume.

JNKPercona · 2026-02-12T17:27:28Z

Test Name	Result	Time
backup-enable-disable	passed	00:07:02
builtin-extensions	passed	00:05:02
custom-envs	passed	00:17:57
custom-extensions	passed	00:13:34
custom-tls	passed	00:04:58
database-init-sql	passed	00:02:22
demand-backup	passed	00:22:04
finalizers	passed	00:03:54
init-deploy	passed	00:03:07
huge-pages	passed	00:02:56
monitoring	passed	00:07:10
monitoring-pmm3	passed	00:08:06
one-pod	passed	00:05:43
operator-self-healing	passed	00:08:00
pitr	passed	00:11:44
scaling	passed	00:04:43
scheduled-backup	passed	00:24:15
self-healing	passed	00:08:18
sidecars	passed	00:02:39
standby-pgbackrest	passed	00:11:40
standby-streaming	passed	00:09:12
start-from-backup	passed	00:10:43
tablespaces	passed	00:06:55
telemetry-transfer	passed	00:03:33
upgrade-consistency	passed	00:05:40
upgrade-minor	passed	00:05:25
users	passed	00:04:50

Summary	Value
Tests Run	27/27
Job Duration	01:17:58
Total Test Time	03:41:46

commit: cab2526
image: perconalab/percona-postgresql-operator:PR-1411-cab252661

nmarukovich added 2 commits January 19, 2026 11:30

K8SPG-740 fix error message

68a0313

fix import

c6ee631

nmarukovich requested review from egegunes, gkech, hors, mayankshah1607, oksana-grishchenko and pooknull as code owners January 19, 2026 20:49

Merge branch 'main' into K8SPG-740

75642a8

egegunes requested changes Jan 20, 2026

View reviewed changes

nmarukovich added 2 commits January 20, 2026 12:36

fix PR comments

5b9e2d2

Merge branch 'K8SPG-740' of github.com:percona/percona-postgresql-ope…

e337bf8

…rator into K8SPG-740

nmarukovich requested a review from egegunes January 20, 2026 11:38

egegunes previously approved these changes Jan 21, 2026

View reviewed changes

oksana-grishchenko previously approved these changes Jan 22, 2026

View reviewed changes

pooknull reviewed Jan 26, 2026

View reviewed changes

nmarukovich dismissed stale reviews from oksana-grishchenko and egegunes via 4c08f48 February 2, 2026 12:55

nmarukovich requested review from eleo007, jvpasinatto and valmiranogueira as code owners February 2, 2026 12:55

github-actions Bot reviewed Feb 2, 2026

View reviewed changes

nmarukovich force-pushed the K8SPG-740 branch 2 times, most recently from 5441cbd to e337bf8 Compare February 2, 2026 13:13

fix PR comments

e147f22

egegunes reviewed Feb 3, 2026

View reviewed changes

nmarukovich added 2 commits February 10, 2026 16:48

fix condition

507c716

fix condition delete space

948c793

nmarukovich requested a review from egegunes February 10, 2026 15:55

nmarukovich requested review from oksana-grishchenko and pooknull February 10, 2026 15:55

egegunes approved these changes Feb 11, 2026

View reviewed changes

nmarukovich added 2 commits February 11, 2026 17:45

Merge branch 'main' into K8SPG-740

0d31128

Merge branch 'main' into K8SPG-740

4d85a26

gkech approved these changes Feb 12, 2026

View reviewed changes

egegunes added this to the v2.9.0 milestone Feb 12, 2026

Merge branch 'main' into K8SPG-740

cab2526

nmarukovich merged commit 6cdcf3d into main Feb 12, 2026
16 checks passed

nmarukovich deleted the K8SPG-740 branch February 12, 2026 18:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

K8SPG-740 fix error message#1411

K8SPG-740 fix error message#1411
nmarukovich merged 11 commits into
mainfrom
K8SPG-740

nmarukovich commented Jan 19, 2026 •

edited by atlassian Bot

Loading

Uh oh!

egegunes Jan 20, 2026

Uh oh!

pooknull Jan 26, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

github-actions Bot left a comment

Uh oh!

egegunes Feb 3, 2026

Uh oh!

nmarukovich Feb 10, 2026

Uh oh!

egegunes Feb 10, 2026

Uh oh!

nmarukovich Feb 10, 2026

Uh oh!

nmarukovich Feb 10, 2026

Uh oh!

JNKPercona commented Feb 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

	echo "MachineConfig created"
	echo "Waiting for worker pool to update (~10 minutes)..."

	kubectl wait --for=condition=Updated mcp/worker --timeout=900s 2>/dev/null \|\| {
	echo "Update taking longer than expected"
	return 1
	}

	# Get first worker node, fallback to first non-master, fallback to any node
	local node_name=$(
	kubectl get nodes -l node-role.kubernetes.io/worker -o jsonpath='{.items[0].metadata.name}' 2>/dev/null \|\| \
	kubectl get nodes -l '!node-role.kubernetes.io/master,!node-role.kubernetes.io/control-plane' -o jsonpath='{.items[0].metadata.name}' 2>/dev/null \|\| \
	kubectl get nodes -o jsonpath='{.items[0].metadata.name}'
	)

	if [ -z "${node_name}" ]; then
	echo "No nodes found"
	return 1
	fi

	local hugepages_capacity=$(kubectl get node ${node_name} \
	-o jsonpath='{.status.capacity.hugepages-2Mi}')

	if [ -n "${hugepages_capacity}" ] && [ "${hugepages_capacity}" != "0" ]; then
	echo "Node has hugepages capacity: ${hugepages_capacity}"
	return 0
	else
	echo "No hugepages capacity found on node ${node_name}"
	return 1
	fi

	local pod_name=$1
	local namespace=$2
	local container=${3:-postgres}

	# Check /proc/meminfo
	local hugepages_total=$(kubectl exec ${pod_name} -n ${namespace} -c ${container} -- \
	grep HugePages_Total /proc/meminfo \| awk '{print $2}')

	local hugepages_free=$(kubectl exec ${pod_name} -n ${namespace} -c ${container} -- \
	grep HugePages_Free /proc/meminfo \| awk '{print $2}')

	echo "HugePages_Total: ${hugepages_total}"
	echo "HugePages_Free: ${hugepages_free}"

	if [ "${hugepages_total}" -gt 0 ]; then
	echo "Hugepages are available in pod"
	return 0
	else
	echo "No hugepages in pod"
	return 1
	fi

	local huge_pages=$(run_psql_local \
	"SHOW huge_pages;" \
	"postgres:$(get_psql_user_pass ${cluster_name}-pguser-postgres)@$(get_psql_user_host ${cluster_name}-pguser-postgres)")

	if [[ "${huge_pages}" == "${expected_value}" ]]; then
	echo "PostgreSQL huge_pages is set to '${expected_value}'"
	return 0
	else
	echo "PostgreSQL huge_pages not set to '${expected_value}' (value: ${huge_pages})"
	return 1
	fi

	local pod_name=$1
	local namespace=$2
	local container=${3:-database}

	kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \
	grep HugePages /proc/meminfo

	local hugepages_total=$(kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \
	grep HugePages_Total /proc/meminfo \| awk '{print $2}')

	local hugepages_free=$(kubectl -n ${namespace} exec ${pod_name} -c ${container} -- \
	grep HugePages_Free /proc/meminfo \| awk '{print $2}')

	echo ""
	echo "HugePages usage:"
	echo " Total: ${hugepages_total}"
	echo " Used: ${hugepages_used}"

	if [ "${hugepages_used}" -gt 0 ]; then
	echo "PostgreSQL is using hugepages"
	return 0
	else
	echo "Hugepages available but NOT being used by PostgreSQL"
	return 1
	fi
	}

Conversation

nmarukovich commented Jan 19, 2026 • edited by atlassian Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CHANGE DESCRIPTION

CHECKLIST

Uh oh!

egegunes Jan 20, 2026

Choose a reason for hiding this comment

Uh oh!

pooknull Jan 26, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

egegunes Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

nmarukovich Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

egegunes Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

nmarukovich Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

nmarukovich Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

JNKPercona commented Feb 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

nmarukovich commented Jan 19, 2026 •

edited by atlassian Bot

Loading