Route backend jobs to use correct image by tarrow · Pull Request #993 · wbstack/api

tarrow · 2025-11-06T12:33:28Z

This selects a pod that matches the selector[1]
for the appropriate backend service rather than
any random backend MediaWiki pod

[1] https://kubernetes.io/docs/concepts/services-networking/service/#services-in-kubernetes

Bug: T408624

tarrow · 2025-11-06T20:20:25Z

In addition to this "unit like" test with mocked k8s client responses I also tried to try it out on a kubernetes minikube setup. This time running in GKE - still incredibly painfully slow but at least without locking up my personal machine; didn't actually get as far as really testing it routes correctly.

I would do this by creating a couple of Wikis, one with a 139 and 143 DB. I would then manually dispatch app/Jobs/ProcessMediaWikiJobsJob.php and inspect the jobs that are fired off.

This selects a pod that matches the selector[1] for the appropriate backend service rather than any random backend MediaWiki pod [1] https://kubernetes.io/docs/concepts/services-networking/service/#services-in-kubernetes Bug: T408624

deer-wmde · 2025-11-07T16:08:36Z


-    public function handle(Client $kubernetesClient): void {
+    public function handle(Client $kubernetesClient, MediaWikiHostResolver $resolver): void {
+        $domain = $resolver->getBackendHostForDomain($this->wikiDomain);


while reading I confused this $domain var with a wiki domain, I'd suggest $mwBackendHost or something

Good idea; I picked a different name. I can see why you were confused for sure. I wanted to make it clear that this isn't a host like an ip but actually a domain for a service. Hopefully this makes sense now.

deer-wmde · 2025-11-07T16:34:43Z

+        $serviceName = $domain;
        $kubernetesClient->setNamespace('default');
+        $backendService = $kubernetesClient->services()->setLabelSelector([
+            'name' => $serviceName,


I'm not sure if it will work like this, as the name of the service seems more like mediawiki-143-app-backend than mediawiki-143-app-backend.default.svc.cluster.local ?

see kubectl get service mediawiki-143-app-backend -o yaml

yeah, you're totally right. I just took the first bit. Apparently also the LabelSelector was wrong and I should have used the field one with metadata.name. I basically only determined this my trial and error though.

tarrow · 2025-11-13T13:45:43Z

You might find you wanted to try this out in your local cluster but it depends a bit on this chart update being there e.g. this being merged (wmde/wbaas-deploy#2325). If you want to try out this new chart locally then you can do this to disable argo self heal and select a newer than used version of the chart:

kubectl patch application app-of-apps -n argocd --type='json' -p='[{"op": "replace", "path": "/spec/syncPolicy/automated/selfHeal", "value": false}]'
kubectl patch application api -n argocd --type='json' -p='[{"op": "replace", "path": "/spec/sources/0/targetRevision", "value": "0.34.0"}]'

deer-wmde · 2025-11-13T14:13:51Z

Even with the change in the chart I somehow still get this access error - is this something that works for you locally @tarrow ?

kubectl exec -ti deployments/api-app-backend -- php artisan job:dispatchNow ProcessMediaWikiJobsJob somewiki.wbaas.dev

In Client.php line 417:
                                                                                                                                                              
  Authentication Exception: {"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"services \"mediawiki-143-app-backend\" is forbidd  
  en: User \"system:serviceaccount:default:default\" cannot list resource \"services\" in API group \"\" in the namespace \"default\"","reason":"Forbidden",  
  "details":{"name":"mediawiki-143-app-backend","kind":"services"},"code":403}

deer-wmde · 2025-11-13T14:18:06Z

Even with the change in the chart I somehow still get this access error - is this something that works for you locally @tarrow ?

kubectl exec -ti deployments/api-app-backend -- php artisan job:dispatchNow ProcessMediaWikiJobsJob somewiki.wbaas.dev

In Client.php line 417:
                                                                                                                                                              
  Authentication Exception: {"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"services \"mediawiki-143-app-backend\" is forbidd  
  en: User \"system:serviceaccount:default:default\" cannot list resource \"services\" in API group \"\" in the namespace \"default\"","reason":"Forbidden",  
  "details":{"name":"mediawiki-143-app-backend","kind":"services"},"code":403}

I just checked, the clusterrole didnt get updated for some reason in my local cluster, I'll have a look

tarrow · 2025-11-13T14:35:29Z

can always k edit clusterrole api-defaultrole -o yaml to look like

if it helps

deer-wmde · 2025-11-13T14:51:30Z

I nuked my cluster because I didn't understand what was the cause, and indeed now the defaultrole looks better, but now I get an error for pods for some reason

In Client.php line 417:
                                                                                                                                                              
  Authentication Exception: {"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"pods is forbidden: User \"system:serviceaccount:default:default\" cannot list resource \"pods\" in API group \"\" in the namespace \"default\"","reason":"Forbidden","details":{"kind":"pods"},"code":403}

deer-wmde · 2025-11-13T15:32:58Z

Okay - turns out I was holding it wrong. The api chart only provides the api-queue deployments with the defaultrole-api serviceaccount, which effectively prevents dispatching this job synchronously (as intended). Dispatching it to the queue then worked as expected - now I only need to create a database state with two wiki versions, but this seems great so far

deer-wmde · 2025-11-13T16:26:15Z

can confirm it works! 🎉

* Route backend jobs to use correct image This selects a pod that matches the selector[1] for the appropriate backend service rather than any random backend MediaWiki pod [1] https://kubernetes.io/docs/concepts/services-networking/service/#services-in-kubernetes Bug: T408624 * parse out service name from backend host * try getting service using field not label selector * fix pint

tarrow force-pushed the T408624 branch 3 times, most recently from 78fa82e to 530fb26 Compare November 6, 2025 19:46

tarrow marked this pull request as ready for review November 6, 2025 20:20

Route backend jobs to use correct image

488d041

This selects a pod that matches the selector[1] for the appropriate backend service rather than any random backend MediaWiki pod [1] https://kubernetes.io/docs/concepts/services-networking/service/#services-in-kubernetes Bug: T408624

tarrow force-pushed the T408624 branch from 530fb26 to 488d041 Compare November 6, 2025 20:22

deer-wmde reviewed Nov 7, 2025

View reviewed changes

tarrow added 3 commits November 12, 2025 20:23

parse out service name from backend host

87937d4

try getting service using field not label selector

35a638a

fix pint

2c197da

tarrow requested a review from deer-wmde November 12, 2025 20:48

deer-wmde approved these changes Nov 13, 2025

View reviewed changes

deer-wmde merged commit 03dad69 into main Nov 14, 2025
5 checks passed

deer-wmde deleted the T408624 branch November 14, 2025 11:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Route backend jobs to use correct image#993

Route backend jobs to use correct image#993
deer-wmde merged 4 commits into
mainfrom
T408624

tarrow commented Nov 6, 2025 •

edited

Loading

Uh oh!

tarrow commented Nov 6, 2025

Uh oh!

deer-wmde Nov 7, 2025 •

edited

Loading

Uh oh!

tarrow Nov 12, 2025

Uh oh!

deer-wmde Nov 7, 2025

Uh oh!

tarrow Nov 12, 2025

Uh oh!

tarrow commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

tarrow commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tarrow commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tarrow commented Nov 6, 2025

Uh oh!

deer-wmde Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tarrow Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

deer-wmde Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

tarrow Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

tarrow commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

tarrow commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

deer-wmde commented Nov 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tarrow commented Nov 6, 2025 •

edited

Loading

deer-wmde Nov 7, 2025 •

edited

Loading