Operations guides specific to managing Sourcegraph with Kubernetes installations.
Trying to deploy Sourcegraph with Kubernetes? Refer to our installation guide.
Configure your Sourcegraph deployment with our deployment reference. Upgrade
Upgrade your deployment to the latest Sourcegraph release. Troubleshoot
Troubleshoot common issues with your Sourcegraph instance.
Refer to our installation guide for details on how to deploy Sourcegraph.
Migrating from another deployment type? Refer to our migration guides.
In general, Sourcegraph with Kubernetes is deployed by applying the Kubernetes manifests in our deploy-sourcegraph reference repository - see our configuration guide for more details.
We provide a kubectl-apply-all.sh script that you can use to do this, usually by running the following from the root directory of the deploy-sourcegraph reference repository:
./kubectl-apply-all.shNOTE: By default, this script applies our base manifests using
kubectl applywith a variety of arguments specific to the reference repository's layout. If you have specific commands that should be run whenever you apply your manifests, you should modify this script as needed. For example, if you use overlays to make changes to the manifests, you should modify this script to apply your generated cluster instead.
Once you have applied your changes:
-
Watch - verify your deployment has started:
kubectl get pods -A -o wide --watch
-
Port-foward - verify Sourcegraph is running by temporarily making the frontend port accessible:
kubectl port-forward svc/sourcegraph-frontend 3080:30080
-
Log in - browse to your Sourcegraph deployment, login, and verify the instance is working as expected.
We strongly recommend referring to our Configuration guide to learn about how to configure your Sourcegraph with Kubernetes instance.
- See the Updating Sourcegraph docs on how to upgrade.
- See the Updating a Kubernetes Sourcegraph instance docs for details on changes in each version to determine if manual migration steps are necessary.
List all pods in your cluster and the corresponding health status of each pod:
kubectl get pods -o=wideTail the logs for the specified pod:
kubectl logs -f $POD_NAMEIf Sourcegraph is unavailable and the sourcegraph-frontend-* pod(s) are not in status Running, then view their logs with kubectl logs -f sourcegraph-frontend-$POD_ID (filling in $POD_ID from the kubectl get pods output). Inspect both the log messages printed at startup (at the beginning of the log output) and recent log messages.
Display detailed information about the status of a single pod:
kubectl describe $POD_NAMEList all Persistent Volume Claims (PVCs) and their statuses:
kubectl get pvcList all Persistent Volumes (PVs) that have been provisioned. In a healthy cluster, there should be a one-to-one mapping between PVs and PVCs:
kubectl get pvList all events in the cluster's history:
kubectl get eventsDelete failing pod so it gets recreated, possibly on a different node:
kubectl delete pod $POD_NAMERemove all pods from a node and mark it as unschedulable to prevent new pods from arriving
kubectl drain --force --ignore-daemonsets --delete-local-data $NODERestarting Sourcegraph Instance:
kubectl rollout restart deployment sourcegraph-frontendGet the id of one pgsql Pod:
kubectl get pods -l app=pgsql
NAME READY STATUS RESTARTS AGE
pgsql-76a4bfcd64-rt4cn 2/2 Running 0 19mMake sure you are operating under the correct namespace (i.e. add -n prod if your pod is under the prod namespace).
Open a PostgreSQL interactive terminal:
kubectl exec -it pgsql-76a4bfcd64-rt4cn -- psql -U sgRun your SQL query:
SELECT * FROM users;NOTE: To execute an SQL query against the database without first creating an interactive session (as below), append
--command "SELECT * FROM users;"to the docker container exec command.
The following instructions are specific to backing up and restoring the sourcegraph databases in a Kubernetes deployment. These do not apply to other deployment types.
WARNING: Only core data will be backed up.
These instructions will only back up core data including user accounts, configuration, repository-metadata, etc. Other data will be regenerated automatically:
- Repositories will be re-cloned
- Search indexes will be rebuilt from scratch
The above may take a while if you have a lot of repositories. In the meantime, searches may be slow or return incomplete results. This process rarely takes longer than 6 hours and is usually much faster.
NOTE: In some places you will see
$NAMESPACEused. Add-n $NAMESPACEto commands if you are not using the default namespace More kubectl configuration options can be found here: kubectl Cheat Sheet
These instructions will back up the primary sourcegraph database and the codeintel database.
A. Verify deployment running
kubectl get pods -AB. Stop all connections to the database by removing the frontend deployment
kubectl scale --replicas=0 deployment/sourcegraph-frontend
# or
kubectl delete deployment sourcegraph-frontendC. Check for corrupt database indexes. If amcheck returns errors, please reach out to support@sourcegraph.com
create extension amcheck;
select bt_index_parent_check(c.oid, true), c.relname, c.relpages
from pg_index i
join pg_opclass op ON i.indclass[0] = op.oid
join pg_am am ON op.opcmethod = am.oid
join pg_class c ON i.indexrelid = c.oid
join pg_namespace n ON c.relnamespace = n.oid
where am.amname = 'btree'
-- Don't check temp tables, which may be from another session:
and c.relpersistence != 't'
-- Function may throw an error when this is omitted:
and i.indisready AND i.indisvalid;D. Generate the database dumps
kubectl exec -it $pgsql_POD_NAME -- bash -c 'pg_dump -C --username sg sg' > sourcegraph_db.out
kubectl exec -it $codeintel-db_POD_NAME -- bash -c 'pg_dump -C --username sg sg' > codeintel_db.outEnsure the sourcegraph_db.out and codeintel_db.out files are moved to a safe and secure location.
The following instructions apply only if you are restoring your databases into a new deployment of Sourcegraph ie: a new virtual machine
If you are restoring a previously running environment, see the instructions for restoring a previously running deployment
A. Copy the database dump files (eg. sourcegraph_db.out and codeintel_db.out) into the root of the deploy-sourcegraph directory
B. Start the database services by running the following command from the root of the deploy-sourcegraph directory
kubectl rollout restart deployment pgsql
kubectl rollout restart deployment codeintel-dbC. Copy the database files into the pods by running the following command from the root of the deploy-sourcegraph directory
kubectl cp sourcegraph_db.out $NAMESPACE/$pgsql_POD_NAME:/tmp/sourcegraph_db.out
kubectl cp codeintel_db.out $NAMESPACE/$codeintel-db_POD_NAME:/tmp/codeintel_db.outD. Restore the databases
kubectl exec -it $pgsql_POD_NAME -- bash -c 'psql -v ERROR_ON_STOP=1 --username sg -f /tmp/sourcegraph_db.out sg'
kubectl exec -it $codeintel-db_POD_NAME -- bash -c 'psql -v ERROR_ON_STOP=1 --username sg -f /tmp/condeintel_db.out sg'E. Check for corrupt database indexes. If amcheck returns errors, please reach out to support@sourcegraph.com
create extension amcheck;
select bt_index_parent_check(c.oid, true), c.relname, c.relpages
from pg_index i
join pg_opclass op ON i.indclass[0] = op.oid
join pg_am am ON op.opcmethod = am.oid
join pg_class c ON i.indexrelid = c.oid
join pg_namespace n ON c.relnamespace = n.oid
where am.amname = 'btree'
-- Don't check temp tables, which may be from another session:
and c.relpersistence != 't'
-- Function may throw an error when this is omitted:
and i.indisready AND i.indisvalid;F. Start the remaining Sourcegraph services by following the steps in applying manifests.
A. Stop the existing deployment by removing the frontend deployment
kubectl scale --replicas=0 deployment/sourcegraph-frontend
# or
kubectl delete deployment sourcegraph-frontendB. Remove any existing volumes for the databases in the existing deployment
kubectl delete pvc pgsql
kubectl delete pvc codeintel-db
kubectl delete pv $pgsql_PV_NAME --force
kubectl delete pv $codeintel-db_PV_NAME --forceC. Copy the database dump files (eg. sourcegraph_db.out and codeintel_db.out) into the root of the deploy-sourcegraph directory
D. Start the database services only
kubectl rollout restart deployment pgsql
kubectl rollout restart deployment codeintel-dbE. Copy the database files into the pods by running the following command from the root of the deploy-sourcegraph directory
kubectl cp sourcegraph_db.out $NAMESPACE/$pgsql_POD_NAME:/tmp/sourcegraph_db.out
kubectl cp codeintel_db.out $NAMESPACE/$codeintel-db_POD_NAME:/tmp/codeintel_db.outF. Restore the databases
kubectl exec -it $pgsql_POD_NAME -- bash -c 'psql -v ERROR_ON_STOP=1 --username sg -f /tmp/sourcegraph_db.out sg'
kubectl exec -it $codeintel-db_POD_NAME -- bash -c 'psql -v ERROR_ON_STOP=1 --username sg -f /tmp/condeintel_db.out sg'G. Check for corrupt database indexes. If amcheck returns errors, please reach out to support@sourcegraph.com
create extension amcheck;
select bt_index_parent_check(c.oid, true), c.relname, c.relpages
from pg_index i
join pg_opclass op ON i.indclass[0] = op.oid
join pg_am am ON op.opcmethod = am.oid
join pg_class c ON i.indexrelid = c.oid
join pg_namespace n ON c.relnamespace = n.oid
where am.amname = 'btree'
-- Don't check temp tables, which may be from another session:
and c.relpersistence != 't'
-- Function may throw an error when this is omitted:
and i.indisready AND i.indisvalid;H. Start the remaining Sourcegraph services by following the steps in applying manifests.
See the Troubleshooting docs.