how to integrate load testing in cicd pipeline
what is deployment and daemon set .
what is ingress how to use .
two module a & b moudle a is creating ec2 instance module instance id how to to get instance id by module b
==========
lets say you have application java running micro services in kubernetes cluster from there to grafana
In prometheus grafana lets say we have distributed monitoring lsts say resources are spread across diff region how we can build now.
Suppose we have EKS cluster version X to upgrade upper version 3 node cluster while ensuring zero downtime? how did you do that?
Write a jenkins pipeline trigger from main branch Build & push to ACR before that in ci pipeline we need code scan onc done trigger to cd pipeline?
Write a terraform file for first create one resource group on top of it create resource group, vnet , 2 subnet , private endpoint?
we have nodeselector in particular pod going to deploy new pod without objection on particular nodespace so it cant restrict that pod
i want build a new nodepool for our aks cluster i want version memory disk pace OS version network taint and toleration how to achive using yaml file.
In AKS we have 3-4 pipeline dev uat test n prod once development done go for next step prd time we have approval gateway and deploy to prod.
master node build that
Github workflows & yaml exp migrating azure to githubaction flows , aks deployment .most deploy in aks.
==========
Questions: A jenkins pipeline takes 45 mins to build microservices Build how do you reduces it less tah 10 minutes without adding more agents.
Questions: Developers complains there is pipeline is getting failed no actionable log what do you change.
Questions: A microservices in kubernetes become slow during peak hours no code chnage what do you investigate?
Question: Production hotfix must go urgenty but multiple teams have ongoing change in developed branch?
Question: A deployment to production fail half way some pods are running older version some in new version what do you do?
Question: Jenkins pipeline is frequently fails due to intermitent network issue while pulling dependencies what your approach?
Question: your jenkins pipeline takes 20 mins before even starting unit test becasuse its download dependecies every time how do you fix it without modifying developer machine?
Question: Latency spike occured during high load metrics show cpu is fine but conetxt switching is high?
Question A micro services deployment causes API Gateway 502 error even through pods are healthy what your approach
Question: EC2 based application is slow during high traffic cpu is normal response time is high how to troubleshoot.
Question: Production application behind ALB app LB return 502 error intermitently how to troubleshoot.
Question: Your AWS bill jump from 1k to 4k in a month how do you identify the root cause and prevent for next time recurrence.
- Production AKS app went down at 2 AM. What is your immediate action plan?
- Pods are in CrashLoopBackOff after deployment. How will you troubleshoot?
- Deployment works in Dev but fails in Prod. What will you compare first?
- Terraform apply failed midway and some Azure resources are created. What next?
- Someone modified Azure resource manually which is managed by Terraform. How do you handle drift?
- AKS cannot pull image from ACR. How will you debug?
- Production pods CPU is 90 percent. What steps will you take?
- AKS node shows NotReady. What is your investigation approach?
- How will you design zero downtime deployment for critical application?
- Azure DevOps pipeline is successful but changes not reflecting in AKS. Where will you check?
- How will you structure Terraform for Dev, QA, and Prod environments?
- How do you secure secrets in Kubernetes for production workloads?
- Cluster autoscaler is not scaling nodes even when CPU is high. What will you verify?
- Terraform state file accidentally deleted. What are recovery steps?
- How will you configure private AKS cluster with restricted access?
- HPA configured but pods are not scaling. What could be wrong?
- Production deployment failed and business wants immediate rollback. What is your approach?
- How will you migrate manually created Azure infrastructure to Terraform?
- CI/CD pipeline taking 40 minutes. How will you optimize it?
- Multiple engineers working on same Terraform code. How do you prevent conflicts?
- How will you expose AKS app using Application Gateway instead of LoadBalancer?
- Ingress configured but traffic not reaching pods. What will you check?
- Node showing disk pressure error. How will you fix?
- How will you restrict production access only to specific IP addresses?
- Application needs persistent storage in AKS. What will you use and why?
- After new release, some users still see old version. What could be the issue?
- How will you implement Blue Green deployment in AKS?
- Security team says AKS cluster is publicly accessible. How will you secure it?
- How will you rotate secrets without downtime?
- There is complete production outage. How do you handle troubleshooting and stakeholder communication?