We are looking for a DevOps / Platform Engineer to design, automate, and operate a production-grade on-prem Kubernetes infrastructure focused on automation, GitOps, reliability, and scalability.
Responsibilities:
- Design and manage on-prem infrastructure using Terraform and Ansible
- Deploy and operate Kubernetes clusters and containerized workloads
- Implement and maintain GitOps workflows using ArgoCD
- Build and maintain CI/CD pipelines using GitLab CI
- Manage Kubernetes networking and ingress configurations
- Monitor infrastructure and services using Prometheus, Grafana, and ELK stack
- Improve platform reliability, security, and automation
- Troubleshoot infrastructure, networking, and Kubernetes-related issues
Requirements:
- Strong Linux system administration skills
- Hands-on experience with Kubernetes in production environments
- Experience with:
- Terraform
- Ansible
- GitLab CI
- ArgoCD
- Good understanding of networking concepts (TCP/IP, DNS, routing, CNI)
- Experience with Cilium or Calico
- Familiarity with observability and logging stacks
- Strong troubleshooting and problem-solving skills
- Understanding of distributed systems and high-availability concepts
Nice to Have:
- Experience with Kubespray
- On-prem Kubernetes environments
- Service mesh technologies (Istio / Linkerd)
- Vault integration with Kubernetes
- Experience with high-availability and disaster recovery practices