Mahsan Co. is an Iranian Information Technology and Services company that designs, develops and sells IT solutions. We take pride in continuously improving our products, working with the best in the field, and keeping our customers secure.
We believe our employees are our most important asset. Therefore, by working with young, motivated and efficient people we build a fresh, creative and organized environment.
Server Virtualization (ESX, VMware and …) - Advanced
Elasticsearch - Intermediate
Docker - Advanced
Kubernetes - Advanced
Helm - Intermediate
Prometheus - Intermediate
Grafana - Intermediate
Ansible - Advanced
GitLab - Advanced
English Language - Upper Intermediate
Job Description
Main Responsibilities:
Automate infrastructure provisioning and management, using Terraform for provisioning and Ansible for configuration.
Build and maintain CI/CD pipelines to automatically deploy applications, integrating with Docker Compose services and preparing for Kubernetes deployments to enable zero-downtime releases.
Ensure system reliability and observability by implementing monitoring, logging, and alerting (SLIs/SLOs). Proactively identify issues and create mitigation strategies.
Participate in on-call rotations, perform root cause analyses (RCAs), and handle production issues across hybrid on-premises setups.
Script configurations for network and infrastructure devices.
Kubernetes migration and operations: Deploy and manage apps on Kubernetes clusters, optimizing for high availability, scaling, and auto-healing in multi-data-center environments.
Capacity planning and performance optimization: Monitor trends, plan for scale, and optimize workflows to support a very large number of concurrent connections and geo-redundant services.
Collaborate cross-functionally: Work with engineering teams to review designs, evangelize best practices, and contribute to runbooks and automation tools.
Develop custom automation and API-driven tasks, with coding proficiency in Python, Bash, or Go.
Required Skills:
5+ years of SRE or infrastructure engineering experience, with a proven track record in scripting and Git.
Hands-on expertise in Terraform (IaC for provisioning), Ansible (configuration management and orchestration), and CI/CD pipelines with GitLab CI.
Proficiency in containerization and orchestration: Docker Compose (deployment, scaling, troubleshooting).
Hands-on expertise in Git and in configuring network and infrastructure devices via scripts and code.
Server virtualization experience with any of VMware, Proxmox, or KVM.
Strong troubleshooting skills for production incidents, including log analysis, performance tuning, and disaster recovery.
Experience with monitoring and logging tools (Prometheus, Grafana, ELK) and automation to reduce toil.
Familiarity with object storage.
Familiarity with agile methodologies, on-call rotations, and SRE principles (error budgets, SLOs).