
At Fanap, we believe in building a better future based on responsibility, collaboration, tolerance, and innovation. As we move toward creating a platform for innovative and technology-driven solutions, we are looking for motivated and committed individuals to join us on this journey.
We are seeking an SRE Engineer to design, build, and maintain highly available, fault-tolerant, and scalable systems that support our next-generation digital services.
In this role, you will be responsible for ensuring system reliability, performance, and scalability across cloud and on-prem environments while driving automation, observability, and operational excellence.
Your Responsibilities:
• Participate in system design discussions with a strong focus on architecture awareness, monitoring, and operational reliability
• Design, build, and maintain highly available, scalable, and fault-tolerant systems
• Monitor systems end-to-end and actively identify, troubleshoot, and resolve production issues
• Lead and manage incident response, including mitigation, debugging, and root cause analysis
• Define, implement, and continuously improve SLIs, SLOs, and error budgets
• Develop automation to reduce operational toil and improve system reliability and efficiency
• Conduct and contribute to blameless postmortems, ensuring corrective actions are tracked and completed
• Support capacity planning, performance tuning, and system resilience activities
• Collaborate closely with development teams to ensure smooth release, deployment, and production stability
• Implement and improve monitoring, logging, and alerting systems
• Follow and promote reliability best practices across DevOps and infrastructure teams
• Ensure system stability, security, and availability across cloud environments
Requirements:
• At least 4 years of experience in software development, DevOps, SRE, or cloud operations
• Strong experience with Linux systems administration
• Solid knowledge of networking fundamentals and distributed systems
• Hands-on experience with Kubernetes and containerized environments (Docker)
• Experience with Infrastructure as Code (IaC) tools such as Terraform and Ansible
• Strong experience with monitoring and observability tools such as: (Prometheus, Grafana, ELK)
• Experience with distributed systems and data platforms such as: (Kafka, MySQL / PostgreSQL, NoSQL databases )
• Strong scripting and automation skills in at least one of (Python,Go,Shell scripting)
• Familiarity with CI/CD pipelines and Git-based workflows (e.g., GitLab, Jenkins)
• Strong knowledge of public cloud platforms (AWS or Azure), including core services such as compute, networking, storage, and IAM
• Familiarity with OpenStack architecture and concepts
If you are a motivated, responsible individual who is passionate about innovation and eager to play a role in building a better future, we look forward to receiving your resume
ثبت مشکل و تخلف آگهی
ارسال رزومه برای فناپ
مقایسه من با 102 متقاضی دیگر