Key Responsibilities:
· Site Reliability Engineering (SRE):
Monitor and improve SLA/SLO compliance, manage downtime, and enhance service availability.
· DevOps Implementation:
Automate deployment processes, manage CI/CD, and collaborate with development teams.
· Linux Server Management:
Install, secure, and troubleshoot Linux servers.
· Research & Development (R&D):
Research new technologies, develop PoC solutions, and evaluate tools for operational improvement.
· Technical Strategy & Leadership:
Define DevOps/SRE strategy, align with business goals, and manage risk mitigation.
· Incident Management:
Lead 24/7 incident response and organize work shifts for continuous coverage.
· Process Improvement:
Develop best practices, optimize processes, and maintain documentation.
Requirements:
Technical Skills:
Proficiency in LPIC-1 (Linux Professional Institute Certification).
Strong knowledge of CCNA (Cisco Certified Network Associate).
Familiarity with ITIL 4 (IT Infrastructure Library).
Ability to set up CI/CD pipelines at a junior level.
Solid understanding of Database Management and SQL.
Familiarity with VM/Cloud Infrastructure concepts (e.g., AWS, Azure, VMware).
Understanding of reliability parameters for system uptime and performance.
Soft Skills:
Strong communication skills (both written and verbal).
Excellent problem-solving and conflict management abilities.
Attention to detail with high organizational skills.
Ability to work effectively in dynamic, collaborative environments.
ثبت مشکل و تخلف آگهی
ارسال رزومه برای بهسا (تابعه هلدینگ همراه اول)