Site Reliability Engineer (SRE)

(153 days ago)

Huawei Technologies Services Iranian Co LTD

Tehran/ Sa'adat Abad

Full Time

Working days and hours

Saturday till Wednesday 08:00 to 17:00

Business trips

Facilities and Benefits

Transportation -Bonus -Health insurance -Gym facilities -Coffee shop -Occasional packages and gifts

درباره شرکت

Company Size

501 - 1000 employees

Industry

Telecom

Company Type

Branch of non - Iranian company / Embassy

Establishment year

1987

Ownership type

Privately held

توضیحات بیشتر

key Requirements

4 years experience in similar position

C# - Intermediate

C++ - Intermediate

Java - Intermediate

Python - Advanced

JavaScript - Advanced

Go - Advanced

Linux - Advanced

Cloud Security - Intermediate

Server Virtualization (ESX,VMware and …) - Advanced

CCNA - Intermediate

CCNP - Intermediate

Network+ - Intermediate

ShellScript - Advanced

Kubernetes - Advanced

Prometheus - Intermediate

Ansible - Advanced

language English-Upper Intermediate

Job Description

We are looking for a Site Reliability Engineer (SRE) to ensure the reliability, availability, and scalability of our systems. The role works closely with development teams to improve system resilience, automate operations, and respond to production incidents.(Software updates, bug fixes, and security patches).

Responsibilities

• Design, operate, and improve reliable, scalable, and fault-tolerant systems.
• Monitor system health and maintain alerting and observability dashboards.
• Participate in on-call rotations and incident response; perform root cause analysis and blameless postmortems.
• Define and manage SLAs, SLOs, and error budgets.
• Automate operational tasks and reduce manual toil.
• Perform capacity planning and resilience improvements (e.g., chaos engineering).
• Collaborate with development teams on deployments, updates, and security patches.
• Promote SRE best practices and contribute to reliability initiatives.

Requirements

• Bachelor’s degree in Computer Science or related field, or equivalent practical experience.
• 3+ years of experience in SRE, DevOps, systems, or software engineering.
• Experience with at least one programming language (e.g., C++, C#, Java, Python, JavaScript)
• Strong knowledge in one or more areas: networking, Linux, containers, storage, virtualization, cybersecurity, databases, or big data.
• Experience with Kubernetes/containers and cloud infrastructure.
• Proficiency with Infrastructure as Code (Terraform, Ansible, Puppet, Chef, etc.).
• Scripting and automation skills (Python, Shell, Go).
• Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK), automated O&M tools (e.g. Ansible, Terraform, Jenkins, etc.).
• Understanding of incident management, reliability engineering, and distributed systems.
• Strong problem-solving, communication, and teamwork skills.
• Hands-on with Spark, Hadoop, Fink, or ElasticSearch

Preferred:

• Cloud or networking certifications (AWS Solutions Architect, Azure Architect, Google Cloud Developer, HCIE, Cisco).
• ITIL certification or other relevant OPS certifications
• Experience with large-scale or high-availability systems

Job Requirements

Age

24 - 44 Years Old

Gender

Men / Women

Education

Bachelor| Computer and IT

Language

English| Upper Intermediate 70%

Software

Linux| Advanced

Server Virtualization (ESX,VMware and …)| Advanced

Kubernetes| Advanced

Cloud Security| Intermediate

Ansible| Advanced

Python| Advanced

ShellScript| Advanced

Go| Advanced

Prometheus| Intermediate

Network+| Intermediate

CCNA| Intermediate

CCNP| Intermediate

JavaScript| Advanced

C#| Intermediate

C++| Intermediate

Java| Intermediate

ثبت مشکل و تخلف آگهی

ارسال رزومه برای خدمات تکنولوژی هوآوی ایرانیان

این آگهی بسته شده است

مقایسه من با سایر متقاضیان

سوابق ارسال رزومه برای این شرکت

Site Reliability Engineer (SRE)

Company benefits

key Requirements

Job Description

Job Requirements