Job Description
We are searching for an experienced engineer to join our team and help us achieve our mission by delivering a rich feature set, high availability, and stellar performance. As we expand our infrastructure, we are improving our processes and pipelines. The selected candidate will manage and maintain our monitoring services, analyze and troubleshoot problems, and find appropriate resolutions for customers.
Responsibilities
- Manage day-to-day operations, monitoring alerts, servers, and backup platforms;
- Maintain and configure monitoring services to ensure reliability and uptime;
- Identify hardware, software, and environmental issues;
- Document problems, define solutions, prioritize problems, and assess the impact of issues;
- Perform or delegate regular backup operations and implement appropriate processes for data protection, disaster recovery, and failover procedures;
- Develop, implement, and maintain procedures to measure and track service performance and quality;
Requirements
- The ideal candidate should be self-motivated, proactive, and capable of multitasking, meeting deadlines, and working in a collaborative environment;
- Must be able to work in a 24/7 environment and work second/third shifts, weekends, and holidays;
- Must have at least 2 years of work experience as an NOC Engineer or in a related position;
- Excellent problem-solving mindset and the ability to diagnose complex technical issues;
- Detail-oriented and able to manage multiple projects;
- Strong communication and collaboration skills, which are essential for carrying out duties with others on the team;
- Good knowledge of Linux system management and administration, and an interest in continuous learning;
- Ample experience configuring and automating monitoring tools (Prometheus, Grafana, Zabbix, etc.);
- Knowledge of at least one logging stack (preferably ELK);
- Hands-on experience with networking principles (DNS, routing, firewalls, load balancing, etc.);
Preferred Qualifications
- Extensive knowledge of and experience in system automation, deployment, and implementation;
- Familiarity with CDN (Content Delivery Network) systems;
- Basic knowledge of container concepts;
- Familiarity with open-source services such as HAProxy, MySQL, Redis, and Memcached.