Who We Are
Cloudinative builds and operates private cloud platforms for organizations that need control, security, and reliability. We design and run OpenStack-based cloud infrastructure, with automation, observability, and production-grade operations built in, and help teams deliver IaaS and managed cloud services at scale.
Role Overview
We’re looking for a Senior OpenStack Engineer to design, deploy, operate, and continuously improve our OpenStack-based private cloud platform.
If you’re passionate about open-source cloud platforms, infrastructure automation, and building scalable distributed systems that run in production (not just in a lab!), we’d like to talk.
Location: On-site | Tehran, Iran
Type: Full-time
What You’ll Do
- Deploy, configure, operate, and upgrade production-grade OpenStack environments (multi-node, HA).
- Design and implement highly available, multi-tenant cloud architectures.
- Manage and optimize Ceph clusters for block/object/file storage (RBD, RGW, CephFS).
- Automate provisioning, configuration, and day-2 operations using Ansible, Terraform, and/or Kolla-Ansible.
- Monitor, troubleshoot, and resolve complex production issues across compute, storage, and networking layers.
- Implement network virtualization using Neutron (OVS/OVN, VXLAN/VLAN, SDN concepts).
- Ensure platform security: hardening, patching, access controls, and baseline compliance practices.
- Improve observability: metrics, logging, alerting, dashboards, and incident response workflows.
- Collaborate with product/dev teams to optimize workloads, resource allocation, and performance.
- Drive capacity planning, performance tuning, upgrades, and disaster recovery strategies.
- Maintain clear documentation (runbooks, SOPs, architecture notes) and share knowledge within the team.
- Participate in the on-call rotation and take ownership of critical incidents.
What We’re Looking For
- 3+ years of hands-on production experience with OpenStack (deploying, operating, troubleshooting).
- Strong Linux administration skills (systemd, networking, storage, performance troubleshooting).
- Strong experience with Ceph (RBD/RGW/CephFS), including hands-on troubleshooting and tuning.
- Solid knowledge of virtualization: KVM, QEMU, libvirt.
- Practical ability to read and analyze OpenStack logs and trace issues across services.
- Automation mindset with experience in IaC and scripting (Ansible/Terraform; Python is a plus).
- Strong networking fundamentals: TCP/IP, VLAN/VXLAN, routing, load balancing, firewalls, security groups.
- Familiarity with containerization (Docker, Kubernetes) and how these tools integrate with the OpenStack ecosystem.
- Strong sense of ownership, reliability under pressure, and willingness to be on-call when needed.
- Good communication and documentation habits.
Bonus Experience
- OpenStack deployment frameworks and tooling: Kolla-Ansible, OpenStack-Ansible, Juju/Charmed OpenStack, TripleO.
- Advanced Neutron / dataplane topics: OVN deep-dive, BGP, DVR, SR-IOV, DPDK.
- OpenStack on Kubernetes patterns (e.g., OpenStack-Helm) or cloud-native operational models.
- Experience with multi-site / multi-region OpenStack deployments.
- Strong observability stack experience: Prometheus/Grafana, ELK/EFK, Loki, etc.
- Upstream contribution experience (OpenDev/GitHub) or active open-source involvement.
- Certifications (nice-to-have): COA, Red Hat OpenStack, RHCE/RHCSA, Kubernetes certifications, AWS/Azure.
- Experience in startup / telco / large-scale datacenter environments.