Nationality
Any Nationality
Gender
Not Mentioned
Vacancy
1 Vacancy
Job Description
Roles & Responsibilities
What Will You Do
- Design and maintain scalable, highly available, and fault-tolerant systems across multiple cloud providers (AWS, OCI).
- Lead incident response efforts, conducting blameless post-mortems and driving systemic improvements.
- Build and refine automated deployment pipelines, ensuring fast, safe, and repeatable delivery of changes.
- Implement robust observability frameworks (metrics, tracing, logging) to proactively detect and address performance issues.
- Collaborate with development teams to embed reliability into every stage of the software lifecycle.
- Optimize infrastructure costs while maintaining service quality.
- Drive chaos engineering experiments to validate system resilience.
- Document architecture, runbooks, and operational processes for internal and cross-team use.
What Are We Looking For
We're looking for a reliability-focused engineer with strong technical depth who thrives on solving complex operational challenges at scale. You must be hands-on with distributed systems, cloud-native platforms, and automation tools.
- Strong background in SRE principles (SLIs, SLOs, SLAs) and operational excellence.
- Experience with Kubernetes, container orchestration, and service mesh technologies.
- Proven expertise in infrastructure as code (Terraform, Ansible; Crossplane is optional) and automation scripting (Bash, Python, Go).
- Deep understanding of monitoring and alerting systems (Prometheus/Grafana, ELK, Loki, Datadog, AWS CloudWatch).
- Skilled in cloud networking, load balancing, and API gateway management (NGINX, Kong, AWS API GW).
- Solid experience with relational and NoSQL databases in production (MySQL/PostgreSQL, MongoDB, DocumentDB, Redis).
- Familiarity with distributed tracing (Jaeger, OpenTelemetry) and chaos testing frameworks.
- Excellent troubleshooting skills and ability to resolve high-impact incidents under pressure.
Who Will Excel
- Candidates who have successfully operated high-traffic, mission-critical platforms in a cloud-native environment.
- Candidates who demonstrate strong collaboration and communication skills across engineering, product, and business teams.
- Candidates who bring a data-driven approach to performance tuning and capacity planning.
- Candidates who thrive in fast-paced, high-growth SaaS environments and embrace continuous improvement.
What We Offer You
We believe you will love working at Foodics!
- Highly competitive compensation packages, including bonuses and potential equity.
- Annual learning stipend and regular training to accelerate your career.
- Exposure to cutting-edge cloud technologies and large-scale distributed systems.
- A truly global team of over 30 nationalities in 14 countries.
- Autonomy, challenging goals, and the chance to directly impact the reliability of platforms serving millions.
Company Industry
- IT - Software Services
Department / Functional Area
- Engineering
Keywords
- Staff Site Reliability Engineer