SRE/DevOps Engineer
Agolo - Maadi, Cairo
Posted 2 years ago. 29 applicants for 1 open position.
Job Description
We are seeking an experienced DevOps / site reliability engineer to join our new engineering team in Cairo. The site reliability engineer will ensure that our suite of servers and databases operates smoothly.
As a site reliability engineer, you will:
- Establish monitoring tools to increase service reliability.
- Achieve continuous integration and deployment for all of our services.
- Create and develop tools/scripts to automate builds and backups.
- Take part in regular on-call duties and address operational issues.
- Engage in capacity planning and demand forecasting activities.
- Design, implement, and manage our cloud environments, including client environments.
- Meet with our clients to understand deployment requirements and limitations.
- Work with our product and sales teams to design and implement SLIs, SLAs, and SLOs.
- Identify projects that result in substantial cost savings or revenue.
- Implement Infrastructure as Code on Kubernetes using Helm and Terraform.
- Maintain our datastores: monitor load, design and implement backup and restore plans, and handle scaling and clustering (sharding/replication).
- Continuously enhance our monitoring and alerting to prevent incidents.
- Practice sustainable incident response and blameless postmortems.
Our interview process is as follows:
- A screening call (30 mins).
- 1-2 technical interviews with a member of the Reliability Engineering team (60 mins).
- Culture interview (60 mins).
- 3 reference calls.
- Final meeting with our VP of Engineering or CTO (60 mins).
Job Requirements
Required qualifications:
- 1-3 years of industry experience.
- Experience with at least one of the cloud providers: AWS, Azure, or GCP.
- Solid experience with container runtimes and orchestrators: Docker and Kubernetes.
- Comfort working with Linux and the Unix shell.
- Strong knowledge of HTTP and debugging networking issues within Kubernetes.
- Solid experience with continuous integration systems using Git and CircleCI.
- Confidence debugging and writing scripts in Python and Bash.
- Very strong verbal and written communication skills in English are a must.
- A strong sense of ownership and drive.
Preferred qualifications:
- Large-scale distributed systems: Kafka and Elasticsearch
- Operating microservice architectures
- Security
- Helm
- Monitoring tools: Prometheus
- Designing tests and quality assurance
- Infrastructure as code tools like Terraform