Job Details
Experience Needed:
Career Level:
Education Level:
Salary:
Job Categories:
Skills And Tools:
Job Description
- As a Senior Cloud Operations Engineer with Crossover, you will be a hands-on professional working with massive-scale cloud infrastructure.
- You will be a senior member of a shift based, 24x7 infrastructure support team for our Cloud-based applications.
- You will be the respondent for challenging system change requests from the business as well as handling communications during any system outages.
- You will configure and use infrastructure monitoring tools and proactively do in-depth root cause analysis on monitoring alerts to constantly improve the stability of the systems.
- You will have daily and weekly targets to work towards, and you will receive regular feedback and coaching related to your performance.
- You will also deliver technical feedback and specific coaching to others in your team.
KEY RESPONSIBILITIES
- Ensure the uptime of our multi-tenant infrastructure
- Perform capacity planning and optimize our public cloud computing costs
- Work closely with the engineering teams to improve our platforms and eliminate complexity from architecture and processes
- Configure and use state of the art monitoring tools to gather insights and then act upon the results
- Conduct incident response and in-depth root cause analysis
Job Requirements
- Bachelor's degree in Computer Science or related technical field involving IT or equivalent practical experience
- 3+ years of demonstrated experience managing and maintaining large-scale SaaS applications in one of the major platforms (Amazon Web Services, VMWare, GPC, IBM Cloud) and cloud orchestration tools (Kubernetes, Marathon, VMware, etc.)
- 3+ years of experience with Linux and/or Windows Server operating systems (strong understanding)
- Experience building and maintaining production systems on using EC2, RDS, S3, ELB, Cloud Formation, etc. and familiarity interacting with the Amazon APIs or VMWare / Azure experience
- Deep experience administering Linux (Centos, RHEL, Ubuntu) systems OR Windows Server systems
- Excellent knowledge of web application technology, (i.e. IIS, Tomcat, Apache, Nginx, Haproxy etc.)
- Experience with industry-standard monitoring tools
- Ability to debug and optimize code and automate routine tasks
Nice to have
- Experienced with declarative configuration management and provisioning tools like Ansible, Puppet or Chef
- Experience with Docker
- Databases experience: MySQL, MSSQL, Oracle, PostgreSQL
- Familiarity with ITIL processes, especially Incident, Change and Problem Management