Job Details
Job Description
- Manage and maintain internal Kubernetes clusters and other business application deployment environments
- Improve, test, script, and document the many deployment options and permutations of on-premises Kubernetes and object stores
- Build new monitoring/administration solutions including architecture, provisioning, configuration and deployment
- Work closely with front-end, back-end, and systems teams to improve stability and uptime
- Respond to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring in production systems
- Perform periodic on-call duties
- Solve complex and difficult problems and build automation to prevent problem recurrence
- Participate in cloud service capacity planning and demand forecasting, software performance analysis, and system tuning
- Collaborate with members of partnering teams to define, design, and implement declarative specifications that form the basis of continuous solutions
- Follow and enforce DevOps recommended practices to ensure successful and streamlined production deployments
- Stay informed of new technologies
Job Requirements
- 8+ years of relevant professional experience, including 5+ years designing and implementing continuous solutions in a distributed systems environment, preferably in the cloud
- BS or MS degree in Computer Engineering, Computer Science, or equivalent education
- Experience designing, analyzing, and troubleshooting large-scale, highly performant distributed systems
- Experience deploying, managing, and troubleshooting microservice-based software applications (including web apps and integrated B2B solutions)
- Hands-on experience maintaining highly available infrastructure
- Extensive knowledge and hands-on experience in the following areas in public/private cloud environments:
  - Linux operating systems
  - Virtualization and provisioning technologies: Docker
  - Containerization technologies: Kubernetes and/or OpenShift
  - REST web services
  - Scripting languages: Shell scripting and/or Python
  - CI/CD tools: TeamCity, GitLab, Jenkins
  - Databases: PostgreSQL, including replication
- Experience in the following areas is a plus:
  - Kubernetes backup/restore: Velero
  - Artifact repository/Docker registry: Nexus Repository
  - Object storage: MinIO
  - Service mesh: Istio
  - Network load balancing/high availability: HAProxy
  - PostgreSQL backup/DR: Barman
  - Java technologies: Spring Boot, Spring Integration, Hibernate
  - Monitoring: Prometheus/Grafana, Nagios
  - Logging: Elastic Stack (Beats/Logstash, Elasticsearch, Kibana)
  - Processing engine: Apache Spark
  - Distributed/in-memory database: Apache Ignite
  - Message brokers: RabbitMQ
- Innovative infrastructure-as-code (IaC) mindset
- Strong analytical and problem-solving skills
- Systematic problem-solving approach, combined with a strong sense of ownership and drive
- Passion for technical leadership and mentoring
- Strong interpersonal skills
- Strong verbal and written communication skills