This position covers all aspects of DevOps, SRE, and AWS management for our mission-critical 24x7 system. We strongly believe in the values of Infrastructure as Code and that "configuration management" is an anti-pattern. We are looking for someone who shares this vision and will handle provisioning, monitoring and maintenance through code wherever possible.
Build and maintain globe-spanning cloud infrastructure on AWS, primarily using Docker, Terraform and Kubernetes.
Automate manual tasks and ensure all resulting code is under source control.
Employ and maintain state-of-the-art monitoring and logging tools.
Build a CI/CD pipeline that ensures our customers never see bugs.
Build a fault-tolerant and highly available system that will survive any disaster short of the apocalypse.
Six years of hands-on experience with Cloud Operations/DevOps/SRE on any cloud provider (AWS, GCP, Azure).
Three years of experience specifically with AWS, especially VPC, ELB/ALB, EC2, S3 and IAM.
Three years of experience directly managing Kubernetes clusters.
One year of experience writing Terraform modules and/or CloudFormation templates.
Strong knowledge of TCP/IP networking, including routing, subnetting and load balancing.
Experience maintaining CI/CD pipelines for Docker-based microservices.
Strong background in Linux/Unix (Debian variants, Amazon Linux).
Proficient in Docker container management.
A passion for creating maintainable systems and an aversion to creating technical debt of any kind.