Site Reliability Engineer

September 7

🏡 Remote – New York

Apply Now
Logo of Cerbo EHR

Cerbo EHR

EHR + Practice Management + Patient Portal as innovative as you are!

Direct Pay • Functional Medicine • Concierge • Custom Forms • Patient Portal

11 - 50

Description

• Design, implement, and maintain scalable and reliable cloud infrastructure on AWS • Manage and optimize Kubernetes clusters using Amazon EKS • Develop and maintain Infrastructure as Code using Terraform • Implement and improve CI/CD pipelines using GitHub Actions and ArgoCD • Ensure system security and implement best practices • Monitor and optimize system performance using Grafana and Prometheus • Track our AWS spending and suggest ways to cut operating costs • Troubleshoot and resolve complex issues in production environments • Collaborate with development teams to improve application reliability and performance • Participate in On Call rotation with other SREs and engineering team members

Requirements

• Extensive experience with AWS services and best practices • Proficiency in managing Kubernetes clusters, particularly Amazon EKS • Strong knowledge of Helm for Kubernetes package management • Extensive experience with Infrastructure as Code, specifically Terraform • Familiarity with CI/CD pipelines, particularly GitHub Actions • Advanced Linux administration skills • Solid understanding of networking concepts and protocols • Experience in implementing and maintaining security best practices • Proficiency in using monitoring and observability tools, especially Grafana and Prometheus

Benefits

• Competitive compensation based on experience • Comprehensive health, dental and vision benefits • 401(k) plan with matching company contribution • Short-term disability & long-term disability insurance • Paid Time Off and company holidays • Full suite of remote working tools and processes

Apply Now

Similar Jobs

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@techjobsnewyorkcity.com