Sr. Site Reliability Engineer

September 5

🏡 Remote – New York

Apply Now
Logo of AKASA

AKASA

AKASA is building the future of healthcare with AI.

AI • Machine Learning • Revenue Cycle Management • Hospital Operations • Healthcare

201 - 500

Description

• Work closely with Infrastructure and Platform teams to integrate monitoring best practices • Develop high-quality runbooks for incident management • Build visualizations and alerting systems for system performance • Manage and optimize infrastructure using Terraform, GitHub CI/CD, and Kubernetes • Troubleshoot production issues and automate operational processes • Design and maintain core infrastructure for SaaS products • Proactively identify potential issues using telemetry data collection and monitoring tools • Collaborate with development teams to embed reliability in the software lifecycle

Requirements

• Proficient in visualizing, monitoring, and alerting on telemetry data • Experience with Docker, Kubernetes, Terraform, or similar technologies • 5+ years of professional experience using Python, Go, Java, or similar • Proficient with Linux and Unix Shell • Excellent collaboration and asynchronous communication skills • Committed to thorough documentation to streamline learning and processes • Proactive and enthusiastic attitude towards identifying and fixing issues • Ability to deliver quickly, iterate fast, and adapt to changing requirements • Proficient in using Git/GitHub for version control

Benefits

• Unlimited paid time off (PTO) • Expansive coverage for health, dental, and vision • Employer contribution to Health Savings Accounts (HSA) • Generous parental leave policy • Full employee coverage for life insurance • Company-paid holidays • 401(K) plan

Apply Now

Similar Jobs

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@techjobsnewyorkcity.com