Senior Site Reliability Engineer - Databases

3 days ago

🏡 Remote – New York

Apply Now
Logo of Grafana Labs

Grafana Labs

Grafana Labs supports organizations’ monitoring, visualization and observability goals. 950,000+ active installations

Monitoring • Observability • Dashboards

501 - 1000

Description

• Support Grafana Cloud customers by improving database reliability • Own software configuration via Helm charts and Jsonnet • Engage in feature releases and ensure SLOs are met • Collaborate with engineering to enhance database stability and observability • Participate in incident response and customer communication

Requirements

• Strong engineering background (at least 6 years), focused on SRE roles (at least 3 years) • Good communication skills for technical conversations with engineers and customers • Experience with Kubernetes on AWS, GCP, or Azure, and IaC tools • Experience with Site Reliability Engineering, Large System Design, and Distributed Computing • Proficient in one or more programming languages (e.g. Go, Python, Java) • Knowledge of Linux internals, networking, cloud storage, and scaling • Excellent problem-solving and troubleshooting skills • Experience with blame-free Incident Response and high-quality PIRs • Strong autonomy and self-direction in an engineering team • Intellectually curious, transparent, action-oriented, and kind individuals are highly valued.

Benefits

• Equity • Bonus (if applicable) • Other benefits listed here.

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@techjobsnewyorkcity.com