Online accounting software. Connects to all things business: accountants, bookkeepers, banks, enterprise & apps.
Accounting • SaaS • Banking • Invoicing • Design
July 22
🏡 Remote – New York
• Investigating operational surprises and supporting teams in post-incident activities
• Conducting in-depth incident analysis and maximizing post-incident learning across the organization
• Completing short-term reliability consultancy and enablement engagements, such as SLO reviews and facilitating pre-mortems
• Improving on-call health, uplifting observability and addressing any operational hotspots
• Identifying, planning and leading implementation of reliability uplift work and initiatives
• Supporting delivery of strategic features and initiatives with reliability and distributed systems expertise
• Observing and improving rituals and practices relating to production operations, incident response and incident learning
• Solid experience in logging, monitoring and observability of a highly distributed system
• Experience leading incident management, response and troubleshooting efforts, including critical, complex and high-severity incidents
• Experience with post-incident reviews, incident analysis and learning from incidents
• Experience working in a tech or product company with comparable scale and complexity
• Systems thinking: reasoning about how systems and components interact and how they respond to failure
• Proficiency in one or more object-oriented programming languages (C#, JavaScript, Java, Python, etc.) or experience with infrastructure-as-code (e.g. Terraform, CloudFormation)
• Experience in technical leadership and setting technical direction
• Experience in leading delivery of technical initiatives in an operational, site reliability or platform engineering capacity