Site Reliability Engineer

3 days ago

🏡 Remote – New York

Apply Now
Logo of Appspace

Appspace

Discover the easiest way to reach your workforce - at work, at home, or on the go.

IPTV • video walls • kiosks • room booking • dashboards

201 - 500

💰 Private Equity Round on 2019-12

Description

• Automating maintenance tasks for our Cloud Platform, therefore strong experience in Python and shell scripting is a must • Deploying new features and releases of our software into Kubernetes via Helm, so strong experience in Kubernetes and Helm is a must • Troubleshooting performance issues or errors thrown by the cloud platform or application, and either resolving the underlying cause, or forwarding your research to Engineering to address in the product • Actioning Request Tickets from other teams in support of their needs to enable and prepare for upcoming releases • Monitoring the application’s performance, uptime, and cloud infrastructure’s performance, looking for improvement opportunities, and proactively taking action to solve any negative trends before they become issues • Lead, Participate, or Execute within the incident management process when alerts fire, and quickly ascertain root cause, resolve the issue, and find new and creative solutions to prevent recurrence • Configure, Monitor, Research, and Evaluate workload performances both on Google Cloud Platform and Microsoft Azure Clouds • Collaborating with our Development and Quality Assurance teams to address issues in the product and platform • Documenting new or updating existing processes and procedures to share knowledge and improve on standardized approaches to solution

Requirements

• Must be able to learn new technologies quickly and a desire to be a life-long learner • Must communicate well and adapt to working well with others across different countries and cultures • Strong background in Containers, Kubernetes, Helm, Linux, Python coding, and some experience with Windows Server OS and MacOS are a must • Experience with Google Cloud Platform, Google Kubernetes Engine, Google Compute Engine, and Google Storage is highly desired, but comparable experience with AWS or Azure will be considered • Solid troubleshooting experience and the ability to reason through a process workflow to identify a fault or odd behavior (i.e., spending time following log trails) is a must • Experience with administering MySQL & MongoDB preferred • Experience with administering message brokering systems like RabbitMQ preferred • Must be flexible on occasionally attending “off-hour” meetings (we’re a global team supporting a global customer base!) • Open to quarterly travel up to 5%

Benefits

• Generous PTO • Flexible work schedules • Remote work opportunities • Paid company holidays • Appspace Quiet Fridays (No non-essential internal meetings scheduled) • A casual dress work environment

Apply Now

Similar Jobs

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@techjobsnewyorkcity.com