Site Reliability Engineer

January 26

🏡 Remote – New York

Apply Now
Logo of MeridianLink

MeridianLink

Connecting You to Better: MeridianLink is the developer of the industry's first multi-channel loan origination system.

technology • financial services • banking • loan origination system • Loan Origination

501 - 1000

💰 $485M Post-IPO Debt on 2021-11

Description

• Working within a SRE team with a devops culture, you’re heavily involved in all things technical, including automating, troubleshooting, performance tuning, capacity planning, incident analysis and other systems and software related tasks to cloud and data center operations. Along with relevant IT experience, you bring a strong sense of ownership and a passion for learning new things. • The Site Reliability Engineer is part of the infrastructure operations team but will also work closely with software development teams to deliver infrastructure as a service that meets the growing and changing needs of the business. Specific responsibilities include: - Creates and manages infrastructure solutions that are re-usable and flexible across compute and storage of private and public infrastructure. - Supports production and lower environment systems in public, private, and hybrid deployments. - Participates in the continuous development, monitoring and troubleshooting of highly configurable and continuously deployable environments in public and private infrastructure. - Protects the company's data, tools, and information systems by adhering to Operational and Security policies and procedures throughout the service delivery lifecycle. - Troubleshoots infrastructure issues, creates and manages incidents and problems to prevent and reduce the impact of customer-affecting incidents. - Administers & manages storage systems, data backups, system redundancies, and disaster recovery systems. - Analyzes, installs, builds, modifies, and supports operating system templates and application stacks. - Understands the interrelationship of infrastructure components and applications. - Uses diagnostic tools to analyze bottlenecks and performance and with development and operations teams to provide feedback on site performance, monitoring, and overall stability of Meridianlink platform and products. - Creates internal documentation, playbooks, and other consolidated knowledge repositories that can help existing teams and future hired resources. - Participates in feature and story grooming planning sessions. - Participates in 24x7 on-call rotation and manage on-call rotations. - Automation of tasks for speed and consistency leveraging IaC.

Requirements

• Excellent understanding of hypervisor and infrastructure API technologies, private and public infrastructure solutions. (VMware, AWS, Azure) • Knowledge of public, private and hybrid infrastructure solutions and knowledge of cloud system engineering principles and considerations. • Familiar with RDBMS databases such as Microsoft SQL Server. • Knowledge of Agile Methodology and continuous integration/delivery processes. • Familiarity with revision control systems • Must be able to work independently and as part of a team on multiple overlapping projects. • Strong problem solving and analytical skills with an investigative mindset. This means not only solving problems effectively but also investigating its root cause and tracing the various factors leading up to an incident as well as crafting a summary of all contributing factors. • Strong written and oral communication skills, including the ability to facilitate meeting discussions to identify effective solutions. • Possess a “postmortem culture” - investigate the facts and events leading up to an incident blamelessly to fine-tune the infrastructure for the future, preventing outages arising from the exact cause. • Good understanding of load/resource management, comprising load balancing, load shedding, and auto scaling. • Specializes in one or many operating systems or infrastructure technologies. • Must have the capacity to rebalance workloads with minimal supervision. • Must be able to work effectively with cross-functional departments with varying degrees of technical experience. • Minimum of 4 years of system engineering experience in a virtualized environment and/or Cloud environment. AWS experienced preferred. • 4 years of enterprise experience including monitoring and tuning multi-tiered server infrastructures or experience in distributed, highly available, high performance, scalable cloud applications. • Scripting skills with PowerShell and at least one other scripting language. • Experience with infrastructure automation and monitoring tools (Terraform/Puppet/Chef/Ansible). • Prior experience working with developers, DBAs Network Engineers to resolve bugs and performance problems. • Ability to understand application configuration that is running on the server and help developers leverage system features and functionality. • Experience with Kubernetes preferred.

Benefits

• Potential For Equity-Based Awards • Insurance coverage (medical, dental, vision, life, and disability) • Flexible paid time off • Paid holidays • 401(k) plan with company match • Remote work • All compensation and benefits are subject to the terms and conditions of the underlying plans or programs, as applicable and as may be amended, terminated, or superseded from time to time.

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@techjobsnewyorkcity.com