AI Tools Site Reliability Engineer - Graphic

June 27

🏡 Remote – New York

Apply Now
Logo of MyShell.ai

MyShell.ai

Democratizing & Decentralizing AI-native apps

AI • WEB3 • Open Source • Creator Economy

11 - 50

Description

• We are seeking an experienced Site Reliability Engineer to join our team and manage AI tools like ControlNet, ensuring efficient, stable operation and continuous system performance optimization. • Responsibilities: - System Maintenance & Monitoring: Oversee daily operations of AI tools, including server, database, and network maintenance. Monitor system performance and address issues promptly. - Deployment & Release: Manage deployment and version releases of AI tools. Implement CI/CD processes for automated deployments. - Troubleshooting: Address and resolve issues in AI tools, analyze logs and monitoring data to find root causes, and propose solutions. - Performance Optimization: Enhance deployment architecture, improve efficiency and stability, and implement performance tuning strategies. - Security Management: Ensure tool security, conduct regular assessments, fix vulnerabilities, and implement data protection and backup strategies. - Collaboration & Documentation: Work closely with development teams, contribute to system design and optimization, and maintain operations documentation. • Plus Points: - Exceptional problem-solving abilities and strong communication skills. - Experience with AI or machine learning technologies and their integration into backend systems. - Contributions to open-source projects or a strong presence in the developer community. - Prior experience in a fast-paced startup environment.

Requirements

• Education: Bachelor's degree or higher in Computer Science, Software Engineering, or related fields. • Experience: 3+ years in operations engineering or related roles, with a preference for AI tools experience. • Skills: Proficiency in Linux, monitoring tools (e.g., Prometheus, Grafana, ELK), automation tools (e.g., Ansible, Puppet, Chef), scripting (e.g., Python, Shell), cloud platforms (e.g., AWS, Azure, GCP), and AI tools (e.g., ControlNet). • Other: Strong communication, teamwork, analytical and problem-solving skills. Ability to work under pressure, strong sense of responsibility, and a proactive attitude towards continuous learning.

Benefits

• Competitive salary and equity package, commensurate with experience and location. • Flexible working hours and a fully remote work environment, with the ability to collaborate effectively across time zones. • A dynamic and collaborative work environment that fosters innovation, growth, and professional development. • The opportunity to work on cutting-edge technologies and help shape the future of AI, transforming industries and making a global impact.

Apply Now

Similar Jobs

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@techjobsnewyorkcity.com