Infrastructure Operations Engineer (GPU Computing) - Enterprise AI

June 11

🏡 Remote – New York

Apply Now
Logo of Aethir

Aethir

Scalable Decentralized Cloud Infrastructure (DCI)

11 - 50

Description

• We are seeking a highly skilled and motivated Infrastructure Operations Engineer to join our dynamic team • As an integral member of the InfraOps team, you will play a key role in managing and optimizing our GPU-based compute infrastructure (across multiple locations and partners), ensuring maximum performance, scalability, and reliability

Requirements

• Experience in infrastructure operations, preferably in a DevOps or SRE role or Sales Engineering or Solution Architect role - focused on GPU compute • Proficiency in managing GPU-based compute infrastructure, including NVIDIA GPUs and CUDA programming • Strong expertise in Linux system administration and shell scripting (e.g., Bash, Python) • Experience with configuration management tools (e.g., Ansible, Chef, Puppet) and version control systems (e.g., Git) • Familiarity with containerization and orchestration technologies (e.g., Docker, Kubernetes) • Solid understanding of networking concepts, protocols, and troubleshooting techniques • Excellent analytical and problem-solving skills, with a proactive and results-oriented mindset • Effective communication skills and the ability to collaborate effectively with cross-functional teams. We operate in English, but speaking Mandarin as well is a big bonus as we have engineering teams in China and Southeast Asia • Experience with cloud computing platforms (e.g., AWS, Azure, GCP) and hybrid cloud architectures • Knowledge of HPC frameworks and job scheduling systems (e.g., Slurm, PBS Pro) • Familiarity with GPU-accelerated libraries and frameworks (e.g., TensorFlow, PyTorch, CUDA Toolkit) • Understanding of cybersecurity principles and practices, including encryption, access controls, and threat detection/prevention • Bonus if you know Web3 (cryptocurrency, tokenization of RWAs, mining/staking, etc.)

Benefits

• Competitive compensation structure (and flexible on fiat/token mix) • Can be flexible on benefits, depending on location and setup • Salary is also flexible depending on location and setup • Flexible work hours and remote work options

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@techjobsnewyorkcity.com