CyberCoders logo

Director of Engineering- AI Cloud Infrastructure

CyberCoders
Department:Engineering Manager
Type:REMOTE
Region:USA
Location:Atlanta, GA
Experience:Mid-Senior level
Salary:$200,000 - $300,000
Skills:
AIMLHPCCLOUD COMPUTINGKUBERNETESSLURMOPENSTACKMAASNETBOXKVMREDFISHDEVOPSCI/CDINFRASTRUCTURE-AS-CODEDISTRIBUTED SYSTEMSAUTOMATIONNVIDIARDMAROCEINFINIBANDSDNEVPNVXLANBGPCLOSLLM
Share this job:

Job Description

Posted on: August 24, 2025

Title: Director of Engineering - AI Cloud Infrastructure Location: FULLY remote! Must be located in the US Salary: $200k-$300k+ Bonus and RSU Package Requirements: 10+ years of Engineering experience + at least 5 years of AI, ML, HPC and/or Cloud Computing environment. Must also have at least 5 years of leadership experience. If You Are an Engineering Leader with AI Cloud Experience, Please Read On! We're a fast-moving team building the next generation of AI infrastructure-designed from the ground up for scale, speed, and performance. Our platform powers some of the most advanced AI workloads in the world, combining high-density GPU clusters, cutting-edge networking, and smart orchestration tools. We operate Tier-3 data centers optimized for AI and HPC, and offer flexible hybrid cloud solutions that let teams move fast and build big. If you're an Engineering leader that is seasoned in the AI space and passionate about solving hard problems, working with world-class hardware and software, and shaping the future of AI infrastructure, we'd love to meet you. We are positioned extremely well for long term growth and reward our team. What You'll Do As Director of Engineering, you'll lead and grow a team of engineering managers and technical leads, fostering a culture of innovation and excellence. You'll oversee the design, deployment, and scaling of GPU-based AI infrastructure, while ensuring performance, reliability, and security. Your responsibilities include developing tools for provisioning and monitoring, implementing best practices like CI/CD and infrastructure-as-code, and managing change and incident response processes. You'll work closely with cross-functional teams to align infrastructure with business goals, and contribute to strategic planning, budgeting, and reporting to the CTO. Must Have BS/MS in Computer Science or related field. 10+ years in engineering, 5+ in leadership roles. Proven experience with cloud-scale AI/ML infrastructure (e.g., Kubernetes, Slurm). Familiarity with infrastructure tools (OpenStack, MaaS, Netbox, KVM, Redfish). Strong knowledge of distributed systems, cloud-native tech, and automation. Skilled in DevOps, observability, and software delivery pipelines. Bonus Points! Experience with NVIDIA clusters, RDMA, RoCE/Infiniband. Knowledge of SDN (EVPN/VXLAN, BGP, CLOS networks). Familiarity with LLM training/inference at scale. Background in AI platforms or cloud services. Offering Base of $200k-$300k + Bonus and generous RSU Package 5 weeks of PTO 401k w/ match comprehensive medical and supplemental benefits package fully remote! Email Your Resume In Word To Looking forward to receiving your resume through our website and going over the position with you. Clicking apply is the best way to apply, but you may also: sean.gur@cybercoders.com

  • Please do NOT change the email subject line in any way. You must keep the JobID: linkedin : SG6-1875138L706 -- in the email subject line for your application to be considered.***

Sean Gur - Lead Recruiter For this position, you must be currently authorized to work in the United States without the need for sponsorship for a non-immigrant visa. This job was first posted by CyberCoders on 08/22/2025 and applications will be accepted on an ongoing basis until the position is filled or closed. CyberCoders is proud to be an Equal Opportunity Employer All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, sexual orientation, gender identity or expression, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, status as a crime victim, disability, protected veteran status, or any other characteristic protected by law. CyberCoders will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable state and local law, including but not limited to the Los Angeles County Fair Chance Ordinance, the San Francisco Fair Chance Ordinance, and the California Fair Chance Act. CyberCoders is committed to working with and providing reasonable accommodation to individuals with physical and mental disabilities. If you need special assistance or an accommodation while seeking employment, please contact a member of our Human Resources team to make arrangements.

Originally posted on LinkedIn

Apply now

Please let the company know that you found this position on our job board. This is a great way to support us, so we can keep posting cool jobs every day!

RemoteITJobs.app logo

RemoteITJobs.app

Get RemoteITJobs.app on your phone!