Outlier Ventures

Senior HPC Infrastructure Engineer

Cudos

Other Engineering
United Kingdom
Posted on Oct 8, 2024

PLEASE NOTE: THIS ROLE REQUIRES THE INDIVIDUAL TO WORK WITHIN THE HOURS OF UTC+2/UTC+3. PLEASE DO NOT APPLY UNLESS YOU ARE AVAILABLE TO WORK DURING THESE HOURS; OTHERWISE, YOUR APPLICATION WILL BE REJECTED.

Are you passionate about high-performance computing (HPC) and ready to shape the future of cloud GPU infrastructure? Cudo Compute is a global leader in providing on-demand and reserved cloud GPUs for AI, machine learning, and HPC workloads. If you thrive on cutting-edge technologies and scalable systems, we want you on our team as a Senior HPC Infrastructure Engineer!

Why Cudo Compute?

At Cudo Compute, we're revolutionizing cloud computing by delivering powerful, scalable, and sustainable cloud infrastructure solutions for AI, machine learning, and scientific workloads. Join us in building a greener, more efficient cloud powered by advanced GPUs.

What You’ll Do:

  • Design & Build: Architect end-to-end HPC clusters tailored to various client needs and workloads.
  • Collaborate & Innovate: Work with pre-sales engineers, hardware suppliers, and data centre partners to design, expand, and optimize systems.
  • Manage Networking & Storage: Set up and maintain Ethernet, InfiniBand, and RoCE networks, along with storage nodes, for high-demand environments.
  • GPU Optimization: Set up and manage GPU-accelerated systems with cutting-edge hardware from Nvidia, Intel, and AMD.
  • Service Excellence: Act as the key escalation point for resolving customer issues and liaising with suppliers to ensure optimal performance.
  • Drive Innovation: Work closely with our software teams to integrate HPC features into Cudo's platforms and advocate for cutting-edge HPC advancements.
  • Stay Ahead: Keep up-to-date with the latest in HPC, cloud computing, and GPU technologies, particularly from Nvidia, Intel, and AMD.

What You Bring:

  • 5+ Years of HPC Engineering Experience: Proven track record in hands-on engineering within HPC environments.
  • Expertise in InfiniBand & RoCE: In-depth knowledge of these networking technologies.
  • Scripting & Automation: Proficient in automating deployment and ongoing maintenance tasks.
  • Bonus Points: Familiarity with Nvidia AI Enterprise and the Nvidia NGC Catalogue.
  • Awareness of product-management principles and a product-led approach.

What You’ll Get:

  • Competitive Salary
  • Remote Work: Flexibility to work from anywhere within the UTC±3 time zones
  • Impactful Work: Help revolutionize cloud computing with powerful, sustainable infrastructure
  • Cutting-Edge Tech: Work with the latest hardware and software from Nvidia, Intel, and AMD
  • Collaborative Culture: Join a mission-driven team passionate about making a global impact in the cloud and HPC space

About Cudo Compute

At Cudo Compute, we’re creating the next generation of cloud infrastructure: sustainable, high-performance, and GPU-optimized. Our platform powers innovation for AI, ML, and HPC, all while championing a greener, more efficient future. Join us in pushing the boundaries of what’s possible in cloud technology.

Ready to shape the future of cloud HPC? Apply today!