Job Details

Head of Cloud Inference

  2025-09-10     Confidential     San Francisco,CA  
Description:

Head of Cloud Inference

About the Company

Revolutionary provider of serverless infrastructure solutions

Industry
Computer Software

Type
Privately Held, VC-backed

About the Role

The Company is seeking a Head of Cloud Inference to lead the development of a next-generation platform that will transform the deployment of trained machine learning models. This role is pivotal in creating a GenAI cluster-level inference product that ensures seamless and high-performance deployment of customer GenAI models, regardless of the underlying hardware. The successful candidate will be responsible for team leadership and growth, driving the strategic vision for the product, and ensuring technical excellence in a fast-paced environment. Collaboration with engineering leaders to deliver integrated AI inference deployment solutions for enterprise customers is also a key aspect of the role. Applicants for the Head of Cloud Inference position at the company should have a minimum of 7 years' experience in people management and over 10 years in the field of cloud infrastructure. A strong background in developing production-quality, high-performance software, particularly in AI/ML infrastructure or model serving, is essential. The role requires a leader with a proven track record in managing large teams and a deep understanding of cloud infrastructure principles and large-scale system operations. The ideal candidate will be adept at fostering a culture of technical expertise and cutting-edge technology adoption, while also being committed to the success of the platform and its customers.

Travel Percent
Less than 10%

Functions

  • Information Technology
  • Engineering


Apply for this Job

Please use the APPLY HERE link below to view additional details and application instructions.

Apply Here

Back to Search