Job Details

Staff Engineer - ML Inference & Model Efficiency

2026-05-04 Cohere San Francisco, CA

Description:

A leading AI research firm in San Francisco is seeking a Member of Technical Staff specializing in Model Efficiency. In this role, you will improve LLM inference systems by tackling performance issues and collaborating with cross-functional teams. Ideal candidates have more than 5 years of coding experience in C++ or Python and a solid understanding of the LLM inference environment. This position offers a remote-friendly work model, a competitive salary, and extensive benefits, including a generous vacation policy.
