My client is searching for a talented engineer to work on ML/LLM inference and serving. They specialize in developing next-gen LLM fine-tuning and inference engines.
We are seeking a talented and motivated Software Engineer specializing in Machine Learning (ML) and Large Language Model (LLM) inference to join our dynamic ML Inference team. In this role, you will bridge the gap between AI/ML research and systems programming to build and enhance our next-generation LLM Inference Engine. You will play a crucial role in optimizing the performance, scalability, and efficiency of our LLM serving systems.
Key Responsibilities:
Develop and Enhance Inference Engine:
Performance Optimization:
Customer Collaboration:
Technical Leadership:
Infrastructure Development:
Qualifications:
Technical Skills:
Experience: