Job Details

View jobs in our app

Learn more about the app. Workinapps.com

Software Engineer - GenAI Evaluations, AiDP

2025-11-14 Apple, Inc. San Francisco,CA

Description:

Weekly Hours: 40

Role Number: 200616###-####

Summary

We are seeking a driven and analytical Software Engineer to join Apple's Generative AI Evaluations team. In this role, you will help define how we measure, monitor, and improve the performance of AI systems that power next-generation user experiences. You will design robust evaluation frameworks, translate cutting‑edge research into practical tooling, and collaborate closely with cross‑functional teams to ensure our GenAI solutions are trustworthy, efficient, and high‑quality. This is a unique opportunity to influence both the inner workings of Apple's AI platforms and the broader standard for evaluating generative applications at scale.

Responsibilities

Designing and developing platform features to help solution developers experiment and identify optimal configurations for delivering high quality GenAI applications.
Evaluating and analyzing the performance of GenAI applications, and actively collaborating with the team in driving performance improvements.
Translating the latest research into reliable and scalable evaluations that can deliver high quality experiences for our users.
Actively engaging in all aspects of feature development, from ideation and experimentation to deployment and maintenance.
Communicating complex technical topics effectively to a diverse audience.

Minimum Qualifications

Bachelor's degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field or experience.
2+ years of software engineering experience.
Programming skills in Python.
Experience developing scalable and robust services with FastAPI or similar frameworks.
Experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs), Retrieval Augmented Generation (RAG) or GenAI Agents.
Experience with evaluating and optimizing Generative AI platforms or applications.

Preferred Qualifications

Experience with GenAI RAG and Agent evaluation frameworks like RAGAS, DeepEvals, OpenEvals, AgentEvals or OpenAI Evals.
Familiarity with LLM Observability techniques and best practices.
Proven ability to comprehend, interpret, and apply cutting‑edge research into tangible applications.
Proven problem‑solving and leadership abilities, with the capacity to steer the team's research and build practical applications in a collaborative and fast‑paced environment.
Customer‑focused with strong business acumen, capable of translating business needs into impactful technical solutions and a proven history of successfully shipping products that drive significant outcomes.
Experience with cloud platforms like AWS, GCP, or Azure.
Knowledge of containerization and orchestration tools like Docker and Kubernetes.
Creative, collaborative and project focused with an ability to work hands‑on in multi‑functional teams.
Excellent communication skills with the ability to communicate with all stakeholders effectively, including senior leadership.
Master's in Computer Science, Artificial Intelligence, Machine Learning, or a related field.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (

#J-18808-Ljbffr

Job Details

View jobs in our app

Software Engineer - GenAI Evaluations, AiDP

Summary

Responsibilities

Minimum Qualifications

Preferred Qualifications

Apply for this Job

Registration Required

Login to Apply

You are leaving our site

Registration Required

Email this job to a friend

Job: Software Engineer - GenAI Evaluations, AiDP

Job Alert Sign Up

Add To Job Alert

Job Alert Updated

Email Customer Care