Checksum is a high-growth startup revolutionizing software testing with AI-powered, self-healing QA automation. Checksum's platform eliminates the need for manual test maintenance by automatically generating and adapting tests based on real user behavior. Engineering teams can now release code faster without sacrificing quality, and our traction proves it: we've tripled ARR in the last 9 months. We are backed by Leap Global Partners and were formed at super{set}, whose co-founders have over $1B in exits to Microsoft and Salesforce.
We're looking for a builder who ships. You have hands-on experience building with LLMs, shown through shipped features or serious prototypes. You have worked with prompts, function calling, tool use, structured outputs, evals, and failure handling. You turn ambiguous problems into iterative plans, wire up tools and feedback loops, and measure what matters. You are comfortable moving between prompt design, Python code, and product tradeoffs. You enjoy pairing with engineers and PMs, mentoring teammates, and owning outcomes in a dynamic startup environment.
This is a hybrid role, with four days a week in the San Francisco office.