Zyphra is an artificial intelligence company based in San Francisco, California.
The Role:
As a Research Scientist, Model Architectures, you will be a core contributor to Zyphra's AI Architecture Research Team. You will design and rigorously test novel model architectures and training methodologies, with a focus on improving core modeling capabilities (e.g., loss per FLOP or loss per parameter) and addressing fundamental bottlenecks in contemporary models. You will also work closely with our pre-training team, who will integrate your insights into our next-generation models.
What you need: