Staff/Principal Site Reliability Engineer
We are seeking an exceptional Staff/Principal Site Reliability Engineer to lead critical infrastructure initiatives and drive Innovation across our organization. You'll architect scalable solutions, navigate complex technical challenges independently, and deliver results under tight deadlines in a fast paced environment. You will work cross‑functionally alongside builders who have helped shape the success of companies such all ways as Google, Okta, AWS, and Snowflake.
Strategic Leadership & Technical Execution
- Lead enterprise‑wide reliability and infrastructure projects across multiple teams with high autonomy
- Navigate ambiguous problem spaces and deliver innovative solutions under tight deadlines
- Architect and deploy solutions for Cloud Prem and SaaS customers at scale
- Drive technical innovation and establish SRE best practices across the organization
- Respond to critical incidents, lead root cause analysis, and implement long‑term resolutions
- Develop automation solutions to streamline operations and reduce manual workload
- Participate in on‑call rotation and ensure effective incident handoff and documentation
Cross‑Functional Collaboration & Communication
- Partner with Engineering, Product, and Customer Success teams to align reliability goals with business objectives
- Communicate complex technical concepts effectively to technical and non‑technical audiences, including executives
- Influence technical decisions across teams through thought leadership and demonstrated expertise
- Build consensus and Drive adoption of new tools, processes, and architectural patterns
Customer‑Facing Technical Leadership
- Provide tier 2/3 technical support to enterprise customers for complex troubleshooting
- Work directly with customer technical teams to resolve deployment, configuration, and integration challenges
- Conduct technical onboarding and provide expert guidance on platform architecture and best practices
- Create customer‑facing documentation, troubleshooting guides, and run‑books
- Lead customer calls and technical discussions as a trusted advisor
Team Development
- Mentor SRE and engineering team members, elevating technical capabilities
- Foster a culture of reliability, operational excellence, and continuous improvement
You have:
Required Experience
- BS degree in Computer Science or related field (or equivalent practical experience)
- 7+ years in Site Reliability Engineering, DevOps, or Infrastructure Engineering
- Proven track record leading large‑scale, cross‑team infrastructure projects from conception to production
- Demonstrated ability to work autonomously on ambiguous projects with tight deadlines
Technical Expertise
- 5+ years with AWS (VPC, EC2, RDS, EKS, CloudFormation) and cloud automation
- Expert‑level experience with Kubernetes, Helm, Linux, and Terraform
- Strong experience with GitOps model, distributed version control, and CI/CD pipelines
- Proficiency with monitoring tools (Prometheus, Grafana, DataDog)
- Strong programming/scripting skills (Python, Go, Bash) for automation
- Deep understanding of distributed systems, microservices, and reliability patterns
- Experience with Bazel and CueLang a plus
Leadership & Communication
- Exceptional ability to articulate complex technical concepts to diverse audiences
- Track record of Driving technical change across organizational boundaries
- Successfully Delivered multiple complex projects under tight deadlines
- Strong customer service orientation with patience and empathy
Work Style
- Thrives in ambiguous environments and makes progress without perfect information
- Hands‑on, "can do" attitude with bias for action
- Low ego and high intellectual curiosity
- Comfortable working across time zones
- Self‑motivated with strong ownership mentality
Compensation Disclosure
$184,000—$240,000 USD
Compensation depends on skills, qualifications, experience, and work location. Variable compensation such as commission is not included.
Our Culture
- Ownership Mindset
- Act with Integrity
- Guardians of our Customers
- Opinionated Humility
- Build Trust, Earn Trust
Veza is proud to be an equal opportunity employer. We are committed to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other applicable legally protected characteristics. We also consider qualified applicants according to applicable federal, state, and local laws. If a candidate with a disability requires an accommodation during the recruitment process, please email ...@veza.com.