Senior Site Reliability / Gitops Engineer
Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical.
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include top public cloud and silicon providers, and industry leaders across sectors. We have a global, distributed team of over 1200 colleagues in 75+ countries, with few office-based roles. Teams meet a few times annually in interesting locations worldwide to align on strategy and execution.
The company is founder-led, profitable, and growing. We are hiring a Senior Site Reliability / Gitops Engineer for our Information Systems (IS) team. This role is ideal for an automation-focused senior technologist with a passion for Linux, eager to build a career at Canonical and support those leveraging Ubuntu and open source products.
Job Summary
The IS team maintains all of Canonical's IT production services, supporting over 60 million Ubuntu users. As a Senior SRE & Gitops Engineer, you'll drive operations automation in our private and public clouds using open source infrastructure as code, CI/CD pipelines, and Canonical's automation products. You'll also provide feedback to developers on product operation at scale, collaborate across teams, and support our global SRE team to deliver excellent services.
Responsibilities
- Lead automation and Gitops initiatives as an embedded tech lead.
- Collaborate with IS architect to align solutions with architecture vision.
- Design and develop services as products for IS.
- Enhance infrastructure as code practices, increasing automation and process improvements.
- Automate software operations for reusability and consistency across cloud environments.
- Maintain operational responsibility for core services, networks, and infrastructure.
- Develop troubleshooting, capacity planning, and performance monitoring skills using tools like Prometheus, Grafana, and Elasticsearch.
- Support and work with globally distributed teams.
- Focus on larger automation projects during dedicated development time.
- Share expertise through design sessions, mentorship, and collaborative work.
- Handle time-critical escalations responsibly.
Qualifications
- Modern understanding of hosting architecture with infrastructure as code across clouds.
- Product mindset focused on developing products.
- Experience with Python and large projects.
- Knowledge of Kubernetes or similar container orchestration systems.
- Proven ability to manage and deploy cloud infrastructure via code.
- Practical Linux networking, routing, and firewall skills.
- Experience with Linux storage solutions like Ceph and databases.
- Hands-on Linux server administration.
- Deep understanding of cloud computing concepts.
- Bachelor's degree or higher in computer science or related field.
- Effective communication skills in English.
- Strong troubleshooting skills, kernel to web.
- Adaptability and quick learning.
- Passion for open-source, especially Ubuntu or Debian.
Benefits
We offer competitive, performance-based compensation, annual reviews, and benefits such as:
- Distributed work environment with biannual in-person team sprints.
- USD 2,000 annual learning budget.
- Recognition rewards, holiday leave, maternity/paternity leave.
- Wellness programs, travel opportunities, and more.
About Canonical
Canonical leads in open source innovation, publishing Ubuntu, a key platform for AI, IoT, and cloud. We hire globally, value excellence, and support remote work since 2004. We foster diversity and a discrimination-free workplace.
Additional Details
Level: Mid-Senior, Full-time, Engineering/IT, Software Development.
#J-18808-Ljbffr