Energy Jobline is the largest and fastest-growing global Energy Job Board and Energy Hub. We reach an audience of over 7 million energy professionals, advertise more than 400,000 global energy and engineering jobs each month, and work with the leading energy companies worldwide.
We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.
Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.
At Crusoe Energy Systems, our Site Reliability Engineering (SRE) team plays a mission‑critical role in maintaining the performance and reliability of our AI‑optimized cloud infrastructure. The Storage‑focused SRE role is responsible for ensuring the availability, performance, and scalability of Crusoe's cloud storage products and services, which power compute‑intensive, latency‑sensitive workloads for AI and HPC use cases. This role directly supports our vertically integrated, sustainable cloud platform by building and optimizing distributed, fault‑tolerant storage systems at scale.
In this role, you will build automation and self‑healing tools to monitor and maintain Crusoe's distributed cloud storage infrastructure, which includes block, file, and object storage systems. You will drive reliability initiatives focused on data replication, encryption, backup and restore strategies, and robust failover mechanisms. Collaborating closely with storage engineers, you will help implement and maintain high‑performance NVMe‑ and SSD‑backed volumes that support large‑scale AI compute clusters. Your responsibilities will also include supporting user‑facing storage services with a focus on availability, performance tuning, and adherence to error budgets. You'll investigate and resolve storage‑related incidents using deep telemetry, logs, and performance profiling, while also partnering with hardware and kernel teams to diagnose low‑level I/O issues and optimize I/O paths, cache policies, and file systems. Additionally, you will contribute to the architecture of fault‑tolerant, scalable storage backends tailored for AI‑first cloud environments.
Compensation will be paid in the range of $166,000 to $201,000 a year, plus bonus. Restricted Stock Units are included in all offers. Compensation is determined by the applicant's education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to genetic information, citizenship, marital status, veteran status, or any other status protected by law or regulation.