Overview
AirGarage is seeking a Software Engineer to own the reliability, health, and observability of our nationwide IoT device fleet. You will work across embedded systems, backend infrastructure, and site reliability engineering, designing and building the tools, monitoring pipelines, and automation that keep hundreds of devices online and performing reliably across our locations.
What You Will Do
- Design and implement systems to monitor, diagnose, and improve IoT device health at scale.
- Build internal tools and scripts for device setup, fleet observability, QA automation, and ongoing monitoring.
- Contribute to backend services that support device integration, calibration, and reliability improvements.
- Investigate and resolve fleet-wide issues by analyzing metrics, logs, and telemetry; minimize downtime through remote debugging and fixes.
- Test and tune hardware products during or after installation (e.g., camera exposure, detection modes, connectivity parameters) to ensure optimal performance.
- Conduct periodic fleet-wide health assessments to detect degradation, systemic issues, or underperforming devices, and recommend firmware or deployment improvements.
- Serve as the primary internal contact for hardware health, providing regular reports to operations on per-site hardware performance, device uptime, and systemic issues affecting service quality.
- Collaborate with operations and hardware teams to surface recurring pain points and propose architectural or process improvements that drive greater reliability and scalability.
- Author and maintain troubleshooting guides, repair instructions, and internal playbooks that enable consistency and efficiency across deployments.
- Travel occasionally (~20%, otherwise fully remote) for QA, deployments, and on-site debugging when remote fixes aren't possible.
What You Need
- 5+ years of professional software engineering experience.
- Experience managing distributed Linux-based hardware appliances or IoT fleets.
- Familiarity with observability and monitoring tools (e.g., Datadog, OpenTelemetry, Prometheus, Grafana) and building internal tooling for device health and alerting.
- Strong proficiency in Python and SQL, with experience shipping production-quality code. C++ background is a plus.
- Track record building internal tooling, monitoring, or reliability platforms.
- Hands-on experience with Linux systems (dmesg, journalctl, ip, systemd, etc.) and debugging distributed hardware/software environments.
- Background in cellular (4G LTE, Cat 4, Cat 1bis, 5G RedCap), Wi-Fi, Wi-Fi HaLow, or other wireless connectivity.
- Excellent written and verbal communication skills; able to translate complex technical findings into clear reports and playbooks.
- Self-starter who thrives in a fast-paced, ownership-driven environment.
- Willingness to travel to locations for troubleshooting (roughly 20% travel, otherwise fully remote).
The Upside
- Equity: Have a stake in the business that you're helping to build and grow.
- Work remotely: Live and work wherever you like. We currently hire teammates located anywhere within North America.
- Health insurance: We offer health insurance and currently cover 85% of the cost for the primary employee and 50% for dependents.
- Home office setup: Laptop and equipment provided to set you up for success.
- Time to recharge: Unlimited PTO with a minimum requirement of 10 days per year.
- 401(k): 401(k) retirement savings program.
- Team off-sites: Roughly twice per year, the team gathers for a full week in places like Tahoe, Puerto Vallarta, San Diego, and Austin.
- Room to grow: Opportunities to grow with a rapidly expanding team.
- Transform our cities: Help change how real estate is used in our cities.
- Work with a diverse team: Our team is ~40% female and 30%+ from underrepresented communities.
AirGarage is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Candidates and employees are evaluated based on merit, qualifications, and performance. We will never discriminate on the basis of race, color, gender, national origin, ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, or any other legally protected status.
Compensation Range: $160K - $190K
Job Details
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Engineering and Information Technology
- Industries: Technology, Information and Internet