Job Details

Senior Infrastructure Engineer

  2025-10-25     Vast.ai     San Francisco,CA  
Description:

About Us

Vast.ai's cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing—reshaping our future for the benefit of humanity.

Job Title

Staff Infrastructure Engineer

Location

On-site at our office in San Francisco or Westwood, Los Angeles.

About the Role

As a Senior Infrastructure Engineer, you will help design and scale the core systems that play the role of Vast.ai's global GPU marketplace. You'll work closely with our founders and core engineering team to extend the underlying compute infrastructure — from GPU provisioning and scheduling to billing, orchestration, and marketplace dynamics. We're looking for someone who has previously built large‑scale infrastructure platforms — systems with similarities to Vast.ai, or distributed compute orchestration frameworks.

Tech Stack

  • Python, C++, PostgreSQL, Linux, Docker, KVM, Redis, Terraform, AWS, REST/gRPC APIs

Ideal Experience

  • Distributed Systems: Experience building high‑throughput backend systems or compute clouds
  • Compute Orchestration: Familiarity with Docker, or custom scheduling frameworks
  • GPU Infrastructure: Understanding of GPU provisioning, driver management, and workload scheduling
  • Billing & Metering: Implemented or integrated usage‑based billing and account credit systems
  • Marketplace Dynamics: Knowledge of dynamic pricing, spot instances, or supply‑demand balancing mechanisms
  • Security & Multi‑Tenancy: Experience designing secure, multi‑tenant systems in cloud environments
  • Programming: Strong programming skills in Python and C++; ability to write performant, maintainable, well‑architected code
  • Database Expertise: Comfortable designing schemas and queries for large‑scale data systems (PostgreSQL preferred)

Bonus Points

  • Experience with GPU security, virtualization, or zero‑trust compute isolation
  • Prior startup experience or end‑to‑end product ownership

Key Responsibilities

  • Improve the backend systems that power Vast.ai's compute marketplace
  • Integrate GPU provider onboarding, usage tracking, billing, and orchestration APIs
  • Develop scalable infrastructure for workload scheduling and resource management
  • Optimize pricing and marketplace logic for efficiency and transparency
  • Benchmark, profile, and harden systems for performance, reliability, and fault tolerance
  • Collaborate with product and infrastructure teams to shape the future of decentralized compute

Interview Process (≈ 1 week)

  • 15 min – Initial screening with member of your future team (virtual)
  • 40 min – Systems and architectures (virtual)
  • 1 hour – LLM‑assisted coding assessment (virtual)
  • 2 hours – Meet and greet with coding assessment (on‑site)

Annual Salary Range

$180,000 – $300,000 + equity + benefits

Benefits

  • Comprehensive health, dental, vision, and life insurance
  • 401(k) with company match
  • Meaningful early‑stage equity
  • Onsite meals, snacks, and close collaboration with founders and tech leads
  • Ambitious, fast‑paced startup culture where initiative is rewarded
#J-18808-Ljbffr


Apply for this Job

Please use the APPLY HERE link below to view additional details and application instructions.

Apply Here

Back to Search