Backend Tech Lead (AI & Distributed Systems)
The Role
Lead a small, high-impact backend group while staying hands-on (~50%). Set the architecture, raise the bar on reliability and speed, and partner with ML/Product to ship AI features at scale.
What you’ll do
Own architecture for high-scale, user-facing backends and data pipelines.
Lead and mentor engineers; guide design reviews, roadmaps, and execution.
Operate production systems with clear SLOs/SLAs, on-call quality, and blameless postmortems.
Evolve our K8s/GPU platform, CI/CD, and observability to increase deployment velocity and resilience.
Integrate and serve ML models (PyTorch, CUDA/TensorRT, Triton) with a focus on throughput and cost.
Partner cross-functionally (ML, Product, Security, Finance) on priorities and cost/perf trade-offs.
Hire, level up the team, and uplevel engineering standards.
What you bring
7+ years backend engineering, including 2+ years tech-leading or managing.
Deep experience in Node.js or Python (strong in the other).
Distributed systems/Kubernetes/cloud expertise; strong operational rigor.
Track record of scaling systems, improving reliability, and shipping fast.
Excellent communication, prioritization, and mentoring skills.
Nice to have
Large K8s clusters (hundreds of nodes), GPU scheduling, NVIDIA stack (CUDA, Triton).