Logo
Visit Our Website

https://tap.pm/

Backend Tech Lead (AI & Distributed Systems)

The Role

Lead a small, high-impact backend group while staying hands-on (~50%). Set the architecture, raise the bar on reliability and speed, and partner with ML/Product to ship AI features at scale.


What you’ll do

  • Own architecture for high-scale, user-facing backends and data pipelines.

  • Lead and mentor engineers; guide design reviews, roadmaps, and execution.

  • Operate production systems with clear SLOs/SLAs, on-call quality, and blameless postmortems.

  • Evolve our K8s/GPU platform, CI/CD, and observability to increase deployment velocity and resilience.

  • Integrate and serve ML models (PyTorch, CUDA/TensorRT, Triton) with a focus on throughput and cost.

  • Partner cross-functionally (ML, Product, Security, Finance) on priorities and cost/perf trade-offs.

  • Hire, level up the team, and uplevel engineering standards.

What you bring

  • 7+ years backend engineering, including 2+ years tech-leading or managing.

  • Deep experience in Node.js or Python (strong in the other).

  • Distributed systems/Kubernetes/cloud expertise; strong operational rigor.

  • Track record of scaling systems, improving reliability, and shipping fast.

  • Excellent communication, prioritization, and mentoring skills.

Nice to have

  • Large K8s clusters (hundreds of nodes), GPU scheduling, NVIDIA stack (CUDA, Triton).