The role:
Lead a small, high-impact backend group while staying hands-on (~50%). Set the architecture, raise the bar on reliability and speed, and partner with ML/Product to ship AI features at scale.
Location: Hybrid (1 day/week on-site, Beer Sheva)
What you"ll do:
- Own architecture for high-scale, user-facing backends and data pipelines.
- Lead and mentor engineers; guide design reviews, roadmaps, and execution.
- Operate production systems with clear SLOs/SLAs, on-call quality, and blameless postmortems.
- Evolve our K8s/GPU platform, CI/CD, and observability to increase deployment velocity and resilience.
- Integrate and serve ML models (PyTorch, CUDA/TensorRT, Triton) with a focus on throughput and cost.
- Partner cross-functionally (ML, Product, Security, Finance) on priorities and cost/perf trade-offs.
- Hire, level up the team, and uplevel engineering standards.
What you bring:
- 7+ years backend engineering, including 2+ years tech-leading or managing.
- Deep experience in Node.js or Python (strong in the other).
- Distributed systems/Kubernetes/cloud expertise; strong operational rigor.
- Track record of scaling systems, improving reliability, and shipping fast.
- Excellent communication, prioritization, and mentoring skills.
Nice to have:
Large K8s clusters (hundreds of nodes), GPU scheduling, NVIDIA stack (CUDA, Triton).