About PlayerZero
PlayerZero is building a self‑healing system for software—automating defect detection, diagnosis, and remediation so developers ship with confidence. Teams use PlayerZero to spot issues before customers do, pinpoint root causes fast, and close the loop from incident to fix.
Our platform includes capabilities like Agentic Debugging and Code Simulations that let engineers reproduce complex scenarios, reason about failures, and validate fixes safely and quickly.
The role
You will lead the design and development of distributed services that power PlayerZero's agentic workflows and simulations—owning systems end‑to‑end from architecture to operations. You'll work across the stack (frontend when needed), but your center of gravity is backend, reliability, and scale. As a senior/lead, you set technical direction, raise the bar for code and design quality, and mentor others.
What you’ll do
- Own and evolve core microservices: design APIs, data models, and event flows for high‑throughput, low‑latency workloads.
- Build resilient, observable systems: implement SLOs, metrics, tracing, and graceful degradation; drive incident response and postmortems.
- Design asynchronous and event‑driven architectures (queues/streams, sagas, idempotency, backpressure, retries, DLQs).
- Ship platform capabilities that support Agentic Debugging and Code Simulations, including safe execution sandboxes and workflow orchestration.
- Collaborate with product/design to scope problems, write technical specs, and deliver iterative outcomes.
- Improve developer experience: CI/CD, test strategy, feature flags, trunk‑based development, and automation.
- Provide technical leadership: run design reviews, mentor engineers, and influence cross‑team architecture.
- Contribute to the web UI (React/TypeScript) as needed to deliver end‑to‑end features.
What you’ll bring
- 7+ years building production software, with 3+ years designing and operating distributed, microservices‑based systems.
- Deep expertise in at least one backend language (Go, Java/Kotlin, Rust, Python, or TypeScript/Node) and willingness to learn others.
- Strong foundation in systems design: consistency models, CAP tradeoffs, transactions/sagas, caching, partitions, circuit breaking, rate limiting.
- Proficiency with cloud infrastructure (AWS/GCP), containers/Kubernetes, IaC (Terraform), and secure networking.