Principal Product Engineer
About Nscale
Nscale is taking on the hyperscalers by building a vertically integrated GenAI cloud platform that spans from sustainable data centres to advanced AI infrastructure and enterprise applications. We’re shaping the next generation of AI-native computing - secure, efficient, and transparent.
Our culture is built on relentless innovation, accountability, and excellence. As an Nscaler, you’ll join a team that values open collaboration, speed, and respect. We encourage bold thinking and trust every individual to take ownership and deliver impact - together.
About the Role
- Design and implement foundational platform capabilities across APIs, services, workflows, and data/control planes.
- Drive architecture for scalability, reliability, security, and cost efficiency across distributed systems.
- Turn ambiguous problems into crisp designs, aligned execution plans, and high-quality shipped outcomes.
- Raise the engineering bar through design reviews, testing strategy, observability, and post-incident learning.
- Embed security, IAM, privacy, and governance into system design and delivery by default.
- Partner with squads across the firm to align on interfaces, ownership, and operational readiness.
- Improve delivery velocity with platform leverage, automation, and tooling, using AI responsibly to accelerate delivery and scale operations.
Requirements
- 15+ years building and shipping production platforms at scale.
- Strong distributed systems and cloud-native experience (Kubernetes, CI/CD, reliability patterns).
- Proficiency in Python, Go, and/or Rust; strong fundamentals in code quality, testing, and performance.
- Deep experience with operational excellence: observability, incident response, and continuous improvement.
- Strong security fundamentals (IAM, data protection, governance) in production environments.
- Excellent collaborator and communicator across engineers, PMs, and executives.
- Ability to leverage AI to build, evolve, and maintain large-scale systems.
Nice to Have
- Experience building developer platforms (internal tooling, control planes, APIs/SDKs/CLIs).
- Experience with SLOs/SLIs, capacity planning, and cost optimisation in high-availability services.
- Familiarity with Prometheus/Grafana/OpenTelemetry and modern reliability practices.
- Exposure to GenAI/LLM systems is a plus.
At Nscale, we are committed to fostering an inclusive, diverse, and equitable workplace. We believe that a variety of perspectives enriches our work environment, and we encourage applications from candidates of all backgrounds, experiences, and abilities. We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds.
If there’s anything we can do to accommodate your specific situation, please let us know.
The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.
For information on how Nscale handles candidate personal data, please see our Employee & Candidate Privacy Notice.