Team Lead, SRE
Veeam, the #1 global market leader in data resilience, believes businesses should control all their data whenever and wherever they need it. Veeam provides data resilience through data backup, data recovery, data portability, data security, and data intelligence. Based in Seattle, Veeam protects over 550,000 customers worldwide who trust Veeam to keep their businesses running. Join us as we move forward together, growing, learning, and making a real impact for some of the world’s biggest brands. The future of data resilience is here - go fearlessly forward with us.
About the Role
Veeam is expanding its Site Reliability Engineering (SRE) organization to support Veaam services. As an SRE Team Leader, you will build and lead a high-performing team that partners with product, platform, and security engineering to make our systems reliable, scalable, and observable from the ground up. You’ll collaborate with peer engineering leaders to embed reliability into service roadmaps.
You’ll drive adoption of SRE principles (SLIs/SLOs/error budgets) and operate a healthy, daytime follow-the-sun on-call model in partnership with other regions. You will lead your team to make improvements in the overall operability, reliability, resilience, and security of the services we support.
What You’ll Do
People & Team Leadership
- Hire, onboard, and develop your SRE team
- Encourage culture that prioritizes learning and engineering over fault-finding and firefighting
- Ensure a sustainable operational coverage; monitor on-call health and workload
Reliability Strategy & Governance
- Establish and operationalize SLIs/SLOs and error budgets with service owners
- Run reliability reviews and hold teams accountable to outcomes
- Define reliability standards, runbooks, readiness checklists, and alerting patterns (including SLO-based alerting)
Operations & Incident Excellence
- Ensure incident response readiness
- Lead and coordinate major incidents
- Measure MTTR, change failure rate, SLO posture, and repeat-incident reduction
Engineering & Automation
- Lead software-first reliability investments: observability, resilience testing/chaos, and self-service guardrails
- Drive platform improvements and internal tools
What You’ll Bring
- 3+ years in managing Software, Platform, and/or Reliability Engineering
- Experience in IT Platform Engineering or Software Development
- Demonstrable experience leading engineering teams to predictably deliver outcomes
- Demonstrated success leading SLO/error-budget adoption and reliability programs for services
- Experience leading cross-functional initiatives collaboratively with peers through influence
- Experience with public clouds, Kubernetes, IaC, CI/CD, and observability
- Hands-on incident management and postmortem practice
- Readiness to participate in an on-call rotation (typically during daytime hours, including weekends/holidays)
Bonus Skills
- Experience operating a multi-region follow-the-sun on-call model
- Background in chaos/resilience/performance testing
- Experience in building or scaling SRE teams and influencing org-wide standards
- Coding background with experience improving service reliability
What You’ll Get
- 21 annual vacation days, additional days based on tenure, plus 4 extra global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
- Private health, dental, and vision insurance for employees and dependents, including outpatient care, hospitalization, pregnancy monitoring, and psychology support
- Monthly lifestyle and daily meal benefits: 40 RON/day via Edenred and 600 RON/month through a flexible cafeteria platform
- Life insurance (2× annual gross salary), critical illness, and disability coverage, plus vision reimbursement
- Free access to Bookster library platform for borrowing your favorite books for free
- Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops and learning events like our annual Global Day of Learning
Please note: If an applicant is permanently located outside of the Romania, Veeam reserves the right to decline the application for this position.
#LI-Remote
#LI-JS4
Please note that any personal data collected from you during the recruitment process will be processed in accordance with our Recruiting Privacy Notice.
The Privacy Notice sets out the basis on which the personal data collected from you, or that you provide to us, will be processed by us in connection with our recruitment processes.
By applying for this position, you consent to the processing of your personal data in accordance with our Recruiting Privacy Notice.
By submitting your application, you acknowledge that the information provided in your job application and any supporting documents is complete and accurate to the best of your knowledge. Any misrepresentation, omission, or falsification of information may result in disqualification from consideration for employment or, if discovered after employment begins, termination of employment.
Create a Job Alert
Interested in building your career at Veeam Software? Get future opportunities sent straight to your email.
Apply for this job
*
indicates a required field
