Staff SRE Engineer
About ShipMonk
ShipMonk isn't just a 3PL; we're a growth partner for merchants. We provide cutting-edge technology and a network of owned and operated fulfillment centers that empower high-growth ecommerce and DTC brands to stress less and grow more. With over 2,500 employees across five countries, we're on a mission to revolutionize fulfillment by providing everything from the fastest click-to-delivery and real-time inventory to custom solutions—all with a merchant-first mindset.
Why ShipMonk?
We believe in building for the long term, and our success is powered by five key differentiators that help us become true partners to our merchants.
● Global Fulfillment Network: Our 12+ owned and operated fulfillment centers span the US, Canada, Mexico, the U.K., and Mainland Europe. We never outsource, ensuring quality and consistency.
● Proprietary Technology: We've eliminated the need for tribal knowledge with our AI-powered platform. It provides a real-time, unified view of inventory and orders, giving our merchants the control and visibility they need to succeed.
● Unrivaled Support: We provide hands-on, "mom and pop" support with a global reach. Our dedicated teams are on-site at every fulfillment center, ready to jump into action.
● Transparent Pricing: We believe in honest, long-term partnerships. Our all-inclusive pricing means predictable costs, with no hidden fees or surprises.
● Committed to the Future: We invest over $10 million annually in research and development to ensure our technology and services continually evolve, helping merchants plant roots with a partner who is here to stay.
Our values are the heart of our culture. We're looking for individuals who embody these principles every day.
● Merchant-first: We handle the logistics so our merchants can focus on what they do best—growing their business.
● People make ShipMonk: We believe in our team and invest in our people.
● Change the score: We challenge the status quo, constantly innovating and improving.
● Get sh*t done: We're a fast-paced, high-growth company that values action and results.
We are seeking an influential Staff SRE to help architect and drive the strategic evolution of our core cloud and deployment infrastructure, shifting our operations toward a more robust, self-service developer platform. This is a highly strategic but hands-on role for an engineer ready to challenge inefficiencies and carry continuous improvement initiatives from concept to production.
The opportunity
You will be the key technical innovator defining our infrastructure's future state, specifically focused on scaling, optimizing, and enhancing our fully automated platform. While the current architecture is stable, you will be empowered to conduct deep analysis and implement strategic, iterative architectural changes to substantially improve developer velocity and system reliability.
This role is focused on strategic planning, persuasion, and execution to drive evolutionary improvements that result in a best-in-class developer experience, moving us forward one major step at a time.
Key responsibilities and scope
- Platform Architecture: Propose the design, implementation, and maintenance of core cloud and deployment systems, advocating for self-service patterns.
- Kubernetes and Cloud Orchestration: Take ownership of the scalability, security, and optimization of production Kubernetes clusters and the underlying AWS account management structure.
- CI/CD Strategy: Drive best practices across our CI/CD pipelines, optimizing performance and reliability of GitLab CI runners and standardizing deployment flows using Argo CD.
- Infrastructure Core Services: Provide administrative expertise and reliability improvements for critical services, including RabbitMQ and the enterprise VPN.
- Observability Leadership: Improve the organization’s vision for monitoring, tracing, and logging, and manage the strategic use and optimization of Datadog across all environments.
Skills and qualifications
- 6+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles.
- Deep expertise in AWS multi-account environments (Networking, Security, IAM).
- Expert-level knowledge of Kubernetes administration, networking, and deployment strategies.
- Strong operational experience with messaging systems (e.g., RabbitMQ) and GitOps tools (e.g., Argo CD).
- Proficiency in modern CI/CD tooling, specifically GitLab CI/CD.
- Expertise in Infrastructure as Code (IaC), preferably Terraform.
- Demonstrated experience managing large-scale observability platforms like Datadog.
Ideal candidate
- Evolution Driver: Possesses a strong internal drive and the conviction to push for continuous, significant improvements and strategically refine the status quo of existing processes and infrastructure.
- Strategic Communicator: A great communicator who is skilled at listening to the needs of engineering teams, translating those needs into technical roadmaps, and then successfully persuading other engineers and management that their ideas are worth investing in.
- Platform-Focused: Experienced in building internal developer platforms (IDPs) and services, focusing on APIs and tooling that enable developers to deploy and manage their services reliably and independently.
- Technical Innovator: Acts as a force multiplier by bringing fresh ideas, challenging conventions, and raising the technical bar across the entire organization.
ShipMonk is an equal opportunity employer. We value diversity and do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.