
Site Reliability Engineer
Who we are
Moniepoint is an all-in-one financial services platform for emerging markets and the second-fastest growing company in Africa.
Since 2019, Moniepoint’s technology has powered over 3 million people, offering personal and business banking, payment, credit and business management tools to help them succeed. Moniepoint processed $182 billion in 2023, and currently processes the majority of the POS transactions in Nigeria.
About the role
Engineering at Moniepoint is an inspired, customer-focused community, dedicated to crafting solutions that redefine our industry. Our infrastructure runs on tools that excite infrastructure engineers, such as Kubernetes and Docker. We also make business decisions based on the large stream of data we receive daily, so we work with big data, perform data analytics, and build models to make sense of the noise and give our customers the best experience.
We are looking for a Site Reliability Engineer to provide enterprise-level assistance to our production applications and services. You will be responsible for the stability, integrity, and operation of our production applications by supporting, monitoring and driving optimizations while also providing root cause analysis with recommendations for improvements. You will research, diagnose, troubleshoot, and resolve customer issues in an accurate and timely manner.
Curious about what makes Moniepoint an incredible place to work? Check out posts on how we cultivate a culture of innovation, teamwork, and growth.
Job Summary
You will be responsible for ensuring our systems run smoothly and efficiently while engineering solutions to improve visibility, eliminate repetitive tasks, and increase system resilience. The ideal candidate will balance real-time on-call responsibilities with strategic engineering work to achieve sustainable and scalable service reliability.
What you’ll get to do
- Participate in on-call rotations as the primary technical lead for detecting, triaging, and resolving service degradation, outages, or reliability issues across all environments.
- Act as the Incident Commander during major incidents: initiate war room or bridge calls, coordinate cross-functional teams, provide timely and clear status updates to all stakeholders, and lead and document blameless Root Cause Analyses (RCAs) that identify underlying causes and drive long-term fixes.
- Investigate and resolve customer complaints escalated beyond L1 and L2 support, particularly where performance, reliability, or complex system behavior is affected.
- Participate in feature development discussions to ensure services are built with observability from the ground up. Create and maintain dashboards to monitor application and infrastructure health and set up alerts for key performance metrics.
- Define and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs) in collaboration with Product and Engineering teams.
To succeed in this role, we think you should have
- Minimum 3 years of experience supporting enterprise applications in an SRE or similar role.
- Strong knowledge of cloud infrastructure, Kubernetes, and container orchestration tools.
- Experience with APM and observability platforms such as New Relic, Datadog, ELK, or SigNoz.
- Proficiency in setting up and maintaining monitoring dashboards using Grafana and Prometheus.
- Skilled in diagnosing issues using stack traces, log files, and APIs.
- Proficiency in SQL databases (e.g., MySQL) and hands-on experience in database administration.
What we can offer you
- Culture - We put our people first and prioritize the well-being of every team member. We’ve built a company where all opinions carry weight and where all voices are heard. We value and respect each other and always look out for one another. Above all, we are human.
- Learning - We have a learning and development-focused environment with an emphasis on knowledge sharing, training, and regular internal technical talks.
- Compensation - You’ll receive an attractive salary, pension, health insurance, annual bonus, plus other benefits.
What to expect in the hiring process
- A preliminary phone call with the recruiter
- A technical interview with the Hiring Manager
- A behavioural and technical interview with a member of the Executive team
Moniepoint is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees and candidates.