Back to jobs
Network and Systems Operations Engineer (Family Networking)
Romania
Company Background
Our client is dedicated to helping families stay safe, secure, and connected. Trusted by millions across 140 countries, the platform provides tools for real-time location sharing, driving safety features, and notifications, empowering families to navigate life’s unpredictability with confidence.
Project Description
The Network and Systems Operations (NSO) team is part of Cloud Operations and focuses on two core missions:
- Providing world-class observability infrastructure and tools for engineers.
- Delivering L1 service support and driving incident management to ensure system reliability.
Technologies
- Prometheus
- Grafana
- Datadog
- Java
- Python
- Shell
- Ruby
- Docker
- Kubernetes
- AWS
- Terraform
- CloudFormation
- Chef
- Ansible
What You'll Do
- Monitor environments using Prometheus, Grafana, and Datadog to identify and resolve issues;
- Respond to alerts in PagerDuty, drive incidents to resolution, and document improvements from post-mortem action items;
- Serve as part of the L1 support team, resolving or escalating issues in a timely manner using runbooks;
- Contribute to operational excellence by improving processes, documentation, and tooling;
- Analyze and optimize high-traffic internet applications to ensure scalability and reliability;
- Collaborate with cross-functional teams to enhance system performance and observability;
Job Requirements
- 5+ years of experience writing, reading, and debugging code in languages such as Java, Python, Shell, or Ruby;
- 5+ years of experience managing large-scale distributed systems and Linux-based systems in cloud environments such as AWS;
- Deep expertise with large-scale observability systems like Prometheus, Datadog, or similar;
- 3+ years of experience with solutions like Docker, Kubernetes, and system virtualization;
- Proficiency in Infrastructure as Code (IaC) and configuration management tools like Terraform, CloudFormation, Chef, or Ansible;
- Strong analytical, troubleshooting, and problem-solving skills;
- Ability to quickly learn new technologies and adapt to industry trends;
- Attention to detail and ability to optimize high-traffic applications;
- English proficiency from B1 for effective communication;
What Do We Offer
The global benefits package includes:
- Technical and non-technical training for professional and personal growth;
- Internal conferences and meetups to learn from industry experts;
- Support and mentorship from an experienced employee to help you professional grow and development;
- Internal startup incubator;
- Health insurance;
- English courses;
- Sports activities to promote a healthy lifestyle;
- Flexible work options, including remote and hybrid opportunities;
- Referral program for bringing in new talent;
- Work anniversary program and additional vacation days.
Apply for this job
*
indicates a required field