
Back to jobs
Site Reliability Engineer (L2)
Cyprus
Your role at Exness:
You will join our Reliability team and focus on delivering insights from massive data in real-time. You will help us provide our users with a rich functionality, high availability, and superior performance to help them achieve their missions. To do this, you will need extensive experience and a willingness to bring fresh ideas, demonstrate a unique and informed point of view, and enjoy working with a cross-functional team to develop real-world solutions and create a positive user experience at every interaction.
You will:
- Analyse metrics, logs from operating systems, applications and other infrastructure layers to assist in performance tuning and fault-finding
- Provide correlations between layers, highlight hidden parts, improve observability, and reliability of our systems
- Escalate issues to product/platform teams, initiate and facilitate firefighting processes, provide issue summary
- Run blameless postmortems and complete root cause analysis investigations
- Contribute to handbooks/run-books, general documentation
- Contribute to our tools and automation solutions
- Participate in knowledge sharing, mentoring, providing training material and workshops
What makes you a great fit:
- Advanced Linux administration experience
- Basic programming experience in Golang, Python, C++, or Java
- Solid understanding of TCP/IP network fundamentals, experience in troubleshooting of network issues
- Experience using Gitlab CI/CD or Terraform (Iac)
- Experience with Jaeger, Sentry, DataDog, NewRelic, Grafana, and Prometheus
- Experience with system analysis and troubleshooting
- Ability to quickly learn new technologies, frameworks, and architectures
- Strong listening skills and a high level of tolerance
- Fluency in English
- Experience working with globally distributed systems or infrastructure would be considered an advantage
- Experience with Kubernetes, Docker, Postgres, and Kafka would be considered an advantage
- Experience with Rancher, Rancher2, AWS EKS, Alicloud ACS, and Rancher RKE would be considered an advantage
What we offer along the way:
- Competitive and attractive compensation
- Extensive learning opportunities, such as professional training and certifications, soft skills development, free English courses, and trading workshops
- Health and life insurance for employees, spouses, and children, including vaccinations, tests, mental health care, and coverage for vision and dental care
- Allowance for sports club memberships or other physical exercise activities
- Reimbursement for a work laptop, home office equipment, and coworking memberships
- Generous time off, including 21 days of annual leave and paid sick leave
- Special ‘Get to know your team’ trips
Your journey after applying:
- Interview with a Talent Acquisition Specialist (30 minutes)
- Short online English test
- Technical interview (1 hour)
- Behavioral interview (1 hour)
Please use your exness work email for internal applications and ensure to disclose any existing Conflict of Interest you may have.
Apply for this job
*
indicates a required field
