
Back to jobs
Site Reliability Engineer
Cyprus
Your role at Exness:
- Run the production environment by monitoring availability and taking a holistic view of system health
- Improve reliability, quality, and time-to-market of our suite of software solutions
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
- Provide primary operational support and engineering for multiple large distributed software applications
- Build software and systems to manage platform infrastructure and applications
You will:
- Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
- Partner with development teams to improve services through rigorous testing and release procedures
- Facilitate incidents, run blameless postmortems and complete root cause analysis investigations
- Create sustainable systems and services through automation and uplifts
- Balance feature development speed and reliability with well-defined service level objectives
- Participate in system design consulting, platform management, and capacity planning
- Contribute to hadnbooks/runbooks, general documentation
- Knowledge sharing, mentoring, providing training material and workshops
What makes you a great fit:
- Minimum 5+ years of strong hands on Linux and Windows experience
- Experience programming in Golang, Python, C++ or Java at least 3+ years
- Understanding of Linux and TCP/IP network fundamentals
- Experience running services such as load balancers, relational databases, messaging systems and orchestration systems
- Demonstrated expertise with Kubernetes and Docker in a hybrid environment
- Experience with Gitlab CI/CD, Terraform (Iac)
- Experience analyzing and troubleshooting systems
- Ability to quickly learn new technologies, frameworks, and architectures
-
Bonus points:
- Experience working with globally distributed systems or infrastructure
- Experience with Amazon AWS, Alibaba or Google clouds
- Experience with Nginx, HaProxy, Envoy, Traefik
- Experience with Istio, Consul Mesh
- Experience with Postgres, Clickhouse, MongoDB, Elasticsearch
- Experience with Etcd, Zookeeper, Consul
- Experience with Redis, Kafka, RabbitMQ, Nats
- Experience with Graylog, Loki, Prometheus, Thanos, Grafana, Zabbix
- Experience with Jaeger, Sentry, DataDog
- Experience with WAF, CDN
- Experience with Rancher, Rancher2
- Fluency in Golang and/or Python
What we offer along the way:
- Competitive and attractive compensation
- Extensive learning opportunities, such as professional training and certifications, soft skills development, free English courses, and trading workshops
- Flight tickets, hotel or apartment accommodation for your first month, migration support, and legal help for you and your family (if relocating)
- Health and life insurance for employees, spouses, and children, including vaccinations, tests, mental health care, and coverage for vision and dental care
- Generous time off, including 21 days of annual leave and paid sick leave
- Education allowance for your children’s school and kindergarten fees
- Access to our very own sports club with dedicated coaches, free Sanctum Club memberships for you and your spouse, corporate SUPs, jet skis, etc
- A branded company car with a parking space near the office
- Outstanding team-building experiences and Exness community gatherings
Your journey after applying:
- Intro call with your Recruiter (30-40 minutes)
- Short online English test (for non-native speakers)
- Technical interview (1 hour)
- Final interview (1 hour)
Apply for this job
*
indicates a required field