Back to jobs

Site Reliability Engineer

Cyprus

Your role at Exness:

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Provide primary operational support and engineering for multiple large distributed software applications
  • Build software and systems to manage platform infrastructure and applications

You will:

  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Facilitate incidents, run blameless postmortems and complete root cause analysis investigations 
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service level objectives
  • Participate in system design consulting, platform management, and capacity planning
  • Contribute to hadnbooks/runbooks, general documentation
  • Knowledge sharing, mentoring, providing training material and workshops

What makes you a great fit:

  • Minimum 5+ years of strong hands on Linux and Windows experience
  • Experience programming in Golang, Python, C++ or Java at least 3+ years
  • Understanding of Linux and TCP/IP network fundamentals
  • Experience running services such as load balancers, relational databases, messaging systems and orchestration systems
  • Demonstrated expertise with Kubernetes and Docker in a hybrid environment
  • Experience with Gitlab CI/CD, Terraform (Iac)
  • Experience analyzing and troubleshooting systems
  • Ability to quickly learn new technologies, frameworks, and architectures
  • Bonus points:

    • Experience working with globally distributed systems or infrastructure
    • Experience with Amazon AWS, Alibaba or Google clouds
    • Experience with Nginx, HaProxy, Envoy, Traefik
    • Experience with Istio, Consul Mesh
    • Experience with Postgres, Clickhouse, MongoDB, Elasticsearch
    • Experience with Etcd, Zookeeper, Consul
    • Experience with Redis, Kafka, RabbitMQ, Nats
    • Experience with Graylog, Loki, Prometheus, Thanos, Grafana, Zabbix
    • Experience with Jaeger, Sentry, DataDog 
    • Experience with WAF, CDN
    • Experience with Rancher, Rancher2
    • Fluency in Golang and/or Python

What we offer along the way:

  • Competitive and attractive compensation
  • Extensive learning opportunities, such as professional training and certifications, soft skills development, free English courses, and trading workshops
  • Flight tickets, hotel or apartment accommodation for your first month, migration support, and legal help for you and your family (if relocating)
  • Health and life insurance for employees, spouses, and children, including vaccinations, tests, mental health care, and coverage for vision and dental care
  • Generous time off, including 21 days of annual leave and paid sick leave
  • Education allowance for your children’s school and kindergarten fees
  • Access to our very own sports club with dedicated coaches, free Sanctum Club memberships for you and your spouse, corporate SUPs, jet skis, etc
  • A branded company car with a parking space near the office
  • Outstanding team-building experiences and Exness community gatherings

Your journey after applying:

  1. Intro call with your Recruiter (30-40 minutes)
  2. Short online English test (for non-native speakers)
  3. Technical interview (1 hour)
  4. Final interview (1 hour)

 

Apply for this job

*

indicates a required field

Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Select...
Select...