Back to jobs

GPU Software Engineer - Distributed ML Training

The world will be unrecognisable in 5 years.

Machine learning models are driving our cars, testing our eyesight, detecting our cancer, giving sight to the blind, giving speech to the mute, and dictating what we consume, enjoy, and think. These AI systems are already an integral part of our lives and will shape our future as a species.

Soon, we'll conjure unlimited content: from never-ending TV series (where we’re the main character) to personalised tutors that are infinitely patient and leave no student behind. We’ll augment our memories with foundation models—individually tailored to us through RLHF and connected directly to our thoughts via Brain-Machine Interfaces—blurring the lines between organic and machine intelligence and ushering in the next generation of human development.

This future demands immense, globally accessible, uncensorable, computational power. Gensyn is the machine learning compute protocol that translates machine learning compute into an always-on commodity resource—outside of centralised control and as ubiquitous as electricity—accelerating AI progress and ensuring that this revolutionary technology is accessible to all of humanity through a free market.

 

Our Principles:

AUTONOMY

  • Don’t ask for permission - we have a constraint culture, not a permission culture.
  • Claim ownership of any work stream and set its goals/deadlines, rather than waiting to be assigned work or relying on job specs.
  • Push & pull context on your work rather than waiting for information from others and assuming people know what you’re doing.
  • No middle managers - we don’t (and will likely never) have middle managers.

FOCUS

  • Small team - misalignment and politics scale super-linearly with team size. Small protocol teams rival much larger traditional teams.
  • Thin protocol - build and design thinly.
  • Reject waste - guard the company’s time, rather than wasting it in meetings without clear purpose/focus, or bikeshedding.

REJECT MEDIOCRITY

  • Give direct feedback to everyone immediately rather than avoiding unpopularity, expecting things to improve naturally, or trading short-term pain for extreme long-term pain.
  • Embrace an extreme learning rate rather than assuming limits to your ability/knowledge.
  • No quit - push to the final outcome, despite any barriers.

Responsibilities:

  • Develop performant GPU kernels and compute infrastructure - from the framework level (e.g. PyTorch) down to IR representations for training, with a strong emphasis on reproducibility in multi-GPU distributed training environments.
  • Design novel algorithms - with a focus on numerical properties and stable compute flows, optimized for modern cryptographic systems.

Minimum Requirements:

  • Strong software engineering skills - with substantial experience as a practicing software engineer and significant contributions to shipping production-level code.
  • Hands on experience in distributed compute environments:
    • Writing GPU Kernels (e.g. CUDA, PTX, MPX/MLX, IR); and/or
    • Implementing low-level GPU-specific optimizations for performance and numerical stability.
  • In-depth understanding of deep learning - including recent architectural trends, training fundamentals, and practical experience with machine learning frameworks and their internal mechanics (e.g., PyTorch, TensorFlow, scikit-learn).

Nice to haves:

  • Open-source contributions to high-performance GPU codebases.
  • Strong understanding of computer architecture - with expertise in specialized architectures for training neural networks, including Intel Xeon CPUs, GPUs, TPUs, and custom accelerators, as well as heterogeneous systems combining these components.
  • Solid foundation in compiler technology - with a working knowledge of traditional compilers (e.g., LLVM, GCC) and graph traversal algorithms.
  • Experience with deep learning compiler frameworks - such as TVM, MLIR, TensorComprehensions, Triton, and JAX.

Compensation / Benefits:

  • Competitive salary + share of equity and token pool
  • Fully remote work - we hire between the West Coast (PT) and Central Europe (CET) time zones
  • Relocation Assistanceavailable for those that would like to relocate after being hired (anywhere from PST through CET time zones)
  • 4x all expenses paid company retreats around the world, per year
  • Whatever equipment you need
  • Paid sick leave
  • Private health, vision, and dental insurance - including spouse/dependents [🇺🇸 only]

Apply for this job

*

indicates a required field

Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf