Back to jobs

Senior Simulation Data Engineer

London, United Kingdom

About us

PhysicsX is a deep-tech company with roots in numerical physics and Formula One, dedicated to accelerating hardware innovation at the speed of software.
We are building an AI-driven simulation software stack for engineering and manufacturing across advanced industries. By enabling high-fidelity, multi-physics simulation through AI inference across the entire engineering lifecycle, PhysicsX unlocks new levels of optimization and automation in design, manufacturing, and operations — empowering engineers to push the boundaries of possibility. Our customers include leading innovators in Aerospace & Defense, Materials, Energy, Semiconductors, and Automotive.

Note: We are currently recruiting for multiple positions, however please only apply for the role that best aligns with your skillset and career goals.

The Role

The Senior Simulation Data Engineer will extend and operate the infrastructure that powers our research Data Factory. You will be responsible for the end-to-end pipeline: from geometry preparation and simulation orchestration through validation, post-processing, and delivery to downstream ML training systems, using PhysicsX platform orchestration services where synergies exist.

This role sits at the intersection of HPC engineering and data engineering. You will orchestrate long-running CFD simulations at scale, build robust data pipelines, and ensure that every simulation we produce meets rigorous quality standards.

Team Context

In this role, you will be vertically embedded in Research , working daily with:

  • Research Scientists who define data requirements and quality standards
  • ML Engineers who consume Data Factory outputs for model training
  • ML Infrastructure Engineers who are accountable for downstream training infrastructure

You will have end-to-end responsibilities over the Data Factory, with the autonomy to make architectural decisions and the responsibility to keep data flowing reliably.

Horizontally, you will be part of an infrastructure engineering group responsible for infrastructure across the company.

What you will do

Simulation Orchestration

  • Extend and operate the Data Factory infrastructure that orchestrates thousands of CFD simulations per day on cloud compute
  • Design and operate job scheduling systems that maximize throughput while handling failures gracefully
  • Build monitoring and alerting to detect simulation failures, convergence issues, and resource bottlenecks early

Data Pipeline Engineering

  • Build high-performance data pipelines that move simulation outputs from solver results to ML-ready training data
  • Implement geometry preprocessing workflows (mesh preparation, morphing, watertightness validation)
  • Design and operate post-processing pipelines: surface decimation, field interpolation, format conversion
  • Optimize I/O performance for large mesh datasets

Data Quality and Validation

  • Implement comprehensive validation checks at every pipeline stage: solver convergence, physical field bounds, post-processing fidelity
  • Build systems that capture and quarantine bad data before they reach training pipelines
  • Track and report data quality metrics across the entire Data Factory
  • Work towards full provenance: training samples should be traceable back to their source geometry and simulation configuration

Integration and Delivery

  • Deliver validated datasets to downstream ML training infrastructure in formats optimized for efficient data loading
  • Design data versioning and cataloging systems that support reproducible training runs
  • Work closely with ML Infrastructure Engineers to ensure smooth handoff between data production and model training
  • Support multi-dataset training workflows

What you bring to the table

  • Ability to scope and effectively deliver projects, prioritising activity as needed.
  • Problem-solving skills and the ability to analyse issues, identify causes, and recommend solutions quickly.
  • Excellent collaboration and communication skills, especially in a research setting. You can translate "the model isn't converging" into infrastructure hypotheses and solutions, and can bridge technical abstractions with implementations.
  • 5+ years of experience in data engineering, HPC engineering, or simulation infrastructure.
    • Strong experience with orchestration systems: SLURM, Kubernetes, Temporal
    • Production data pipeline experience: you've built and operated pipelines that process large volumes of data reliably
    • Proficiency in Python for pipeline development and automation
    • Systems engineering fundamentals: Linux, networking, storage systems, performance debugging
    • Experience with cloud infrastructure; ****ideally CoreWeave or similar GPU/HPC-focused clouds
    • Background in HPC for simulation engineering: experience with CFD, FEA, or similar computational workflows (StarCCM+, OpenFOAM, ANSYS, etc.)
    • Experience with geometry processing: mesh manipulation, CAD formats, PyVista
    • Familiarity with scientific data formats: HDF5, VTK, NetCDF, Zarr
    • Data quality engineering experience: validation frameworks, anomaly detection, data observability

Ideally

  • Understanding of CFD fundamentals, enough to interpret solver outputs and validation metrics
  • Experience with 3D geometry pipelines (mesh decimation, field interpolation)
  • Familiarity with ML data loading patterns and how training systems consume data

What we offer

Build what actually matters

Help shape an AI-native engineering company at a formative stage, tackling problems that genuinely matter for industry and society. This is work with real-world impact - and something you can be proud to stand behind.

Learn alongside exceptional people

Work with a high-caliber, collaborative team of engineers, scientists, and operators who care deeply about doing great work, and about helping each other get better. We come from diverse backgrounds, but we share a commitment to operating at the highest level and addressing some of the most complex challenges out there. If you’re ambitious, thoughtful, and driven by impact, you’ll feel at home.

Influence over hierarchy

We operate with a flat structure: good ideas win - wherever they come from. Questioning assumptions and challenging the status quo isn’t just welcomed, it’s expected.

Sustainable pace, long-term ambition

Building meaningful technology is a marathon, not a sprint. We believe in balancing focused, ambitious work with a life beyond it. Our hybrid model blends time together in our Shoreditch office with work-from-home days, giving you the flexibility to work sustainably while staying connected in person.

And it doesn’t stop there …

🚀 Equity options - share meaningfully in the company you’re helping to build.

🏦 10% employer pension contribution - because investing in future matters.

🍽️ Free office lunches - to keep you energised and focused.

👶 Enhanced parental leave - 3 months full pay paternity and 6 months full pay maternity leave, to provide extra flexibility during the moments that matter most.

🍼 YellowNest nursery scheme - to help working parents manage childcare costs.

☀️ 25 days of Annual Leave (+ Public Holidays) - because taking time to rest matters.

🏥 Private medical insurance - 100% employee cover, giving you complete peace of mind.

💪 Wellhub Subscription - gain access to thousands of gyms, classes and wellness apps, supporting both physical and mental wellbeing.

👀 Eye tests - because good work depends on good health.

📈 Personal development - dedicated support for learning, development, and leveling up over time.

💛 Employee Assistance Programme (EAP) - confidential wellbeing support, available whenever you need it.

🚲 Bike2Work scheme and 🚆 Season ticket loan - to make getting to work easier and greener.

🚗 Octopus EV salary sacrifice - for a simpler, more sustainable way to drive electric.

🔎 Watch this space, we’re continuing to build this as we grow…

 
We value diversity and are committed to equal employment opportunity regardless of sex, race, religion, ethnicity, nationality, disability, age, sexual orientation or gender identity. We strongly encourage individuals from groups traditionally underrepresented in tech to apply. To help make a change, we sponsor bright women from disadvantaged backgrounds through their university degrees in science and mathematics. 
 
We collect diversity and inclusion data solely for the purpose of monitoring the effectiveness of our equal opportunities policies and ensuring compliance with UK employment and equality legislation. This information is confidential, used only in aggregate form, and will not influence the outcome of your application. 
 

Apply for this job

*

indicates a required field

Phone
Resume/CV*

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Select...
Select...

Please share a specific annual base salary expectation. This won't be used to screen you out, but "negotiable" or "open to discuss" aren't helpful here.

Select...
Select...
Select...

Demographic reporting - EMEA

Capturing data on our gender balance in our hiring process in order to measure the success of our DE&I hiring efforts. 

Select...