×
Register Here to Apply for Jobs or Post Jobs. X

C++ Engineer — Pushing Speed of Light | Ultra-Low Latency Trading Systems

Job in New York, New York County, New York, 10261, USA
Listing for: Mondrian Alpha
Full Time position
Listed on 2026-05-19
Job specializations:
  • Software Development
    Software Engineer, C++ Developer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: C++ Engineer — Pushing the Speed of Light | Ultra-Low Latency Trading Systems
Location: New York

We are partnering with one of the world’s most technically ambitious proprietary trading firms - a group rebuilding their entire trading platform from the metal up to operate at the physical limits of modern hardware. This isn’t an incremental improvement. It’s a total re-architecture of the fastest system on the planet, where every microsecond is contested ground and every cache miss is a bug.

Their engineers operate where nanoseconds decide P&L — measured, profiled, and deployed in live markets where performance is the edge.

They’re now seeking an elite C++ Engineer capable of designing and optimising the core of a real-time execution platform — a system that ingests millions of market events per second and reacts deterministically, faster than anyone else on Earth.

The successful engineer will:

  • Architect zero-GC, lock-free pipelines built around ring buffers and cache-aligned data structures.
  • Develop custom kernel-bypass network stacks using DPDK, RDMA, and Solarflare Onload
    , tuned to single-digit microsecond latency.
  • Engineer branch-prediction-aware order handlers and SIMD-vectorized pricing logic in AVX-512
    .
  • Deliver next-tick telemetry with nanosecond-precision timestamps and cross-core synchronization.
  • Collaborate with FPGA specialists to merge hardware precision with software agility.

The Toolkit

  • Modern C++20/23
    , template metaprogramming, constexpr, inline assembly when necessary.
  • Profiling and optimization using perf
    , VTune
    , bcc
    , and Flame Graphs
    .
  • Deep knowledge of NUMA-aware design
    , memory fences, and lock-free concurrency.
  • Expertise in custom allocator design
    , branchless algorithms
    , and profile-guided optimization
    .
  • A habit of benchmarking rather than assuming — data, not theory.

Ideal Background

  • Proven experience building ultra-low-latency systems in trading, gaming, or networking.
  • Deep understanding of CPU architecture
    , from cache hierarchies to speculative execution.
  • The mindset of someone who thinks in nanoseconds and measures in CPU cycles
    .
  • A record of winning battles with compilers, kernels, and performance bottlenecks.

This firm operates on a flat structure — no committees, no bureaucracy, no excuses. Engineering, hardware, and trading sit shoulder-to-shoulder. Code that’s 10ns faster doesn’t just run better — it changes the business.

If you believe latency is the final frontier, and profiling is the only truth, this is the environment you’ve been building toward.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary