A leading global investment and research firm is looking for a highly technical C++/Python Engineer to design and optimize machine learning infrastructure and compute frameworks that support large-scale quantitative research initiatives. This role requires deep systems-level expertise, a strong understanding of hardware/software co-design, and a passion for pushing the limits of CPU and GPU performance.
Responsibilities:
- Architect and implement high-performance C++ libraries and Python bindings to accelerate ML model development, training, and inference.
- Perform low-level optimizations across CPU and GPU architectures (SIMD vectorization, multithreading, CUDA kernels, memory hierarchy tuning).
- Build scalable distributed compute solutions, optimizing throughput and latency for intensive ML workflows.
- Profile and benchmark system performance, identifying and eliminating bottlenecks at the OS, compiler, and hardware levels.
- Collaborate directly with ML researchers, data engineers, and systems teams to deliver infrastructure capable of supporting cutting-edge quantitative models at scale.
- Contribute to the design of systems for real-time and batch data processing across large clusters.
Qualifications:
- 5+ years of experience developing high-performance software in modern C++ (C++14/17/20) and Python.
- (Nice to have) Deep understanding of CPU microarchitecture, memory models, NUMA, threading, and vectorization (AVX2/AVX-512).
- (Nice to have) Proven expertise optimizing GPU workloads using CUDA, Triton, or equivalent.
- Strong familiarity with ML frameworks such as PyTorch, TensorFlow, and custom kernel development.
- Experience working with distributed systems and frameworks such as NCCL, gRPC, MPI, or similar.
- Advanced degree (Master’s or PhD) in Computer Science, Electrical Engineering, Applied Mathematics, or a related technical field is preferred.
- Bonus: Contributions to open-source HPC/ML libraries, experience with compiler optimizations, or custom silicon/SOC acceleration work.
What We Offer:
- Work with some of the brightest minds in quantitative research and systems engineering.
- Direct access to state-of-the-art compute infrastructure (CPU clusters, GPU supercomputers).
- Competitive base salary, exceptional performance bonuses, and long-term career growth opportunities.
- A culture that values deep technical expertise, innovation, and technical ownership.
If you're passionate about building world-class ML systems and pushing hardware to its limits, we invite you to explore this opportunity.