AI Research Engineer, Handshake AI

Handshake • Remote • 19 days ago

About Handshake AI

Handshake is building the career network for the AI economy. Our three-sided marketplace connects 18 million students and alumni, 1,500+ academic institutions across the U.S. and Europe, and 1 million employers to power how the next generation explores careers, builds skills, and gets hired.

Handshake AI is a human data labeling business that leverages the scale of the largest early career network. We work directly with the world’s leading AI research labs to build a new generation of human data products. From PhDs in physics to undergrads fluent in LLMs, Handshake AI is the trusted partner for domain-specific data and evaluation at scale.

This is a unique opportunity to join a fast-growing team shaping the future of AI through better data, better tools, and better systems—for experts, by experts.

Now’s a great time to join Handshake. Here’s why:

  • Leading the AI Career Revolution: Be part of the team redefining work in the AI economy for millions worldwide.

  • Proven Market Demand: Deep employer partnerships across Fortune 500s and the world’s leading AI research labs.

  • World-Class Team: Leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, just to name a few.

  • Capitalized & Scaling: $3.5B valuation from top investors including Kleiner Perkins, True Ventures, Notable Capital, and more.

About the Role

  • Design and implement post-training systems and methodologies in close partnership with research scientists and domain experts

  • Build and maintain infrastructure that supports large-scale model training, specialized data processing, and benchmark evaluation

  • Develop robust frameworks for verifying the quality and integrity of highly specialized domain datasets

  • Create next-generation LLM benchmarks that push the boundaries of model evaluation and capabilities assessment

  • Optimize performance across software and hardware layers to accelerate post-training experimentation and deployment

  • Collaborate across disciplines to ensure rigorous validation of model improvements and benchmark reliability

Desired Capabilities

  • Strong Python programming skills with attention to clean, efficient, and scalable code

  • Experience building and operating large-scale systems for model post-training, specialized data processing, or benchmark evaluation

  • Deep familiarity with PyTorch and modern post-training techniques (RLHF, constitutional AI, etc.)

  • A background in applied machine learning, model evaluation, or large-scale data quality assessment

  • Experience with benchmark design, evaluation methodologies, and performance measurement frameworks

  • Clear communication skills and a collaborative mindset for cross-functional research teams

Extra Credit

  • Experience optimizing deep learning models for performance (e.g., memory usage, training speed)

  • Interest in the societal and ethical impacts of AI technologies

  • Contributions to open-source ML infrastructure or tools

Perks

Handshake delivers benefits that help you feel supported—and thrive at work and in life.

The below benefits are for full-time US employees.

Ownership: Equity in a fast-growing company

Financial Wellness: 401(k) match, competitive compensation, financial coaching

Family Support: Paid parental leave, fertility benefits, parental coaching

Wellbeing: Medical, dental, and vision, mental health support, $500 wellness stipend

Growth: $2,000 learning stipend, ongoing development

Remote & Office: Stipends for home office setup, internet, commuting, and free lunch/gym in our SF office

Time Off: Flexible PTO, 15 holidays + 2 flex days, winter #ShakeBreak where our whole office closes for a week!

Connection: Team outings & referral bonuses

Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers.

Related jobs in Remote