About Handshake AI

Handshake is building the career network for the AI economy. Our three-sided marketplace connects 18 million students and alumni, 1,500+ academic institutions across the U.S. and Europe, and 1 million employers to power how the next generation explores careers, builds skills, and gets hired.

Handshake AI is a human data labeling business that leverages the scale of the largest early career network. We work directly with the world’s leading AI research labs to build a new generation of human data products. From PhDs in physics to undergrads fluent in LLMs, Handshake AI is the trusted partner for domain-specific data and evaluation at scale.

This is a unique opportunity to join a fast-growing team shaping the future of AI through better data, better tools, and better systems—for experts, by experts.

Now’s a great time to join Handshake. Here’s why:

Leading the AI Career Revolution: Be part of the team redefining work in the AI economy for millions worldwide.
Proven Market Demand: Deep employer partnerships across Fortune 500s and the world’s leading AI research labs.
World-Class Team: Leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, just to name a few.
Capitalized & Scaling: $3.5B valuation, backed by top investors including Kleiner Perkins, True Ventures, Notable Capital, and more.

About the Role

Design and implement post-training systems and methodologies in close partnership with research scientists and domain experts
Build and maintain infrastructure that supports large-scale model training, specialized data processing, and benchmark evaluation
Develop robust frameworks for verifying the quality and integrity of highly specialized domain datasets
Create next-generation LLM benchmarks that push the boundaries of model evaluation and capabilities assessment
Optimize performance across software and hardware layers to accelerate post-training experimentation and deployment
Collaborate across disciplines to ensure rigorous validation of model improvements and benchmark reliability

Desired Capabilities

Strong Python programming skills with attention to clean, efficient, and scalable code
Experience building and operating large-scale systems for model post-training, specialized data processing, or benchmark evaluation
Deep familiarity with PyTorch and modern post-training techniques (RLHF, constitutional AI, etc.)
A background in applied machine learning, model evaluation, or large-scale data quality assessment
Experience with benchmark design, evaluation methodologies, and performance measurement frameworks
Clear communication skills and a collaborative mindset for cross-functional research teams

Extra Credit

Experience optimizing deep learning models for performance (e.g., memory usage, training speed)
Interest in the societal and ethical impacts of AI technologies
Contributions to open-source ML infrastructure or tools

Perks

Handshake delivers benefits that help you feel supported—and thrive at work and in life.

The benefits below are for full-time US employees.

Ownership: Equity in a fast-growing company

Financial Wellness: 401(k) match, competitive compensation, financial coaching

Family Support: Paid parental leave, fertility benefits, parental coaching

Wellbeing: Medical, dental, and vision coverage; mental health support; $500 wellness stipend

Growth: $2,000 learning stipend, ongoing development

Remote & Office: Stipends for home office setup, internet, and commuting, plus free lunch and gym in our SF office

Time Off: Flexible PTO, 15 holidays + 2 flex days, and a winter #ShakeBreak, when our whole office closes for a week!

Connection: Team outings & referral bonuses

Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers.

AI Research Engineer, Handshake AI

Handshake • Remote • 19 days ago
