About Handshake AI
Handshake is building the career network for the AI economy. Our three-sided marketplace connects 18 million students and alumni, 1,500+ academic institutions across the U.S. and Europe, and 1 million employers to power how the next generation explores careers, builds skills, and gets hired.
Handshake AI is a human data labeling business that leverages the scale of the largest early career network. We work directly with the world’s leading AI research labs to build a new generation of human data products. From PhDs in physics to undergrads fluent in LLMs, Handshake AI is the trusted partner for domain-specific data and evaluation at scale.
This is a unique opportunity to join a fast-growing team shaping the future of AI through better data, better tools, and better systems—for experts, by experts.
Now’s a great time to join Handshake. Here’s why:
-
Leading the AI Career Revolution: Be part of the team redefining work in the AI economy for millions worldwide.
-
Proven Market Demand: Deep employer partnerships across Fortune 500s and the world’s leading AI research labs.
-
World-Class Team: Leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, just to name a few.
-
Capitalized & Scaling: $3.5B valuation from top investors including Kleiner Perkins, True Ventures, Notable Capital, and more.
About the Role
-
Design and implement post-training systems and methodologies in close partnership with research scientists and domain experts
-
Build and maintain infrastructure that supports large-scale model training, specialized data processing, and benchmark evaluation
-
Develop robust frameworks for verifying the quality and integrity of highly specialized domain datasets
-
Create next-generation LLM benchmarks that push the boundaries of model evaluation and capabilities assessment
-
Optimize performance across software and hardware layers to accelerate post-training experimentation and deployment
-
Collaborate across disciplines to ensure rigorous validation of model improvements and benchmark reliability
Desired Capabilities
-
Strong Python programming skills with attention to clean, efficient, and scalable code
-
Experience building and operating large-scale systems for model post-training, specialized data processing, or benchmark evaluation
-
Deep familiarity with PyTorch and modern post-training techniques (RLHF, constitutional AI, etc.)
-
A background in applied machine learning, model evaluation, or large-scale data quality assessment
-
Experience with benchmark design, evaluation methodologies, and performance measurement frameworks
-
Clear communication skills and a collaborative mindset for cross-functional research teams
Extra Credit
-
Experience optimizing deep learning models for performance (e.g., memory usage, training speed)
-
Interest in the societal and ethical impacts of AI technologies
-
Contributions to open-source ML infrastructure or tools
Perks
Handshake delivers benefits that help you feel supported—and thrive at work and in life.
The below benefits are for full-time US employees.
Ownership: Equity in a fast-growing company
Financial Wellness: 401(k) match, competitive compensation, financial coaching
Family Support: Paid parental leave, fertility benefits, parental coaching
Wellbeing: Medical, dental, and vision, mental health support, $500 wellness stipend
Growth: $2,000 learning stipend, ongoing development
Remote & Office: Stipends for home office setup, internet, commuting, and free lunch/gym in our SF office
Time Off: Flexible PTO, 15 holidays + 2 flex days, winter #ShakeBreak where our whole office closes for a week!
Connection: Team outings & referral bonuses
Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers.