Interhuman AI is building the next generation of social intelligence infrastructure—multimodal AI systems that understand not just what humans say, but how they say it. We're developing models that interpret behavioral signals like hesitation, engagement, confusion, and interest across voice, facial expressions, body language, and natural language - in real time.
We are looking for a Student Researcher to join our AI engineering team. This is not a typical "support" role; it is an invitation to apply the latest research in multimodal evaluation and data synthesis to a production-scale infrastructure. You will work on the practical foundations that allow our models to move from "experimental" to "state-of-the-art" every single week.
What you’ll do
You are gonna own a scoped project that upgrades how we collect, validate, or evaluate complex behavioral-signal data, directly impacting model performance.
Create high-signal benchmarks to track improvements on the most challenging "long-tail" machine learning cases in social interaction.
Improve consistency and auditability across our pipeline so that our research results are trustworthy, repeatable, and scalable.
Contribute to the internal ecosystem of scripts and utilities that accelerate our experiment cycles, result comparison, and artifact organization.
Who we’re looking for
You are currently enrolled in a PhD program in Computer Science, Machine Learning, Data Science, or a related quantitative field (e.g., Physics, Mathematics, Statistics).
Strong proficiency in Python and experience with modern ML frameworks (PyTorch, JAX, or TensorFlow).
Deep familiarity with the current LLM/Multimodal research landscape (e.g., CLIP, Audio-LMs, Video-Language models).
You're comfortable figuring things out independently and don't need constant direction. When you encounter obstacles, you find creative solutions and know when to ask for help.
You can explain technical concepts clearly in writing and speech.
You are good at prioritizing and balancing multiple tasks, and flexible when changes occur.
What we offer
We recognize that your PhD research is your priority. We offer a highly flexible 15-hour work week designed to complement your academic schedule. You will join a high-intensity, venture-backed startup environment where your work isn't just a paper. This is a paid role where you will have direct influence on the technical direction of a pioneering social intelligence platform.
Who we are
At Interhuman AI, we're pioneering multimodal AI that reads the full bandwidth of human communication - facial expressions, vocal tone, body language, and words - to interpret social signals in real time. We're building infrastructure for AI interactions that feel adaptive, emotionally aware, and genuinely human.
We're a small, focused team backed by top investors, with a working MVP and a vision to become foundational infrastructure for the next era of conversational AI.
If you want to do work that matters, at the edge of what's possible, we'd love to hear from you.
This job comes with several perks and benefits
