PhD Student Researcher (Multimodal ML)

Salary DKK 8,000 - DKK 10,000

Interhuman AI is building the next generation of social intelligence infrastructure—multimodal AI systems that understand not just what humans say, but how they say it. We're developing models that interpret behavioral signals like hesitation, engagement, confusion, and interest across voice, facial expressions, body language, and natural language - in real time.


We are looking for a Student Researcher to join our AI engineering team. This is not a typical "support" role; it is an invitation to apply the latest research in multimodal evaluation and data synthesis to a production-scale infrastructure. You will work on the practical foundations that allow our models to move from "experimental" to "state-of-the-art" every single week.


What you’ll do


  • You are gonna own a scoped project that upgrades how we collect, validate, or evaluate complex behavioral-signal data, directly impacting model performance.

  • Create high-signal benchmarks to track improvements on the most challenging "long-tail" machine learning cases in social interaction.

  • Improve consistency and auditability across our pipeline so that our research results are trustworthy, repeatable, and scalable.

  • Contribute to the internal ecosystem of scripts and utilities that accelerate our experiment cycles, result comparison, and artifact organization.


Who we’re looking for


  • You are currently enrolled in a PhD program in Computer Science, Machine Learning, Data Science, or a related quantitative field (e.g., Physics, Mathematics, Statistics).

  • Strong proficiency in Python and experience with modern ML frameworks (PyTorch, JAX, or TensorFlow).

  • Deep familiarity with the current LLM/Multimodal research landscape (e.g., CLIP, Audio-LMs, Video-Language models).

  • You're comfortable figuring things out independently and don't need constant direction. When you encounter obstacles, you find creative solutions and know when to ask for help.

  • You can explain technical concepts clearly in writing and speech.

  • You are good at prioritizing and balancing multiple tasks, and flexible when changes occur.


What we offer


We recognize that your PhD research is your priority. We offer a highly flexible 15-hour work week designed to complement your academic schedule. You will join a high-intensity, venture-backed startup environment where your work isn't just a paper. This is a paid role where you will have direct influence on the technical direction of a pioneering social intelligence platform.


Who we are


At Interhuman AI, we're pioneering multimodal AI that reads the full bandwidth of human communication - facial expressions, vocal tone, body language, and words - to interpret social signals in real time. We're building infrastructure for AI interactions that feel adaptive, emotionally aware, and genuinely human.

We're a small, focused team backed by top investors, with a working MVP and a vision to become foundational infrastructure for the next era of conversational AI.


If you want to do work that matters, at the edge of what's possible, we'd love to hear from you.


For more information or questions please contact us at sid@interhuman.ai

Perks and benefits

This job comes with several perks and benefits

Remote work allowed
Remote work allowed

Gamified office
Gamified office

Social gatherings
Social gatherings

Skill development
Skill development

Equity package
Equity package

Mental health support
Mental health support

See all 10 benefits

Working at
Interhuman AI

Interhuman AI is the social intelligence layer for AI, enabling machines to understand the 93% of human communication that happens non-verbally. By analyzing facial expressions, body language, voice tonality, and conversational context in real-time, our technology transforms basic AI interactions into meaningful conversations. Our multimodal perception system works with any AI platform, enhancing interactions by detecting user confusion, engagement, and emotional states that text alone misses. This allows AI to adapt its approach mid-conversation, leading to more effective human-machine communication. Read more about us at https://www.interhuman.ai/

Read more about Interhuman AI

company gallery image