Senior Backend Engineer – Real-Time Audiovisual Systems

Salary DKK 40,000 - DKK 60,000
Equity To be negotiated

The role

We’re looking for a Backend Engineer with a passion for real-time audiovisual technologies and AI-driven interactivity. This isn’t a maintenance role—it’s for someone who wants to build from the ground up, architecting systems that handle multimodal streaming, synchronization, and scalable backend infrastructure. If you thrive at the intersection of low-latency media pipelines, browser interactivity, and AI integration, this is your opportunity to help define the future of human–AI interaction.


What you’ll be doing

  • Design, develop, and optimize backend services for real-time audiovisual streaming and synchronization.

  • Build low-latency pipelines for audio/video capture, processing, and distribution across browsers and mobile devices.

  • Collaborate with AI engineers to integrate multimodal models (speech, vision, and text) into scalable backend APIs.

  • Build and manage infrastructure for parallel audio/video stream analysis, ensuring concurrency, threading, and multiprocessing are efficient and reliable.

  • Work closely with frontend teams to enable browser-based interactive media experiences (WebRTC, WebSockets, Web Audio API, MediaStream APIs, WebCodecs, WebTransport).

  • Develop APIs and services that support real-time, human–AI conversational interactivity.

  • Deploy and scale cloud infrastructure with a focus on performance, data privacy, and reliability.

  • Create internal tools, dashboards, and reference implementations that demonstrate real-time AI-driven media capabilities.

  • Stay on top of new developments in real-time media frameworks, edge streaming, and AI-powered applications.


What you bring

You’re a builder with a strong systems mindset, motivated by solving hard technical challenges in real-time multimedia and AI.

Must-haves

  • Backend engineering experience with Python or Rust.

  • Solid understanding of real-time audio/video streaming protocols (WebRTC, RTP/RTSP, HLS/DASH).

  • Hands-on experience working with browser-based audiovisual technologies:

    • JavaScript/TypeScript (for WebRTC, MediaRecorder, Web Audio API, MediaStream API).

    • WebCodecs API for efficient encoding/decoding.

    • WebTransport/QUIC and WebSockets for low-latency data transport.

    • Understanding of WASM (WebAssembly) for high-performance in-browser processing.

  • Proven experience with multi-threading, multiprocessing, and queue systems for low-latency applications.

  • Strong knowledge of cloud infrastructure (GCP or AWS) for scalable deployments.

  • Strong experience with containerization and orchestration (Docker, Kubernetes).

  • CI/CD workflows and Git expertise.

  • Experience deploying secure, production-ready backend systems that handle high throughput.

  • Familiarity with cross-browser media constraints (permission flows, device compatibility, autoplay policies, bandwidth adaptation).

Nice-to-haves

  • Exposure to AI/ML models (speech recognition, computer vision, NLP).

  • Experience with streaming media frameworks (FFmpeg, GStreamer, Janus, Jitsi, mediasoup).

  • Knowledge of real-time interactive systems (gaming, avatars, collaborative tools).

  • Background in infrastructure/architecture (DevOps, MLOps).

  • Familiarity with browser performance profiling tools (Chrome DevTools, WebRTC internals).

  • Awareness of data privacy, compliance, and secure handling of audiovisual data.

  • Startup experience, comfortable with fast iteration and building under uncertainty.


We offer

  • Competitive salary + employee warrants.

  • Remote and hybrid work options—we trust you to get your best work done wherever you are.

  • Deep technical challenges at the cutting edge of AI and real-time media streaming.

  • Opportunity to shape a product from the ground up that’s redefining conversational AI.

  • A team culture that values ambition, collaboration, and sustainable work-life balance.

  • If you’re in Copenhagen, a modern office space with a vibrant community.

For more information or questions please contact us at benjamin@interhuman.ai or phone number +4551913590

Perks and benefits

This job comes with several perks and benefits

Remote work allowed
Remote work allowed

Gamified office
Gamified office

Social gatherings
Social gatherings

Skill development
Skill development

Equity package
Equity package

Mental health support
Mental health support

See all 10 benefits

Working at
Interhuman AI

Interhuman AI is the social intelligence layer for AI, enabling machines to understand the 93% of human communication that happens non-verbally. By analyzing facial expressions, body language, voice tonality, and conversational context in real-time, our technology transforms basic AI interactions into meaningful conversations. Our multimodal perception system works with any AI platform, enhancing interactions by detecting user confusion, engagement, and emotional states that text alone misses. This allows AI to adapt its approach mid-conversation, leading to more effective human-machine communication. Read more about us at https://www.interhuman.ai/

Read more about Interhuman AI

company gallery image