We’re looking for a Backend Engineer with a passion for real-time audiovisual technologies and AI-driven interactivity. This isn’t a maintenance role—it’s for someone who wants to build from the ground up, architecting systems that handle multimodal streaming, synchronization, and scalable backend infrastructure. If you thrive at the intersection of low-latency media pipelines, browser interactivity, and AI integration, this is your opportunity to help define the future of human–AI interaction.
Design, develop, and optimize backend services for real-time audiovisual streaming and synchronization.
Build low-latency pipelines for audio/video capture, processing, and distribution across browsers and mobile devices.
Collaborate with AI engineers to integrate multimodal models (speech, vision, and text) into scalable backend APIs.
Build and manage infrastructure for parallel audio/video stream analysis, ensuring concurrency, threading, and multiprocessing are efficient and reliable.
Work closely with frontend teams to enable browser-based interactive media experiences (WebRTC, WebSockets, Web Audio API, MediaStream APIs, WebCodecs, WebTransport).
Develop APIs and services that support real-time, human–AI conversational interactivity.
Deploy and scale cloud infrastructure with a focus on performance, data privacy, and reliability.
Create internal tools, dashboards, and reference implementations that demonstrate real-time AI-driven media capabilities.
Stay on top of new developments in real-time media frameworks, edge streaming, and AI-powered applications.
You’re a builder with a strong systems mindset, motivated by solving hard technical challenges in real-time multimedia and AI.
Must-haves
Backend engineering experience with Python or Rust.
Solid understanding of real-time audio/video streaming protocols (WebRTC, RTP/RTSP, HLS/DASH).
Hands-on experience working with browser-based audiovisual technologies:
JavaScript/TypeScript (for WebRTC, MediaRecorder, Web Audio API, MediaStream API).
WebCodecs API for efficient encoding/decoding.
WebTransport/QUIC and WebSockets for low-latency data transport.
Understanding of WASM (WebAssembly) for high-performance in-browser processing.
Proven experience with multi-threading, multiprocessing, and queue systems for low-latency applications.
Strong knowledge of cloud infrastructure (GCP or AWS) for scalable deployments.
Strong experience with containerization and orchestration (Docker, Kubernetes).
CI/CD workflows and Git expertise.
Experience deploying secure, production-ready backend systems that handle high throughput.
Familiarity with cross-browser media constraints (permission flows, device compatibility, autoplay policies, bandwidth adaptation).
Nice-to-haves
Exposure to AI/ML models (speech recognition, computer vision, NLP).
Experience with streaming media frameworks (FFmpeg, GStreamer, Janus, Jitsi, mediasoup).
Knowledge of real-time interactive systems (gaming, avatars, collaborative tools).
Background in infrastructure/architecture (DevOps, MLOps).
Familiarity with browser performance profiling tools (Chrome DevTools, WebRTC internals).
Awareness of data privacy, compliance, and secure handling of audiovisual data.
Startup experience, comfortable with fast iteration and building under uncertainty.
Competitive salary + employee warrants.
Remote and hybrid work options—we trust you to get your best work done wherever you are.
Deep technical challenges at the cutting edge of AI and real-time media streaming.
Opportunity to shape a product from the ground up that’s redefining conversational AI.
A team culture that values ambition, collaboration, and sustainable work-life balance.
If you’re in Copenhagen, a modern office space with a vibrant community.
This job comes with several perks and benefits