ARemote Jobs Ace

ClickUp

Senior AI Engineer, Voice Platform

United States

Role brief

What this role is asking for.

At ClickUp, we're building the future of work: the first truly converged AI workspace unifying tasks, docs, chat, calendar, and enterprise search, all supercharged by context-driven AI. We are an AI-native company. Every team member is expected to leverage AI daily, and we evaluate AI fluency as part of our hiring process. Join us and help redefine what's possible. 馃殌 ROLE OVERVIEW You'll own and evolve the AI systems behind ClickUp's voice platform: real-time streaming transcription, intelligent reformatting, context-aware mention detection, and voice-to-action pipelines. This is a high-impact, hands-on role where you'll push the boundaries of what voice interfaces can do inside a productivity tool used by millions. KEY RESPONSIBILITIES - Design, build, and optimize real-time speech-to-text pipelines (streaming ASR, VAD, audio processing) - Improve transcription accuracy through context injection (user names, teams, custom vocabulary, language detection) - Develop and maintain LLM-powered post-processing (grammar correction, filler removal, mention resolution, formatting) - Build voice-to-action systems that parse natural language into structured workspace commands - Evaluate, benchmark, and integrate ASR models (Whisper, AssemblyAI, Fireworks, etc.) for cost, latency, and accuracy - Collaborate with product and platform teams to ship voice features across MAX Desktop, Mobile,

Company role signals

ClickUp role signals.

Repeated tags across 64 active roles show the current hiring pattern.