Careers

Help voice AI keep its promises.

We're a small, fully remote team across San Francisco, London and Malta, backed by Y Combinator, F-Prime Capital, True Ventures, Liquid 2 Ventures and MTV. We're hiring two founding engineers.

How we work

Small team, whole problems

Five people, no layers. You own a surface end-to-end — from the schema to the pixel to the customer call where you find out if it worked.

Ship weekly, learn daily

Our customers run voice agents in production; when something breaks for them it breaks loudly. We ship small, watch the metrics move, and fix fast.

Remote with real overlap

Fully remote across San Francisco, London and Malta. We keep a few hours of shared overlap, write things down, and fly everyone together a few times a year.

Taste matters

We sweat the pixel and the p95 alike. The product is opinionated and the codebase is too — TypeScript everywhere, one way to do things, no drive-by abstractions.

Open roles

Two seats at the founding table.

Founding Software Engineer — Product

Product engineering · Remote · US or Europe

Apply

Own the surfaces our customers live in — the platform where calls are scored, issues are filed and fixes are proven. You will take features from a customer sentence to production, and you will talk to the people using what you shipped.

What you'll do

  • Design and build product surfaces end-to-end: Next.js/React frontend, GraphQL API, Postgres data model — schema to pixel.
  • Turn messy real-world workflows (test suites, issue triage, run reviews) into interfaces that feel obvious in hindsight.
  • Work directly with design on a system that takes craft seriously — and push back when something reads wrong.
  • Sit in on customer calls, watch people use your features, and ship the fix the same week.
  • Raise the bar on quality: tests where they earn their keep, instrumentation everywhere, no silent failures.

What you'll bring

  • 4+ years building production web applications, with deep TypeScript and React experience.
  • Comfort owning a feature across the whole stack — API design, data modelling, and UI polish.
  • A portfolio (or war stories) showing product taste: you notice spacing, latency and empty states.
  • Bias to ship: you would rather put a small version in front of a user this week than a big one next month.
  • Clear written communication — we are remote and we write things down.

Nice to have

  • Experience with GraphQL (Pothos/Yoga), Drizzle, or event-driven backends.
  • You have built developer-facing tools, dashboards or observability products before.

Stack: TypeScript · Next.js · React · GraphQL · Postgres (Drizzle) · Bun · AWS (SST)

Founding Software Engineer — AI & Voice Systems

AI & infrastructure · Remote · US or Europe

Apply

Own the machinery that makes Roark honest: the evaluation pipelines that score every call, the audio-native metrics that hear what transcripts miss, and the simulation engine that dials agents like a real caller. This is the product — if it is wrong, everything above it is wrong.

What you'll do

  • Build and scale evaluation pipelines that score thousands of production calls in near real time.
  • Work on audio-native metrics — pronunciation, emotion, vocal stress — alongside LLM-based evaluators, and prove they agree with human judgment.
  • Evolve the simulation engine: realistic personas, accents and noise, dialing real agents over the phone and WebRTC.
  • Own analytics at scale: ClickHouse pipelines, DynamoDB streams, and the query layer that keeps dashboards honest.
  • Orchestrate long-running work with Temporal — replays, backfills and load tests that converge instead of double-count.

What you'll bring

  • 4+ years building backend or ML/audio systems in production, ideally in TypeScript or Python.
  • Experience with at least one of: speech/audio processing, LLM evaluation pipelines, or high-volume data infrastructure.
  • Rigour about correctness — you think in idempotency, replays and ground truth, not just happy paths.
  • Comfort operating what you build: dashboards, alerts, and the 2am curiosity to find out why.
  • Clear written communication — designs and post-mortems people actually want to read.

Nice to have

  • Hands-on time with telephony/WebRTC (Twilio, LiveKit, SIP) or TTS/ASR systems.
  • ClickHouse, Temporal, or stream-processing experience at scale.

Stack: TypeScript · Bun · AWS (SST) · ClickHouse · DynamoDB · Temporal · Postgres · WebRTC/telephony

Don't fit either box but convinced you belong here? Tell us why — founders@roark.ai. We read everything, and we reply fast. Learn more about the team →

Bring a recording.
We’ll score it live.

See your own agent measured on the audio it actually produced — in the demo, in real time. Stop guessing whether your voice AI works.

founders@roark.ai · we reply fast