Skip to main content
Company Startup

AssemblyAI

Founded 2017 · San Francisco, USA · ~$115M raised across 4 rounds

8.3/10 Strong
Flagship

Monthly Up to 185 hrs free pre-recorded + 333 hrs streaming Annual STT from $0.15-$0.21/hr Price Voice Agent API $4.50/hr

Editorial · no paid placements

AssemblyAI is the speech-to-text company behind the Universal model family and developer transcription API, founded 2017. Verified June 16, 2026.

Products by AssemblyAI

  1. AssemblyAI Voice AI platform for speech-to-text, Universal 3.5 Pro preview, streaming transcription, LLM Gateway, guardrails, and voice-agent APIs.
    Up to 185 hrs free pre-recorded + 333 hrs streaming; STT from $0.15-$0.21/hr; Voice Agent API $4.50/hr 8.3/10

AssemblyAI is a speech AI company providing developer-grade speech-to-text and audio intelligence through the AssemblyAI API. Founded in 2017, it focuses on accurate transcription and downstream audio understanding (summarization, topic detection, speaker labels), and its Universal model family is positioned for industry-leading accuracy and reduced hallucination on real-world audio. It has raised about $115 million to date.

Key Facts

Founded2017
HQSan Francisco, USA
FundingAbout $115M raised across 4 rounds
Core productSpeech-to-text API and audio intelligence
ModelsUniversal model family (accuracy-focused)
DeliveryAPI and SDKs for apps and devices
BuyersDevelopers and companies adding transcription
CompetitorsDeepgram, OpenAI Whisper, ElevenLabs, Otter

What They Do

. Developers send audio and get back accurate transcripts plus higher-level features like summarization, sentiment, topic detection, speaker diarization, and content moderation. The company invests heavily in its own Universal speech models, competing on accuracy, latency.

Its strategy is to be the developer default for speech-to-text and audio understanding, an infrastructure layer rather than a consumer app. That puts it in a fast-growing voice-AI market alongside Deepgram and open models like Whisper, where model quality and reliable, well-documented APIs are the main competitive levers.

Current Flagship Products

  • AssemblyAI: The speech-to-text and audio-intelligence API, built on the company’s Universal model family, with SDKs for integration.

Strategic Position

and audio-intelligence features that go beyond raw transcription. Its challenge is a competitive, partly commoditized market: Deepgram competes directly, OpenAI’s Whisper is open and free to self-host, and larger voice platforms bundle transcription. AssemblyAI competes on benchmark-leading accuracy and developer experience.

For AIpedia readers, AssemblyAI matters when the need is production-grade speech-to-text and audio understanding via API, weighed against Deepgram and self-hosted Whisper on accuracy, features, and cost.

Sources

  • AssemblyAI for AIpedia’s canonical product and pricing record.
  • AssemblyAI pricing and product pages for current model and API details.
  • Tracxn and PitchBook for funding context.
Share LinkedIn
Spotted an error or want to share your experience with AssemblyAI?

Every tool page is re-verified on a recurring cycle, and corrections land faster when readers flag them directly. If you spot a stale fact, a missing capability, or have used AssemblyAI and want to share what worked or didn't, the editorial desk reviews every message sent through this form.

Email editorial@aipedia.wiki