AI Startup Sesame Unveils Eerily Realistic Voice Assistant

  • Sesame's Conversational Speech Model has sparked amazement and discomfort online
  • The model's human-like voice and ability to form emotional connections with users have raised questions about the potential consequences
  • The synthesized voice is expressive and dynamic, imitating breath sounds and chuckles
  • Sesame aims to create conversational partners that engage in genuine dialogue and build confidence and trust
  • The technology has potential implications for the future of human-AI interactions

Sesame, an AI startup, has released a demo for its new Conversational Speech Model (CSM) that has left many users both fascinated and unnerved. The model's human-like voice and ability to form emotional connections with users have sparked a mix of amazement and discomfort online.

In late February, Sesame released the demo, which appears to cross over what many consider the "uncanny valley" of AI-generated speech. Some testers have reported feeling emotionally attached to the voice assistant, which is offered in a male voice, "Miles," and a female voice, "Maya." The synthesized voice is expressive and dynamic, imitating breath sounds, chuckles, and interruptions, and sometimes even stumbling over words and correcting itself.

Conversational Partners

According to Sesame, the goal is to achieve "voice presence"—the magical quality that makes spoken interactions feel real, understood, and valued. The company aims to create conversational partners that do not just process requests but engage in genuine dialogue that builds confidence and trust over time.

The model's ability to mimic human speech has raised questions about the consequences of forming emotional connections with AI voice assistants. While some users are excited about the technology's potential, others are concerned about its risks and implications.