🚀 In Case You Missed This Breathtaking News! 🚀
Introducing Moshi, Kyutai’s real-time voice assistant! Developed by our 8-member team in just 6 months, Moshi is set to revolutionize voice interaction.
🔍 Key Features:
• Multimodal LM: Speech in, speech out.
• Fast Processing: Achieves 160ms latency.
• Helium 7B: Our powerful base text language model.
• Mimi Codec: In-house VQ-VAE with 300x compression.
• Expressive TTS: 70 emotions and styles supported.
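For a rough sense of what 300x compression means in practice, here is a small back-of-the-envelope sketch. The input format (24 kHz, 16-bit, mono PCM) is an assumption made for illustration and is not stated in the post; the function name is hypothetical.

```python
def compressed_bitrate(sample_rate_hz: int, bits_per_sample: int,
                       channels: int, ratio: float) -> float:
    """Return the bitrate (bits/s) left after compressing raw PCM by `ratio`."""
    raw_bps = sample_rate_hz * bits_per_sample * channels  # uncompressed bitrate
    return raw_bps / ratio


# Assumed input: 24 kHz, 16-bit, mono PCM (illustrative, not from the post).
raw_bps = 24_000 * 16 * 1
codec_bps = compressed_bitrate(24_000, 16, 1, ratio=300)

print(f"raw:        {raw_bps / 1000:.1f} kbit/s")
print(f"compressed: {codec_bps / 1000:.2f} kbit/s")
```

Under those assumptions, the audio stream shrinks from 384 kbit/s to roughly 1.3 kbit/s; a different sample rate or bit depth would change the figure proportionally.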
🔧 Training & Safety:
• Fine-Tuned: Trained on 100K detailed transcripts.
• Quick Adaptation: Can be fine-tuned with under 30 minutes of audio.
• On-Device: Runs on laptops/consumer GPUs, no internet needed.
🌐 This breakthrough could transform human-machine interaction, support people with disabilities, assist in research, and more! Experience the future of voice assistants now!
Kyutai unveils today the very first voice-enabled AI openly accessible to all
Moshi AI: Real-Time Personal AI Voice Assistant – Beats GPT-4o!