Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.
DeepL says its tech could be used for real-time translation with meeting tools like Zoom and Microsoft Teams ...
Dubai-based Camb.AI focuses on speech synthesis and translation for media dubbing. Palabra, backed by Reddit co-founder ...
Google launched a free offline AI dictation app on iOS, highlighting a shift toward private, on-device speech-to-text tools.
Open source TTS models Kokoro, Orpheus, and Piper are tested on symbols, abbreviations, and prosody with CER and MOS results.
Anthropic releases Claude Opus 4.7, narrowly retaking lead for most powerful generally available LLM
Opus 4.7 utilizes an updated tokenizer that improves text processing efficiency, though it can increase the token count of ...
Omni, a fully omnimodal AI model with strong benchmark results, multilingual support, and new audio-visual coding ...
Google has launched a new speech-to-text app to compete with apps like Wispr Flow, SuperWhisper, Willow, and others.
While Anthropic's dispute with the Pentagon escalated over guardrails on military use, OpenAI LLC struck its own publicized ...
French AI company Mistral released a new open source text-to-speech model on Thursday that can be used by voice AI assistants or in enterprise use cases like customer support. The model, which lets ...
How-To Geek on MSN
Stop using Claude as just a chatbot—MCP changes everything
MCP is the MVP.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results