Published 19 Aug 2024 2 minutes read
Last Updated 04 Oct 2024

New AI Voices and Interfaces Unveiled

This blog explores AI voices and communication advancements, such as ChatGPT's new voice mode and Gemini AI's expanded features in Gmail and Google Docs, highlighting practical uses and limitations.

General

New Voices in AI Communication

AI chatbot and professional voice thief ChatGPT is back with a dulcet set of new digital vocal cords for its Advanced Voice Mode. This might seem familiar: as previously reported, ChatGPT gave actor Scarlett Johansson the digital Ursula treatment earlier this year.

OpenAI has trained the model to speak in only four preset voices, will “block outputs that differ from those voices,” and has “implemented guardrails to block requests for violent or copyrighted content.” The result is a tightly controlled environment.

I’m definitely curious about these new voices. How un-ScarJo will they be? Will there be a masculine option? Accents? These presets are rolling out to a small cohort, with a wider launch planned for the fall.

Legal Issues and Practicality

Citing personal reasons, Johansson declined to lend her voice. However, the refusal didn’t stop OpenAI, led by CEO Sam Altman, from allegedly using a voice strikingly similar to hers without permission. Consequently, Johansson got lawyers involved, delaying Advanced Voice Mode while an alternative voice was found.

Regardless of these legal speed bumps, the capability to have “more natural, real-time conversations” is an appealing feature. That said, some users may find this mode less appealing for everyday use.

Revamped Interfaces with Google’s Gemini AI

Generative AI is expanding its reach, turning up in Gmail, Google Docs, and a host of other applications, as detailed in this Gizmodo article. The addition of Gemini AI in Gmail allows users to effortlessly compose emails through detailed prompts.

However, this feature is currently available only to Google Workspace or Google One AI Premium plan subscribers, though it’s expected to become more widely accessible in the future.

Gemini Usage in Gmail

Artificial intelligence has been present in Gmail for a while, thanks to Smart Reply and Smart Compose. However, Gemini takes text composition to new heights. Start a new email, and an icon showing a pen with a star lets you generate content by entering a prompt, which speeds up email creation.

Once Gemini processes the prompt, you can rate the results with a thumbs up or down. Click Insert to accept the generated content or Refine to tweak it. Existing email text can be refined through the same process.

Even though it can generate clear text, its practical usage might be limited. AI-generated emails are best suited for administrative purposes or non-critical communications.

Gemini in Google Docs

Gemini AI isn’t restricted to Gmail. In Google Docs, it offers ‘Help me write’ prompts. This allows users to produce content on a variety of topics effortlessly.

The Gemini AI button follows you, ready to assist in producing sentences or blocks of text. Moreover, it excels in summary creation and text refinement, making it an excellent tool for rewriting or summarizing documents.

For major text generation, limitations still exist. Many prefer their own creatively written content. However, as an aid for rewriting, summarizing, and ideation, Gemini AI shows considerable promise. In short, it’s more a writing assistant than a complete replacement.
