🤖 Google Robots Take Over Office

PLUS: French AI Startup Launches ‘Moshi’

Welcome, AI enthusiasts!

Today’s AI insights:

  • 🍎 Google Gemini AI makes its robots smarter

  • 🇫🇷 French AI startup launches ‘Moshi’

  • 💼 SoftBank acquires UK AI chipmaker Graphcore

  • 🤖 Convert text to audio with AI

  • 🛠 New AI tools

  • 🗞️ Latest AI and Tech news

  • 📝 AI Meme of the day

Read Time: 3 mintues

AI NEWS

GOOGLE DEEPMIND

Image Source: Google DeepMind

Google’s DeepMind has made significant advancements in robotics with its office RT-2 robots. These robots, powered by the Gemini 1.5 Pro model, are designed to understand natural language and perform various office tasks more efficiently.

The Details:

  • Employees can activate the robots with a simple “Ok, Robot,” to which the robots respond, “Ok, give me a minute. Thinking with Gemini…” before executing the task.

  • The robots can handle various tasks like identifying power outlets for charging devices and navigating using simple maps.

  • They are trained with extensive video footage of the office, enabling them to recognize verbal, written, drawn, or gesture commands [Demo]

DeepMind’s RT-2 robots illustrate the potential of AI in enhancing workplace efficiency. Although they require 10-30 seconds to process instructions, their ability to perform complex tasks, like inventory checks in a fridge, highlights the growing capability of AI in practical applications. This development not only improves operational efficiency but also sets the stage for future innovations in robotic technology in everyday environments.

GPT-40 COMPETITOR

Image Source:kyutai

French startup Kyutai has unveiled Moshi, a cutting-edge AI voice assistant that boasts real-time interaction and a wide range of emotional responses, positioning it as a direct competitor to OpenAI’s similar but delayed Voice Mode feature.

The Details:

  • Moshi can listen and respond simultaneously, with a response latency of just 160 milliseconds, showcasing faster performance than similar technologies.

  • The assistant offers 70 different emotional tones and styles, including whispers and accents, enhancing user interaction.

  • Moshi is currently accessible for trials on Hugging Face, and Kyutai plans to open-source its underlying model and research soon.

Moshi represents a significant stride in voice AI technology, emphasizing the capabilities of French innovation in the AI sector. By making Moshi’s technology open-source, Kyutai not only challenges market leaders like OpenAI but also contributes to the broader AI community, potentially setting new standards for how interactive voice systems are developed and implemented globally.

AI TUTORIAL

AI TUTORIAL

Audioread is a tool that transforms written content into audio, making it easy to listen to articles, documents, and other text-based materials.

Steps to follow:

  1. Sign Up and Log In: Visit the Audioread website, sign up for an account, and log in to access your dashboard.

  2. Upload Your Text: Click “Upload” to add your document, article, or text file. You can also paste text directly into the provided text box.

  3. Choose Voice and Settings: Select your preferred voice and adjust settings like speed and tone to customize your audio output.

  4. Generate Audio: Click the “Convert” button to transform your text into audio. Audioread will process the text and generate an audio file.

  5. Download and Listen: Once the conversion is complete, download the audio file to your device and listen to your content on the go.

NEW AI TOOLS


NEW AI TOOLS:

🎥 Snapcut- AI-powered video editing for viral shorts

🎙️ Write Label- AI-powered audio ad script and production platform

📝 Siuuu AI- AI writing tools for writers, students, educators, and marketers

🗺️ Mapify- Free AI-powered mind mapping tool

🎤 Respeecher- Voice cloning for filmmakers and creators

📚 Flashka- AI-powered flashcards for accelerated learning

AI & TECH NEWS

Stability AI releases stable assistant features

Stability AI has enhanced its Stable Assistant chatbot with new features, including a Search & Replace tool for image object replacement and Stable Audio for generating three-minute musical tracks. Existing tools like upscaling, image editing, and video creation from images are complemented by Stable Diffusion 3, which enhances image generation capabilities.

Medal secures $13M for cutting-edge desktop AI assistant

Medal, known for its video game clipping product, raised $13 million at a $333 million valuation. The company also launched Highlight, a desktop app acting as a contextual AI assistant, capturing screen content to interact with large language models (LLMs). Highlight allows users to ask questions using tools like ChatGPT and Anthropic’s Claude.

SoftBank acquires UK AI chipmaker Graphcore

FRVR AI democratizes game development by allowing anyone to create games using simple text prompts. Users can generate, refine, and publish games effortlessly, making the process accessible and inclusive for all. Currently in beta, the tool supports creators on various devices, fostering a large community of developers and hobbyists.

AI MEME OF THE DAY

COLLABORATE WITH US:

AI Insights newsletter is read by thousands of AI and Tech professionals/enthusiasts around the world.

Get in touch to get your product seen today!

Or email us at: [email protected]

THANK YOU FOR READING

FEEDBACK

How would you rate today's newsletter?

Your feedback helps me improve my content!

Login or Subscribe to participate in polls.

We'd love to hear your feedback or any interesting thoughts you have!

Please share by replying to this email.

Reply

or to participate.