Gemini Now Transcribes Audio Files: Discover How It Works!

Jordan Park

September 22, 2025

X Facebook WhatsApp

Unlike ChatGPT, Gemini now has the capability to convert audio files into text. Learn how to use this new feature.

Scientists confirm: This is the most effective way to get your cat’s attention, according to new research

Elderly Couple Refuses Reserved Seats—Viral Train Standoff Sparks Fiery Debate on Courtesy

Contents

As of Monday, September 8, 2025, Gemini, Google’s conversational agent, can now analyze and transcribe the content of an audio file, announced Josh Woodward, Vice President of Google Labs and the Gemini application. However, some restrictions have been implemented for free users.

Papercut fixed: You can now upload any file to @GeminiApp. Including the #1 request: audio files are now supported! pic.twitter.com/4Te3xwLC6W

— Josh Woodward (@joshwoodward) September 8, 2025

Why You Should Never Reheat These Foods in the Microwave – The Hidden Dangers Experts Warn About

I tried the top 5 guard dogs—here’s what makes these breeds the ultimate protectors

To view this content from social networks, you must accept cookies and advertising trackers.

These cookies and trackers allow our partners to offer you ads and content tailored to your browsing, your profile and your interests.More info.

Gemini can now transcribe audio files

Google seems to have addressed one of its user’s top requests by allowing audio files to be uploaded to Gemini. Available on the web, iOS, and Android versions, the conversational agent can now transcribe a file in seconds, regardless of the file format (MP3, M4A, WAV, etc.). It can also analyze its content or summarize key points.

The transcription and analysis feature in Gemini is limited, Google specifies. Free version users can upload files up to 10 minutes long and are entitled to five prompts per day. Subscribers to Google AI Pro and Google AI Ultra plans, on the other hand, enjoy a duration extended to 3 hours. It is possible to import up to 10 files simultaneously, according to a help page.

A feature absent from ChatGPT

By integrating a feature already present on NotebookLM, another Google service, Gemini positions itself as an alternative to multilingual transcription solutions like Good Tape or Vook.ai, whose free versions are often limited. Importantly, it offers an option that ChatGPT does not yet have. Since last July, OpenAI’s tool does offer a Recording Mode on macOS, but it is limited to capturing meetings or brainstorming sessions. And it remains, for now, reserved for paying subscribers.

How to transcribe an audio file with Gemini

Here’s how to transcribe or analyze an audio file in Gemini:

Click on the + icon located in the input bar,

Select Import files,

Choose an audio file (MP3, MP4, M4A, etc.) with a maximum duration of 10 minutes,

In the input bar, add a prompt to specify your request (« Transcribe this file », « Summarize the key points », etc.).

Similar Posts

Rate this post

Jordan Park

Jordan Park writes in-depth reviews and editorial opinion pieces for Touch Reviews. With a background in UI/UX design, Jordan offers a unique perspective on device usability and user experience across smartphones, tablets, and mobile software.

X Facebook WhatsApp

iPhone Air’s Dynamic Island Positioned Lower: Discover the Impact on User Experience!

Apple Blames Brussels for Limited AirPods Performance in Europe: Find Out Why