Unlike ChatGPT, Gemini can now convert audio files into text. Here is how to use the new feature.
Since Monday, September 8, 2025, Gemini, Google’s conversational agent, has been able to analyze and transcribe the content of an audio file, Josh Woodward, Vice President of Google Labs and the Gemini app, announced. Free users, however, face some restrictions.
Papercut fixed: You can now upload any file to @GeminiApp. Including the #1 request: audio files are now supported! pic.twitter.com/4Te3xwLC6W
— Josh Woodward (@joshwoodward) September 8, 2025
Gemini can now transcribe audio files
Google seems to have addressed one of its users’ top requests by allowing audio files to be uploaded to Gemini. Available on the web, iOS, and Android, the conversational agent can now transcribe a file in seconds, regardless of its format (MP3, M4A, WAV, etc.). It can also analyze the content or summarize its key points.
The transcription and analysis feature comes with limits, Google specifies. Free users can upload files up to 10 minutes long and are entitled to five prompts per day. Subscribers to the Google AI Pro and Google AI Ultra plans, on the other hand, can upload files up to 3 hours long. Up to 10 files can be imported at once, according to a help page.
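For readers who want to check a batch of recordings against these limits before uploading, here is a minimal Python sketch. It is not part of Gemini or any Google tool. The function name `check_upload` and the constants are hypothetical, and the sketch assumes the duration limit applies to each file individually, as the wording above suggests.

```python
# Limits as reported in this article; the names below are illustrative only.
FREE_MAX_MINUTES = 10        # free version: files up to 10 minutes long
PAID_MAX_MINUTES = 3 * 60    # Google AI Pro / Ultra: up to 3 hours
MAX_FILES_PER_UPLOAD = 10    # up to 10 files imported at once


def check_upload(durations_minutes, paid=False):
    """Return a list of problems with a planned upload (empty list = it fits).

    Assumption: the duration limit is checked per file, not on the total.
    """
    problems = []
    limit = PAID_MAX_MINUTES if paid else FREE_MAX_MINUTES

    if len(durations_minutes) > MAX_FILES_PER_UPLOAD:
        problems.append(
            f"too many files: {len(durations_minutes)} > {MAX_FILES_PER_UPLOAD}"
        )

    for index, minutes in enumerate(durations_minutes):
        if minutes > limit:
            problems.append(
                f"file {index} is {minutes} min, over the {limit} min limit"
            )

    return problems
```

Calling `check_upload([5, 12])` on the free version would flag the second file, while `check_upload([5, 12], paid=True)` would report no problems.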
A feature absent from ChatGPT
By integrating a feature already present in NotebookLM, another Google service, Gemini positions itself as an alternative to multilingual transcription tools like Good Tape or Vook.ai, whose free versions are often limited. Importantly, it offers an option that ChatGPT does not yet have. OpenAI’s tool has offered a Recording Mode on macOS since July 2025, but it is limited to capturing meetings or brainstorming sessions, and it remains, for now, reserved for paying subscribers.
How to transcribe an audio file with Gemini
Here’s how to transcribe or analyze an audio file in Gemini:
- Click on the + icon located in the input bar,
- Select Import files,
- Choose an audio file (MP3, MP4, M4A, etc.) no longer than 10 minutes on the free version, or 3 hours with Google AI Pro or Google AI Ultra,
- In the input bar, add a prompt to specify your request (“Transcribe this file”, “Summarize the key points”, etc.).

Jordan Park writes in-depth reviews and editorial opinion pieces for Touch Reviews. With a background in UI/UX design, Jordan offers a unique perspective on device usability and user experience across smartphones, tablets, and mobile software.