Unlike ChatGPT, Gemini can now convert audio files into text. Here is how to use the new feature.
Since Monday, September 8, 2025, Gemini, Google’s conversational agent, has been able to analyze and transcribe the content of an audio file, announced Josh Woodward, Vice President of Google Labs and the Gemini application. Free users, however, face some restrictions.
Papercut fixed: You can now upload any file to @GeminiApp. Including the #1 request: audio files are now supported! pic.twitter.com/4Te3xwLC6W
— Josh Woodward (@joshwoodward) September 8, 2025
Gemini can now transcribe audio files
Google seems to have addressed one of its users’ top requests by allowing audio files to be uploaded to Gemini. On the web, iOS, and Android, the conversational agent can now transcribe a file in seconds, regardless of the file format (MP3, M4A, WAV, etc.). It can also analyze the file’s content or summarize its key points.
The transcription and analysis feature in Gemini comes with limits, Google specifies. Free users can upload files up to 10 minutes long and are entitled to five prompts per day. Subscribers to the Google AI Pro and Google AI Ultra plans, on the other hand, can upload files up to 3 hours long. Up to 10 files can be uploaded at once, according to a help page.
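For readers who prepare recordings in bulk, the limits above can be checked locally before uploading. The following is a minimal Python sketch, not anything provided by Google: the function names `wav_duration_seconds` and `fits_limits` are hypothetical, the script only reads WAV files (via Python's standard `wave` module), and it assumes the duration limit applies to each file individually, which the help page quoted here does not spell out.

```python
import wave

# Limits as reported in this article (not read from any Google API).
FREE_LIMIT_SECONDS = 10 * 60       # free users: 10 minutes
PAID_LIMIT_SECONDS = 3 * 60 * 60   # Google AI Pro / Ultra: 3 hours
MAX_FILES_PER_UPLOAD = 10          # files that can be uploaded at once


def wav_duration_seconds(path):
    """Return the duration of a WAV file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def fits_limits(durations, paid=False):
    """Check a batch of durations (in seconds) against the reported limits.

    Assumption: the duration limit is applied per file, not to the batch total.
    """
    limit = PAID_LIMIT_SECONDS if paid else FREE_LIMIT_SECONDS
    if len(durations) > MAX_FILES_PER_UPLOAD:
        return False
    return all(duration <= limit for duration in durations)
```

For example, `fits_limits([wav_duration_seconds("interview.wav")])` would indicate whether a hypothetical `interview.wav` is short enough for a free account. Other formats such as MP3 or M4A would need a third-party library to measure.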
A feature absent from ChatGPT
By integrating a feature already present in NotebookLM, another Google service, Gemini positions itself as an alternative to multilingual transcription solutions like Good Tape or Vook.ai, whose free versions are often limited. Importantly, it offers an option that ChatGPT does not yet have. Since last July, OpenAI’s tool has offered a Recording Mode on macOS, but it is limited to capturing meetings or brainstorming sessions and remains, for now, reserved for paying subscribers.
How to transcribe an audio file with Gemini
Here’s how to transcribe or analyze an audio file in Gemini:
- Click on the + icon located in the input bar.
- Select Import files.
- Choose an audio file (MP3, MP4, M4A, etc.) with a maximum duration of 10 minutes for free users.
- In the input bar, add a prompt to specify your request ("Transcribe this file", "Summarize the key points", etc.).

Jordan Park writes in-depth reviews and editorial opinion pieces for Touch Reviews. With a background in UI/UX design, Jordan offers a unique perspective on device usability and user experience across smartphones, tablets, and mobile software.