Amazon Polly vs. Google Cloud Speech-to-Text

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Amazon Polly	Score 10.0 out of 10	N/A	Amazon Polly turns text into lifelike speech, allowing users to create applications that talk, and build new categories of speech-enabled products. Polly's Text-to-Speech (TTS) service uses deep learning technologies to synthesize natural sounding human speech. With dozens of lifelike voices across a broad set of languages, users can build speech-enabled applications that work in different countries.	$4 Per Request
Google Cloud Speech-to-Text	Score 8.3 out of 10	N/A	Speech-to-Text on Google Cloud is a tool used to convert speech into text using an API powered by Google’s AI technologies. The vendor states that users can transcribe content in real time or from stored files, deliver a better user experience in products through voice commands, and gain insights from customer interactions to improve service.	$0.02 per min

Pricing

Editions & Modules
Amazon Polly	Google Cloud Speech-to-Text
Up to 1,000 Requests: $4.00 per request	Speech-to-Text V2 API: $0.016 per min
Up to 10,000 Requests: $4.00 per request	Speech-to-Text V1 API: $0.024 per min

Offerings

Pricing Offerings	Amazon Polly	Google Cloud Speech-to-Text
Free Trial	No	Yes
Free/Freemium Version	No	Yes
Premium Consulting/Integration Services	No	No

Entry-level Setup Fee

No setup fee

Additional Details

—

Speech-to-Text V1 API: V1 offers data residency for multi-region only. Models include short, long, phone call, and video. V1 does not include audit logging. New customers get $300 in free credits, plus 60 minutes of audio transcription and analysis free per month, which is not charged against those credits.

Speech-to-Text V2 API: V2 offers data residency for both multi-region and single-region. Models include short, long, telephony, video, and Chirp. V2 includes audit logging and support for customer-managed encryption keys.
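As a rough illustration of how the per-minute rates listed above translate into a monthly bill, here is a minimal Python sketch. The function name `estimate_monthly_cost` is hypothetical, and the sketch assumes that the 60 free minutes per month are simply deducted before billing and that they apply to whichever API version is chosen; the listing does not confirm either point, and real billing may round or tier usage differently.

```python
from decimal import Decimal

# Per-minute rates in USD, copied from the pricing listing above.
RATES_PER_MIN = {"v1": Decimal("0.024"), "v2": Decimal("0.016")}

# Assumption: the 60 free minutes per month are deducted before billing.
FREE_MINUTES_PER_MONTH = Decimal("60")


def estimate_monthly_cost(minutes, api_version, free_minutes=FREE_MINUTES_PER_MONTH):
    """Return a rough monthly cost in USD for the given minutes of audio."""
    rate = RATES_PER_MIN[api_version]
    # Usage below the free allowance costs nothing.
    billable = max(Decimal(str(minutes)) - free_minutes, Decimal("0"))
    return (billable * rate).quantize(Decimal("0.01"))


if __name__ == "__main__":
    # Compare both API versions for 1,000 minutes of audio in one month.
    for version in ("v1", "v2"):
        print(version, estimate_monthly_cost(1000, version))
```

Under these assumptions, the gap between V1 and V2 grows linearly with usage, so the difference matters mostly for high-volume transcription workloads.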

Community Pulse
	Amazon Polly	Google Cloud Speech-to-Text
Considered Both Products	No answer on this topic	Verified User, Consultant (chose Google Cloud Speech-to-Text; incentivized review): "Google Cloud Speech-to-Text outperformed its competitors significantly in terms of accuracy, surpassing any other product available. Additionally, its support for multiple languages was unrivaled in the market. Moreover, for clients with robust bandwidth, Google Cloud …"

Best Alternatives
	Amazon Polly	Google Cloud Speech-to-Text
Small Businesses	No answers on this topic	RingCentral Contact Center Score 8.1 out of 10
Medium-sized Companies	No answers on this topic	Zoom Contact Center Score 8.3 out of 10
Enterprises	No answers on this topic	Verint Speech and Text Analytics Score 8.4 out of 10

User Ratings
	Amazon Polly	Google Cloud Speech-to-Text
Likelihood to Recommend	- (0 ratings)	8.0 (31 ratings)
Usability	- (0 ratings)	8.3 (12 ratings)

User Testimonials
	Amazon Polly	Google Cloud Speech-to-Text
Likelihood to Recommend	No answers on this topic	"I've had scenarios where I collaborate with a team whose members are from around the world. I used it there, and we spoke to each other in their native language. That boosts everyone's confidence in our collaborative efforts. I've also used its model and the API in my projects, including a virtual assistant and a multilingual application that allows us to learn languages from around the world. We tested it with a group of 12 people, and that's when it failed. I mean, it's not a failure, but it can't detect every person." Satyam Pandey, Associate software developer (incentivized review)
Pros	No answers on this topic	"An amazing tool which helps a lot in meetings. It's an efficient tool for improving efficiency by saving a lot of time typing. It saves at least 40-50% of our time, thus increasing efficiency. Incredible accuracy with multiple accents and multiple languages. It takes punctuation into consideration." Verified User, Anonymous (incentivized review)
Cons	No answers on this topic	"Integration outside of the Google ecosystem is challenging. Google Cloud Speech-to-Text works only with an active internet connection; if the internet bandwidth is low, it affects the transcription process and can lead to data inaccuracy. The pricing is also at the higher end, which not all companies can afford; small-scale organisations that would like to use the tool would weigh the price when making the decision. Reducing the price could increase product usage." irfan shaik, Technical Consultant (incentivized review)
Usability	No answers on this topic	"The reasoning behind my 10 is that the UI is very intuitive; I didn't require any formal training to use it. Google's speech-to-text is not just a conversion tool; it helps automate mundane tasks, saves time, and has an almost human-like understanding." Vaibhav Singh, Sr. Analyst (incentivized review)
Alternatives Considered	No answers on this topic	"Google Cloud Speech-to-Text outperformed its competitors significantly in terms of accuracy, surpassing any other product available. Additionally, its support for multiple languages was unrivaled in the market. Moreover, for clients with robust bandwidth, Google Cloud Speech-to-Text offered real-time transcription capabilities, enabling users to transcribe live audio streams with minimal delay." Verified User, Anonymous (incentivized review)
Return on Investment	No answers on this topic	"It reduced our budget for assistants who transcribed files manually. It speeds up the process, because we can have transcriptions straight after the interviews. It increased accuracy, because the AI transcribes every second, and you can find the words that were said at a specific time." Maria Sergeeva, UX and Content Designer (incentivized review)