TrustRadius: an HG Insights company

Amazon Transcribe

Score7.8 out of 10

14 Reviews and Ratings

What is Amazon Transcribe?

Amazon Transcribe uses a deep learning process called automatic speech recognition (ASR) to convert speech to text quickly and accurately. Amazon Transcribe can be used to transcribe customer service calls, to automate closed captioning and subtitling, and to generate metadata for media assets to create a searchable archive. Amazon Transcribe Medical can be added to provide medical speech to text capabilities to clinical documentation applications.

Categories & Use Cases

A Must Have Text-to-Speech Solution as a Service: Amazon Transcribe

Use Cases and Deployment Scope

With Amazon Transcribe, it has been effective to extract actionable insights from clients' chats which advance engagements in real-time.

Pros

  • Content discovery advancements through audio and video contents conversions to texts.
  • Creating notes for meetings has been at ease with this solution.
  • Real-time transcription: sending live audio and videos in response to searchable texts.

Cons

  • It was not easy to bring Amazon Transcribe to life, but kudos to the vendor for the free support they offered.

Most Important Features

  • Audio input tools such as batch and streaming transcription.
  • Custom vocabulary: Help add new words to simplify transcripts.
  • Vocabulary filtering tools.
  • Data protection features.

Return on Investment

  • Support video files, so we save time as we do not need to change them to audio contents.
  • Automatic redaction of PII (personally identifiable information)
  • Real-time data processing reduces human hours needed to transcribe.

Alternatives Considered

Nuance Dragon Speech Recognition

Industrial scale speech to text solution

Use Cases and Deployment Scope

We use Amazon Transcribe to help the KYC process for digital certificates by transforming key phrases from speech to text during an authentication video. We need a fast and reliable system to transform our users' speech to text in a way that we can validate various outputs of this process via APIs and issue a final approval or denial of the user request.

Pros

  • It is fast.
  • It can be accessed via multiple types of SDKs and APIs.
  • It is effective and predictable.

Cons

  • The APIs and AWS ecosystem can be difficult to grasp if you are new.

Most Important Features

  • Security.
  • Efficiency.
  • Integrating it via APIs.

Return on Investment

  • Cost reductions in KYC automating our video approvals.
  • Better customer service.
  • A good base for building new products and services around speech to text.

Other Software Used

AWS Lambda, AWS Batch, AWS Certificate Manager

Great, feature-rich transcription which offers the most value and functionality in this category!

Use Cases and Deployment Scope

Our company began with AWS Transcribe when looking for a way to improve both the productivity of our agents and our customer experience. Transcribe is an easy way to quickly convert human speech into a readable (and reportable) test that helps us pull helpful data from. The functionality is nice as it lets you sort based on keywords or interruptions. One benefit we've seen is the ability to use both real-time transcribing and transcribing of finished audio files. The service is very feature-rich and provides many new options we haven't found in other services (such as allowing it to detect multiple speakers during a meeting, and track interruptions from sales agents). The tool is useful, particularly if you are able to use all of the features it offers.

Pros

  • Creating call transcripts from our call centers.
  • Searching calls for particular keywords to trace back problems.
  • Creating transcripts from company, or other, meetings.
  • Ability to protect caller data, such as credit card numbers or personal information by omitting it from transcripts.

Cons

  • There is a small learning curve to begin using ALL of the features the software offers. Additional tech support may be required for some integrations, so it's worth looking into if planning to use all of the features they offer.

Most Important Features

  • Speech to Text (both live audio or uploaded audio files).
  • Ability to search for text within an audio transcript.
  • Ability to redact or remove private information from transcripts.

Return on Investment

  • Cost savings due to time savings
  • Ability to catch issues quickly and coach staffing
  • Ability to track conversation model usage via scripts, and view responses to make changes in realtime

Other Software Used

Aircall

The Secure Way to Transcribe

Use Cases and Deployment Scope

We are using Amazon Transcribe extensively to transcribe conversations with our customers. We are also utilizing it in in-house video demos and presentations to add subtitles. The business problems we address are inclusive of reduced labor (if transcription were to be done manually in entirety). It is quite easy to implement and its integration with our existing databases was quite simple.

Pros

  • It converts live recordings to text with few errors.
  • It has powerful speech recognition models- it transcribes well even low quality audios.

Cons

  • Chat, meetings and call transcriptions for confidential use must be accompanied by human input to edit the errors.
  • While onboarding new users, (I'm in IT) I noticed the learning curve was slow.

Most Important Features

  • Live recording.
  • Personal identifiable information redaction feature.

Return on Investment

  • Working in the backend, I would say the most important ROI has been data security through implementation of enterprise-grade technical and physical controls which prevent unauthorized access to our content.

Other Software Used

Playbook AI

Save Time, Pay little, and Be More Productive

Use Cases and Deployment Scope

I use Amazon Transcribe in order to facilitate my video translation, transcription, and direct translation tasks. Instead of typing everything and wasting ages doing so, Amazon Transcribe does that in a very short time. Some speakers have heavy accents, and this service helps me to figure out most of the challenges in the tasks.

Pros

  • Handling tough accents
  • Handling various accents

Cons

  • Support of other local Arabic dialects.

Most Important Features

  • Fast
  • Automated

Return on Investment

  • It can transcribe different specialized fields.

Alternatives Considered

Google Cloud Speech-to-Text

Other Software Used

Google Cloud Speech-to-Text