Speech to Text

 

Introduction

The Speech to Text (STT) service receives speech as input and converts it to text.

It recognizes continuous speech and can transcribe dictation.

 

Image: STT introduction

 

The STT service by LG AI Platform supports the following features:

Features of ASR Engine

Feature | Description
Multi-language support | Supports Korean and English.
Customized results | Displays the speech analysis results at intervals during dictation, or all at once after dictation is complete.
Secure connection to the server | Connects to the server over HTTP/2 secured with Transport Layer Security (TLS).
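
The secure-connection feature above can be illustrated from the client side. The following is a minimal sketch, not the platform's own client code: it uses only Python's standard `ssl` module to prepare a TLS context that offers HTTP/2 ("h2") through ALPN. The helper name `make_h2_tls_context` is made up for this example, and the service's host name and request paths are not given here, so no connection is opened.

```python
import ssl


def make_h2_tls_context() -> ssl.SSLContext:
    """Build a client-side TLS context that offers HTTP/2 via ALPN."""
    # create_default_context() turns on certificate and host name verification.
    context = ssl.create_default_context()
    # HTTP/2 over TLS requires TLS 1.2 or newer.
    context.minimum_version = ssl.TLSVersion.TLSv1_2
    # Offer the "h2" protocol identifier during the TLS handshake.
    context.set_alpn_protocols(["h2"])
    return context


context = make_h2_tls_context()
```

A context like this would typically be handed to an HTTP/2-capable client library, which still has to implement the HTTP/2 framing itself; the standard library alone does not speak HTTP/2.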

 

Architecture

All functions of the STT service run on a server. The service receives PCM data and JSON data as input, converts the input to text data, and returns the text as output.
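
To make the two input types concrete, here is a minimal sketch that prepares a short block of PCM audio and a JSON document on the client side. Everything specific in it is an assumption for illustration: the 16 kHz sample rate, the 16-bit little-endian mono format, the helper names, and all JSON keys are hypothetical and are not taken from the service's actual schema.

```python
import json
import math
import struct

SAMPLE_RATE_HZ = 16000  # assumed; the rate the service expects is not stated here


def make_pcm_tone(duration_s: float, freq_hz: float = 440.0) -> bytes:
    """Return a sine tone as 16-bit signed little-endian mono PCM bytes."""
    n_samples = round(SAMPLE_RATE_HZ * duration_s)
    # Half of full scale keeps every sample well inside the int16 range.
    samples = (
        int(32767 * 0.5 * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE_HZ))
        for i in range(n_samples)
    )
    return struct.pack(f"<{n_samples}h", *samples)


def make_request_metadata(language: str) -> str:
    """Serialize hypothetical request settings as JSON text."""
    # All keys below are illustrative, not the service's real field names.
    return json.dumps({
        "language": language,
        "encoding": "pcm_s16le",
        "sampleRateHz": SAMPLE_RATE_HZ,
    })


pcm = make_pcm_tone(0.1)            # 0.1 s of audio as raw PCM bytes
metadata = make_request_metadata("en")
```

In a real client the PCM bytes would come from a microphone or an audio file rather than a generated tone; the tone only stands in for raw sample data so the sketch stays self-contained.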

 

Image: STT architecture

 

Examples of Use

The STT service can be used in many everyday situations.

 

  • Use voice to control devices while driving

When hands-on operation of personal devices is difficult or unsafe, use voice commands to send text messages or enter a destination into your navigation system.

Image: sending a message or setting a navigation destination by voice while driving

  • Save call center conversations

Use the service's speech recognition to convert important calls to text, so that conversations with customers can be stored and archived.

Image: saving the contents of a call as a text file

  • Prepare minutes

Speech recognition can be used to create minutes automatically during meetings and other important events.

Image: meeting content being written out as text