Abstract:
Provided are an automatic interpretation system and method for generating a synthetic sound having characteristics similar to those of an original speaker's voice. The system includes a speech recognition module configured to generate text data by performing speech recognition on an original speech signal of an original speaker and to extract at least one piece of characteristic information among pitch information, vocal intensity information, speech speed information, and vocal tract characteristic information of the original speech; an automatic translation module configured to generate a synthesis-target translation by translating the text data; and a speech synthesis module configured to generate a synthetic sound of the synthesis-target translation.
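The data flow between the three modules can be illustrated with a minimal sketch. All names here (`VoiceCharacteristics`, `interpret`, the stub modules, and the field units) are hypothetical and do not come from the abstract. The sketch also assumes that the extracted characteristic information is handed to the synthesis module, which the abstract implies but does not state explicitly.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class VoiceCharacteristics:
    # Every field is optional because the abstract only requires
    # "at least one piece" of characteristic information.
    pitch_hz: Optional[float] = None
    intensity_db: Optional[float] = None
    speech_rate: Optional[float] = None          # assumed unit: words per second
    vocal_tract: Optional[List[float]] = None    # assumed: some feature vector


@dataclass
class RecognitionResult:
    text: str
    characteristics: VoiceCharacteristics


@dataclass
class SyntheticSound:
    text: str
    characteristics: VoiceCharacteristics


def interpret(
    signal: bytes,
    recognize: Callable[[bytes], RecognitionResult],
    translate: Callable[[str], str],
    synthesize: Callable[[str, VoiceCharacteristics], SyntheticSound],
) -> SyntheticSound:
    """Chain recognition, translation, and synthesis in that order."""
    result = recognize(signal)                  # text data + characteristics
    translation = translate(result.text)        # synthesis-target translation
    # Assumption: synthesis is conditioned on the speaker's characteristics.
    return synthesize(translation, result.characteristics)


# Stub modules standing in for real recognition, translation, and synthesis.
def stub_recognize(signal: bytes) -> RecognitionResult:
    return RecognitionResult("hello", VoiceCharacteristics(pitch_hz=180.0))


def stub_translate(text: str) -> str:
    return {"hello": "annyeonghaseyo"}.get(text, text)


def stub_synthesize(text: str, ch: VoiceCharacteristics) -> SyntheticSound:
    return SyntheticSound(text, ch)


sound = interpret(b"", stub_recognize, stub_translate, stub_synthesize)
```

In this sketch the characteristic information bypasses the translation module entirely: only the text is translated, while the speaker's characteristics travel alongside it to the synthesis step.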
Abstract:
A voice recognition system divides a search space for voice recognition into a general domain search space and a specific domain search space. A mobile terminal receives a voice recognition target word from a user. A voice recognition server divides the search space into the general domain search space and the specific domain search space, stores both spaces, and performs voice recognition for the voice recognition target word by linking the general domain search space and the specific domain search space.
Abstract:
The present invention relates to a voice recognition system for replacing a specific domain, a mobile device, and a method thereof, and more particularly, to a technology that divides a search space for voice recognition into a general domain search space and a specific domain search space. A voice recognition system according to an exemplary embodiment of the present invention includes: a mobile terminal that receives a voice recognition target word from a user; and a voice recognition server that divides a search space for voice recognition into a general domain search space and a specific domain search space, stores both spaces, and performs voice recognition for the voice recognition target word by linking the general domain search space and the specific domain search space.
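One possible reading of the divided search space is sketched below. It is an illustration under stated assumptions, not the claimed implementation: the class `RecognitionServer`, its methods, and the word lists are hypothetical; string similarity from Python's `difflib` stands in for real acoustic and language-model scoring; and "linkage" is interpreted simply as searching both spaces together. The method `replace_specific_domain` reflects an assumed meaning of "replacing a specific domain", namely swapping the specific space while the general space stays fixed.

```python
from difflib import SequenceMatcher
from typing import Iterable, Optional, Tuple


class RecognitionServer:
    """Hypothetical server that stores the two search spaces separately."""

    def __init__(self, general_words: Iterable[str],
                 specific_words: Iterable[str]) -> None:
        # Sorted lists keep candidate order deterministic.
        self.general = sorted(set(general_words))
        self.specific = sorted(set(specific_words))

    def replace_specific_domain(self, words: Iterable[str]) -> None:
        # Assumption: only the specific space is swapped out.
        self.specific = sorted(set(words))

    def recognize(self, target: str) -> Optional[Tuple[str, str]]:
        """Return (best word, originating space) over the linked spaces."""
        candidates = [(w, "general") for w in self.general]
        candidates += [(w, "specific") for w in self.specific]
        if not candidates:
            return None
        # Stand-in score: character-level similarity to the target word.
        return max(candidates,
                   key=lambda c: SequenceMatcher(None, target, c[0]).ratio())


server = RecognitionServer(
    general_words=["call", "message", "weather"],
    specific_words=["gangnam station", "seoul tower"],
)
place = server.recognize("gangnam staton")   # misspelling mimics a noisy input
common = server.recognize("wether")

server.replace_specific_domain(["aspirin", "ibuprofen"])
drug = server.recognize("ibuprofin")
```

Keeping the two spaces as separate stores means the specific domain can be exchanged without rebuilding the general one, which is the design property this sketch is meant to highlight.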