摘要:
A multi-layered speech recognition apparatus and method, the apparatus includes a client checking whether the client recognizes the speech using a characteristic of speech to be recognized and recognizing the speech or transmitting the characteristic of the speech according to a checked result; and first through N-th servers, wherein the first server checks whether the first server recognizes the speech using the characteristic of the speech transmitted from the client, and recognizes the speech or transmits the characteristic according to a checked result, and wherein an n-th (2≦n≦N) server checks whether the n-th server recognizes the speech using the characteristic of the speech transmitted from an (n−1)-th server, and recognizes the speech or transmits the characteristic according to a checked result.
摘要:
A multi-layered speech recognition apparatus and method, the apparatus includes a client checking whether the client recognizes the speech using a characteristic of speech to be recognized and recognizing the speech or transmitting the characteristic of the speech according to a checked result; and first through N-th servers, wherein the first server checks whether the first server recognizes the speech using the characteristic of the speech transmitted from the client, and recognizes the speech or transmits the characteristic according to a checked result, and wherein an n-th (2≦n≦N) server checks whether the n-th server recognizes the speech using the characteristic of the speech transmitted from an (n−1)-th server, and recognizes the speech or transmits the characteristic according to a checked result.
摘要:
A multi-layered speech recognition apparatus and method, the apparatus includes a client checking whether the client recognizes the speech using a characteristic of speech to be recognized and recognizing the speech or transmitting the characteristic of the speech according to a checked result; and first through N-th servers, wherein the first server checks whether the first server recognizes the speech using the characteristic of the speech transmitted from the client, and recognizes the speech or transmits the characteristic according to a checked result, and wherein an n-th (2≦n≦N) server checks whether the n-th server recognizes the speech using the characteristic of the speech transmitted from an (n−1)-th server, and recognizes the speech or transmits the characteristic according to a checked result.
摘要:
A method, medium, and apparatus for generating a record sentence to establish a speech corpus, including generating a synthesized sentence of speech and synthesis information related to speech synthesis by performing speech synthesis for a predetermined sentence of text, selecting an unseen sentence including an unseen unit according to the synthesis information, generating a weight indicating a recording priority of the unseen unit included in the selected unseen sentence, and generating a record sentence by combining the unseen unit with the speech synthesis information according to the generated weight.
摘要:
A method, medium, and apparatus for generating a record sentence to establish a speech corpus, including generating a synthesized sentence of speech and synthesis information related to speech synthesis by performing speech synthesis for a predetermined sentence of text, selecting an unseen sentence including an unseen unit according to the synthesis information, generating a weight indicating a recording priority of the unseen unit included in the selected unseen sentence, and generating a record sentence by combining the unseen unit with the speech synthesis information according to the generated weight.
摘要:
A speech processing apparatus, medium, and method recognizing speech and responding to the speech. The speech processing apparatus may includes an entity extracting unit which extracts entity information and an upper entity corresponding to the entity information from input speech, a focus determination unit which determines a focus using the extracted entity information requiring a response, a mapping unit which maps lower entity corresponding to the focus with the extracted entity information, and a recognition unit which recognizes a result of arranging the extracted entity information according to semantic association among the lower entities as the input speech. Thus, the speech processing apparatus can accurately recognize grammatically correct speech as well as grammatically incorrect speech and then respond to the speech.
摘要:
A speech processing apparatus, medium, and method recognizing speech and responding to the speech. The speech processing apparatus may includes an entity extracting unit which extracts entity information and an upper entity corresponding to the entity information from input speech, a focus determination unit which determines a focus using the extracted entity information requiring a response, a mapping unit which maps lower entity corresponding to the focus with the extracted entity information, and a recognition unit which recognizes a result of arranging the extracted entity information according to semantic association among the lower entities as the input speech. Thus, the speech processing apparatus can accurately recognize grammatically correct speech as well as grammatically incorrect speech and then respond to the speech.
摘要:
A speech recognition method, medium, and system. The method includes detecting an energy change of each frame making up signals including speech and non-speech signals, and identifying a speech segment corresponding to frames that include only speech signals from among the frames based on the detected energy change.
摘要:
A speech enhancement apparatus and method and a computer-readable recording medium having a program recorded thereon execute a speech enhancement method. The speech enhancement apparatus includes a spectrum subtraction unit generating a subtracted spectrum by subtracting an estimated noise spectrum from a received speech spectrum, a correction function modeling unit generating a correction function to minimize a noise spectrum using variation of a noise spectrum included in training data, and a spectrum correction unit generating a corrected spectrum by correcting the subtracted spectrum using the correction function.
摘要:
A speech recognition method, medium, and system. The method includes detecting an energy change of each frame making up signals including speech and non-speech signals, and identifying a speech segment corresponding to frames that include only speech signals from among the frames based inclusive of the detected energy change.