Abstract:
Disclosed are an apparatus and method of deducing a user's intention using multimodal information. The user's intention deduction apparatus includes a first predictor to predict a part of a user's intention using at least one piece of motion information, and a second predictor to predict the user's intention using the predicted part of the user's intention and multimodal information received from at least one multimodal sensor.
Abstract:
Disclosed are an apparatus and method of deducing a user's intention using motion information. The user's intention deduction apparatus includes a speech intention determining unit configured to predict a speech intention regarding a user's speech using motion information sensed by at least one motion capture sensor, and a controller configured to control the detection of a voice section in a received sound signal based on the predicted speech intention.
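The gating idea in this abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the hand-to-mouth distance feature, the 10 cm rule in `predict_speech_intention`, and the energy threshold are all hypothetical choices made only for illustration. Voice section detection is applied only to frames where the motion-based predictor indicates an intention to speak.

```python
def predict_speech_intention(hand_to_mouth_cm, near_cm=10.0):
    """Hypothetical rule: speech is intended while the hand is near the mouth."""
    return hand_to_mouth_cm <= near_cm


def detect_voice_sections(energies, distances_cm, threshold=0.1):
    """Return (start, end) frame ranges, end exclusive, of detected voice sections.

    A frame counts as voice only if the motion-based predictor reports a
    speech intention AND the frame energy exceeds the (assumed) threshold.
    """
    n = min(len(energies), len(distances_cm))
    sections = []
    start = None
    for i in range(n):
        active = predict_speech_intention(distances_cm[i]) and energies[i] > threshold
        if active and start is None:
            start = i                      # a voice section opens here
        elif not active and start is not None:
            sections.append((start, i))    # the open section closes here
            start = None
    if start is not None:
        sections.append((start, n))        # section still open at the end
    return sections
```

In this sketch a loud frame is ignored while the hand is far from the mouth, which is one simple way a controller could suppress false voice detections from background noise.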
Abstract:
An acoustic processing apparatus is provided. The acoustic processing apparatus includes a first extracting unit configured to extract a first acoustic model that corresponds to a first position among positions set in a speech recognition target area, a second extracting unit configured to extract at least one second acoustic model, each corresponding to a second position in proximity to the first position, and an acoustic model generating unit configured to generate a third acoustic model based on the first acoustic model, the second acoustic model, or a combination thereof.
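One way to picture the "combination" case is the sketch below. It rests on assumptions not stated in the abstract: each acoustic model is reduced to a flat parameter vector, the first position is taken to be the set position nearest the speaker, and the models are blended by inverse-distance weighting. The function names are hypothetical.

```python
import math


def nearest_positions(target, positions, k):
    """Return the k set positions closest to the target, nearest first."""
    return sorted(positions, key=lambda p: math.dist(p, target))[:k]


def interpolate_models(target, models, k=3):
    """Blend the models at the k nearest positions by inverse-distance weights.

    `models` maps a position tuple to a parameter vector (list of floats).
    The nearest position plays the role of the first position; the
    remaining ones play the role of the neighbouring second positions.
    """
    nearest = nearest_positions(target, list(models), k)
    first = nearest[0]
    if math.dist(first, target) == 0.0:
        # The speaker is exactly at a set position: use that model unchanged.
        return list(models[first])
    weights = [1.0 / math.dist(p, target) for p in nearest]
    total = sum(weights)
    dim = len(models[first])
    return [
        sum(w * models[p][i] for w, p in zip(weights, nearest)) / total
        for i in range(dim)
    ]
```

For example, a speaker midway between two set positions would receive the plain average of the two stored parameter vectors.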
Abstract:
Provided are a voice command recognition apparatus and method capable of determining the intention of a voice command input through a voice dialog interface by combining a rule-based dialog model and a statistical dialog model. The voice command recognition apparatus includes a command intention determining unit configured to correct an error in recognizing a voice command of a user, and an application processing unit configured to check whether the final command intention determined by the command intention determining unit comprises the input factors for execution of an application.
Abstract:
A content synchronization apparatus is provided. The content synchronization apparatus includes: a communication unit configured to communicate with a device with which the content synchronization apparatus can synchronize content; a control unit configured to, in response to receiving a synchronization command to share current content being played by the device, share the current content by acquiring the current content and state information corresponding to the current content through the communication unit, synchronize the current content with the device using the current content and the state information, and configure a display screen based on the results of the synchronization; and an output unit configured to display the configured display screen.
Abstract:
Provided are an apparatus and method for automatically generating grammar for use in the processing of natural language. The apparatus may set one domain out of a plurality of domains as a target domain to be processed by an intention analysis system, extract a corpus relevant to the target domain from a collection of corpora, and generate grammar for use in the target domain based on the extracted corpus.
Abstract:
An apparatus and method for recognizing a voice command for use in an interactive voice user interface are provided. The apparatus includes a command intention belief generation unit that is configured to recognize a first voice command and to generate one or more command intention beliefs for the first voice command. The apparatus also includes a command intention belief update unit that is configured to update each of the command intention beliefs based on a system response to the first voice command and on a second voice command. The apparatus also includes a command intention belief selection unit that is configured to select one of the updated command intention beliefs for the first voice command, and an operation signal output unit that is configured to select a final command intention from the selected updated command intention belief and to output an operation signal based on the selected final command intention.
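The update-then-select flow can be sketched as follows. This is a simplification under stated assumptions: a single belief is modelled as a probability distribution over candidate intentions, and the update is a Bayes-style reweighting. The `likelihoods` argument is a hypothetical stand-in for how well each intention explains the second voice command given the system response. The abstract does not specify the update rule.

```python
def update_beliefs(beliefs, likelihoods):
    """Reweight each intention's probability by its likelihood and renormalize.

    `beliefs` maps intention -> prior probability after the first command.
    `likelihoods` maps intention -> assumed likelihood of the second command
    (given the system response) under that intention.
    """
    unnormalized = {
        intention: prob * likelihoods.get(intention, 0.0)
        for intention, prob in beliefs.items()
    }
    total = sum(unnormalized.values())
    if total == 0.0:
        # No intention explains the second command: keep the prior beliefs.
        return dict(beliefs)
    return {intention: value / total for intention, value in unnormalized.items()}


def select_final_intention(beliefs):
    """Pick the intention with the highest updated probability."""
    return max(beliefs, key=beliefs.get)
```

If the first command left "play music" and "play movie" equally likely, and the user's reply to the system's clarifying response fits "play music" far better, the updated belief shifts toward "play music" and that intention drives the operation signal.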
Abstract:
A speech recognition apparatus is provided. The speech recognition apparatus includes a primary speech recognition unit configured to perform speech recognition on input speech and thereby generate word lattice information, a word string generation unit configured to generate one or more word strings based on the word lattice information, a language model score calculation unit configured to calculate bidirectional language model scores of the generated word strings by selectively using forward and backward language models for each of the words in each of the generated word strings, and a sentence output unit configured to output, based on the calculated bidirectional language model scores, one or more of the generated word strings with high scores as results of the speech recognition of the input speech.
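A minimal sketch of per-word selective scoring is given below. Several details are assumptions rather than anything the abstract specifies: both language models are bigram tables of log probabilities, "selectively using" is interpreted as taking the better of the forward and backward score for each word, and unseen bigrams fall back to a fixed floor value.

```python
import math

FLOOR = math.log(1e-6)  # assumed log probability for unseen bigrams


def bidirectional_score(words, forward_lm, backward_lm):
    """Sum, over the words, the better of the forward and backward log scores.

    forward_lm maps (previous_word, word) -> log P(word | previous_word).
    backward_lm maps (next_word, word) -> log P(word | next_word).
    """
    padded = ["<s>"] + list(words) + ["</s>"]
    score = 0.0
    for i in range(1, len(padded) - 1):
        fwd = forward_lm.get((padded[i - 1], padded[i]), FLOOR)
        bwd = backward_lm.get((padded[i + 1], padded[i]), FLOOR)
        score += max(fwd, bwd)  # per-word selection between the two models
    return score


def best_strings(candidates, forward_lm, backward_lm, n=1):
    """Return the n candidate word strings with the highest scores."""
    ranked = sorted(
        candidates,
        key=lambda ws: bidirectional_score(ws, forward_lm, backward_lm),
        reverse=True,
    )
    return ranked[:n]
```

Here `candidates` stands in for the word strings generated from the word lattice. A real system would likely combine these scores with acoustic scores, which this sketch omits.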