Abstract:
A string of acoustic feature parameters for each recognition-desired word and a string of acoustic feature parameters for each reception word are registered in advance. When an uttered word is received, a string of acoustic feature parameters is extracted from it and compared with the string of acoustic feature parameters of each recognition-desired word, and a recognition-desired word recognition score indicating the degree of similarity between the uttered word and each recognition-desired word is calculated. In the same way, a reception word recognition score indicating the degree of similarity between the uttered word and each reception word is calculated. When the recognition score of a particular recognition-desired word is higher than the highest reception word recognition score, the uttered word is recognized as that recognition-desired word, and the operation of an electric apparatus is controlled according to it. In contrast, when the recognition score of a particular reception word is higher than the highest recognition-desired word recognition score, the uttered word is recognized as that reception word and is rejected, so that the electric apparatus is not operated.
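The accept-or-reject decision described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `similarity` and `recognize` are hypothetical, the score is assumed to be a negative Euclidean distance between equal-length feature vectors (the abstract does not say how similarity is computed), and a tie between the two best scores is assumed to be a rejection.

```python
import math
from typing import Dict, List, Optional


def similarity(a: List[float], b: List[float]) -> float:
    """Placeholder score (assumption): higher means more similar."""
    if len(a) != len(b):
        raise ValueError("feature strings must have the same length")
    return -math.dist(a, b)


def recognize(
    utterance: List[float],
    desired: Dict[str, List[float]],
    reception: Dict[str, List[float]],
) -> Optional[str]:
    """Return the recognized recognition-desired word, or None if rejected."""
    if not desired:
        return None
    # Score the utterance against every registered word of both kinds.
    desired_scores = {w: similarity(utterance, f) for w, f in desired.items()}
    reception_scores = [similarity(utterance, f) for f in reception.values()]

    best_word = max(desired_scores, key=desired_scores.get)
    best_reception = max(reception_scores, default=-math.inf)

    # Accept only if the best recognition-desired word beats every reception word.
    if desired_scores[best_word] > best_reception:
        return best_word
    return None  # rejected: the apparatus is not operated


# Usage with made-up two-dimensional feature vectors.
desired_words = {"on": [1.0, 0.0], "off": [0.0, 1.0]}
reception_words = {"um": [5.0, 5.0]}
accepted = recognize([0.9, 0.1], desired_words, reception_words)
rejected = recognize([4.8, 5.1], desired_words, reception_words)
```

A real recognizer would compare variable-length feature strings, for example with dynamic time warping or a statistical model, rather than fixed-length vectors.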
Abstract:
A metadata preparing device comprising a content reproducing unit (1) for reproducing and outputting content, a monitor (3) for monitoring the content reproduced by the content reproducing unit, a voice input unit (4), a voice recognition unit (5) for recognizing a voice signal input from the voice input unit, a metadata generation unit (6) for converting the information recognized by the voice recognition unit into metadata, and an identification information imparting unit (7) for acquiring, from the reproduced content supplied by the content reproducing unit, identification information that identifies respective parts of the content and imparting it to the metadata, so that the generated metadata is associated with the respective parts of the content.
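The association between recognized speech and parts of the content can be sketched as below. This is a hedged illustration only: `Metadata`, `MetadataPreparer`, and `on_recognized` are hypothetical names, and the identification information is assumed to be a playback position in seconds, which the abstract does not specify.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass(frozen=True)
class Metadata:
    part_id: float  # identification information (assumed: playback position in seconds)
    text: str       # information recognized from the voice input


@dataclass
class MetadataPreparer:
    # Callback standing in for the content reproducing unit's current position.
    current_part_id: Callable[[], float]
    entries: List[Metadata] = field(default_factory=list)

    def on_recognized(self, text: str) -> Metadata:
        """Convert recognized text to metadata tagged with the current part."""
        entry = Metadata(part_id=self.current_part_id(), text=text.strip())
        self.entries.append(entry)
        return entry


# Usage: the position list simulates playback advancing between utterances.
position = [12.5]
preparer = MetadataPreparer(current_part_id=lambda: position[0])
preparer.on_recognized("goal scene")
position[0] = 40.0
preparer.on_recognized(" interview ")
```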
Abstract:
There is provided a content tag attachment support device that enables one person to perform both tag attachment work and tag correction work while suppressing an increase in work time. In this device, audio recognition means (104) recognizes input audio. Tag generation means (103) attaches the data obtained by audio recognition as a tag to the content reproduced by content reproducing means (101). Tag correction means (108) sends tag correction information to the tag generation means (103) and sends tag correction start and completion report information to content reproduction control means (109). The content reproduction control means (109) controls the content reproducing means (101) so as to temporarily stop content reproduction in synchronization with the start of the tag correction work and to resume content reproduction in synchronization with the end of the tag correction work.
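The synchronization between tag correction and playback can be sketched as follows. The class and method names are hypothetical, and the start and completion reports are modeled as plain strings; the abstract does not describe the actual signalling.

```python
from typing import List


class ContentPlayer:
    """Stand-in for the content reproducing means."""

    def __init__(self) -> None:
        self.playing = True
        self.log: List[str] = []  # records control actions for inspection

    def pause(self) -> None:
        self.playing = False
        self.log.append("pause")

    def resume(self) -> None:
        self.playing = True
        self.log.append("resume")


class ReproductionController:
    """Stand-in for the content reproduction control means."""

    def __init__(self, player: ContentPlayer) -> None:
        self.player = player

    def on_correction_report(self, event: str) -> None:
        if event == "start":
            self.player.pause()   # stop playback while the tag is corrected
        elif event == "end":
            self.player.resume()  # resume once the correction is complete
        else:
            raise ValueError(f"unknown report: {event!r}")


class TagCorrector:
    """Stand-in for the tag correction means."""

    def __init__(self, tags: List[str], controller: ReproductionController) -> None:
        self.tags = tags
        self.controller = controller

    def correct(self, index: int, new_text: str) -> None:
        self.controller.on_correction_report("start")
        try:
            self.tags[index] = new_text
        finally:
            # Always report completion so playback is never left paused.
            self.controller.on_correction_report("end")


# Usage: correct a misrecognized tag; playback pauses and then resumes.
player = ContentPlayer()
tags = ["gaol"]
corrector = TagCorrector(tags, ReproductionController(player))
corrector.correct(0, "goal")
```

Reporting completion in a `finally` clause is a design choice of this sketch: playback resumes even if the correction itself fails.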