COMMUNICATION INTERFACE APPARATUS AND METHOD FOR MULTI-USER AND SYSTEM
    1.
    发明申请
    COMMUNICATION INTERFACE APPARATUS AND METHOD FOR MULTI-USER AND SYSTEM 有权
    通信接口设备和多用户和系统的方法

    公开(公告)号:US20120278066A1

    公开(公告)日:2012-11-01

    申请号:US13512222

    申请日:2010-11-09

    IPC分类号: G10L15/06 G10L15/26 G06F17/27

    摘要: A communication interface apparatus for a system and a plurality of users is provided. The communication interface apparatus for the system and the plurality of users includes a first process unit configured to receive voice information and face information from at least one user, and determine whether the received voice information is voice information of at least one registered user based on user models corresponding to the respective received voice information and face information; a second process unit configured to receive the face information, and determine whether the at least one user's attention is on the system based on the received face information; and a third process unit configured to receive the voice information, analyze the received voice information, and determine whether the received voice information is substantially meaningful to the system based on a dialog model that represents conversation flow on a situation basis.

    摘要翻译: 提供了一种用于系统和多个用户的通信接口装置。 用于系统和多个用户的通信接口装置包括:第一处理单元,被配置为从至少一个用户接收语音信息和面部信息,并且基于用户来确定接收的语音信息是否是至少一个注册用户的语音信息 对应于相应接收的语音信息和面部信息的模型; 第二处理单元,被配置为接收面部信息,并且基于所接收的脸部信息来确定至少一个用户的注意力是否在系统上; 以及第三处理单元,被配置为接收语音信息,分析所接收的语音信息,并且基于在情况基础上表示对话流的对话模型来确定所接收的语音信息对系统是否基本有意义。

    Apparatus and method for providing a reliable voice interface between a system and multiple users

    公开(公告)号:US09799332B2

    公开(公告)日:2017-10-24

    申请号:US13512222

    申请日:2010-11-09

    IPC分类号: G10L15/22 G10L17/10

    摘要: A communication interface apparatus for a system and a plurality of users is provided. The communication interface apparatus for the system and the plurality of users includes a first process unit configured to receive voice information and face information from at least one user, and determine whether the received voice information is voice information of at least one registered user based on user models corresponding to the respective received voice information and face information; a second process unit configured to receive the face information, and determine whether the at least one user's attention is on the system based on the received face information; and a third process unit configured to receive the voice information, analyze the received voice information, and determine whether the received voice information is substantially meaningful to the system based on a dialog model that represents conversation flow on a situation basis.

    Apparatus, medium, and method clustering audio files
    3.
    发明授权
    Apparatus, medium, and method clustering audio files 有权
    装置,媒介和方法聚类音频文件

    公开(公告)号:US07593937B2

    公开(公告)日:2009-09-22

    申请号:US11489463

    申请日:2006-07-20

    IPC分类号: G06F17/30 G06F7/00

    摘要: An apparatus, medium, and method providing audio files with clustering, with audio files having information similar to queries input from a user being extracted and undergo clustering. A method for providing audio files with clustering includes calculating scores between queries input from a user and a specified audio file, detecting audio files having specified scores with the queries input from the user on the basis of the result of calculation and performing a dynamic clustering of the audio files, detecting the audio files having the specified scores with the queries input from the user and performing a static clustering of the audio files, and displaying the dynamic cluster or the static cluster on a screen.

    摘要翻译: 提供具有聚类的音频文件的装置,介质和方法,其中音频文件具有类似于从提取和进行聚类的用户输入的查询的信息。 用于提供具有聚类的音频文件的方法包括计算从用户输入的查询和指定音频文件之间的分数,根据计算结果检测具有从用户输入的查询的指定分数的音频文件,并且执行动态聚类 音频文件,使用从用户输入的查询来检测具有指定分数的音频文件,并执行音频文件的静态聚类,以及在屏幕上显示动态集群或静态集群。

    Information retrieval method in mobile environment and clustering method and information retrieval system using personal search history
    4.
    发明申请
    Information retrieval method in mobile environment and clustering method and information retrieval system using personal search history 审中-公开
    移动环境中的信息检索方法和使用个人搜索历史的聚类方法和信息检索系统

    公开(公告)号:US20080071776A1

    公开(公告)日:2008-03-20

    申请号:US11882332

    申请日:2007-07-31

    IPC分类号: G06F17/30

    CPC分类号: G06F16/9535 Y02D10/45

    摘要: A mobile information retrieval method, clustering method, and an information retrieval system using a user's search history. The mobile information retrieval method includes receiving the user's query information and retrieving information related to the query information through predetermined networks in a database in which history information generated by previous retrieval is stored. The mobile information retrieval method, clustering method, and information retrieval system can relieve inconvenience of information retrieval caused by limits in terms of a display screen, battery capacity and computing resources, and can curtail charges for Internet use and data downloads.

    摘要翻译: 移动信息检索方法,聚类方法和使用用户搜索历史的信息检索系统。 移动信息检索方法包括通过存储由先前检索生成的历史信息的数据库中的预定网络来接收用户的查询信息和检索与查询信息相关的信息。 移动信息检索方法,聚类方法和信息检索系统可以减轻显示屏幕,电池容量和计算资源限制引起的信息检索的不便,并可减少互联网使用和数据下载的费用。

    Apparatus, medium, and method clustering audio files
    5.
    发明申请
    Apparatus, medium, and method clustering audio files 有权
    装置,媒介和方法聚类音频文件

    公开(公告)号:US20070043768A1

    公开(公告)日:2007-02-22

    申请号:US11489463

    申请日:2006-07-20

    IPC分类号: G06F7/00

    摘要: An apparatus, medium, and method providing audio files with clustering, with audio files having information similar to queries input from a user being extracted and undergo clustering. A method for providing audio files with clustering includes calculating scores between queries input from a user and a specified audio file, detecting audio files having specified scores with the queries input from the user on the basis of the result of calculation and performing a dynamic clustering of the audio files, detecting the audio files having the specified scores with the queries input from the user and performing a static clustering of the audio files, and displaying the dynamic cluster or the static cluster on a screen.

    摘要翻译: 提供具有聚类的音频文件的装置,介质和方法,其中音频文件具有类似于从提取和进行聚类的用户输入的查询的信息。 用于提供具有聚类的音频文件的方法包括计算从用户输入的查询和指定音频文件之间的分数,根据计算结果检测具有从用户输入的查询的指定分数的音频文件,并且执行动态聚类 音频文件,使用从用户输入的查询来检测具有指定分数的音频文件,并执行音频文件的静态聚类,以及在屏幕上显示动态集群或静态集群。

    Apparatus and method for speech recognition using a plurality of confidence score estimation algorithms
    6.
    发明申请
    Apparatus and method for speech recognition using a plurality of confidence score estimation algorithms 有权
    使用多个置信分数估计算法进行语音识别的装置和方法

    公开(公告)号:US20070136058A1

    公开(公告)日:2007-06-14

    申请号:US11517369

    申请日:2006-09-08

    IPC分类号: G10L15/00

    CPC分类号: G10L15/08 G10L2015/088

    摘要: An apparatus for speech recognition includes: a first confidence score calculator calculating a first confidence score using a ratio between a likelihood of a keyword model for feature vectors per frame of a speech signal and a likelihood of a Filler model for the feature vectors; a second confidence score calculator calculating a second confidence score by comparing a Gaussian distribution trace of the keyword model per frame of the speech signal with a Gaussian distribution trace sample of a stored corresponding keyword of the keyword model; and a determination module determining a confidence of a result using the keyword model in accordance with a position determined by the first and second confidence scores on a confidence coordinate system.

    摘要翻译: 一种用于语音识别的装置包括:第一置信度分数计算器,使用针对每个语音信号的每个特征向量的关键字模型的似然率与特征向量的填充模型的似然率之间的比率来计算第一置信度分数; 第二置信度计算器通过将所述语音信号的每帧的关键字模型的高斯分布轨迹与所述关键字模型的存储的对应关键字的高斯分布轨迹样本进行比较来计算第二置信度分数; 以及确定模块,其根据由置信坐标系上的第一和第二置信度得分确定的位置,使用关键字模型确定结果的置信度。

    Apparatus and method for speech recognition using a plurality of confidence score estimation algorithms
    7.
    发明授权
    Apparatus and method for speech recognition using a plurality of confidence score estimation algorithms 有权
    使用多个置信分数估计算法进行语音识别的装置和方法

    公开(公告)号:US08543399B2

    公开(公告)日:2013-09-24

    申请号:US11517369

    申请日:2006-09-08

    IPC分类号: G10L15/00

    CPC分类号: G10L15/08 G10L2015/088

    摘要: An apparatus for speech recognition includes: a first confidence score calculator calculating a first confidence score using a ratio between a likelihood of a keyword model for feature vectors per frame of a speech signal and a likelihood of a Filler model for the feature vectors; a second confidence score calculator calculating a second confidence score by comparing a Gaussian distribution trace of the keyword model per frame of the speech signal with a Gaussian distribution trace sample of a stored corresponding keyword of the keyword model; and a determination module determining a confidence of a result using the keyword model in accordance with a position determined by the first and second confidence scores on a confidence coordinate system.

    摘要翻译: 一种用于语音识别的装置包括:第一置信度分数计算器,使用针对每个语音信号的每个特征向量的关键字模型的似然率与特征向量的填充模型的似然率之间的比率来计算第一置信度分数; 第二置信度计算器通过将所述语音信号的每帧的关键字模型的高斯分布轨迹与所述关键字模型的存储的对应关键字的高斯分布轨迹样本进行比较来计算第二置信度分数; 以及确定模块,其根据由置信坐标系上的第一和第二置信度得分确定的位置,使用关键字模型确定结果的置信度。

    Apparatus and method for recognizing voice
    8.
    发明授权
    Apparatus and method for recognizing voice 有权
    用于识别语音的装置和方法

    公开(公告)号:US08140334B2

    公开(公告)日:2012-03-20

    申请号:US11475963

    申请日:2006-06-28

    IPC分类号: G10L15/14 G10L15/00

    CPC分类号: G10L15/142

    摘要: An apparatus and method for recognizing voice. The apparatus includes a feature vector extraction unit dividing an input voice signal into predetermined unit regions, and extracting feature vectors corresponding to each of the unit regions; a predicted node extraction unit extracting a list of second nodes whose travels to a first node corresponding to the extracted feature vectors are predicted, with reference to a network of one or more nodes; a single waveform similarity calculation unit calculating degrees of single waveform similarity of the first node and the second nodes of the list by substituting the extracted feature vectors into single waveform probability distributions that constitute voice signals corresponding to the second nodes; a multiple waveform similarity calculation unit calculating degrees of multiple waveform similarity by substituting the extracted feature vectors into multiple waveform probability distributions that constitute single waveform probability distributions usable to calculate the degrees of single waveform similarity in a preset range; and an output unit outputting a function-performing signal corresponding to a multiple waveform probability distribution that enables calculation of a highest of the calculated degrees of multiple waveform similarity.

    摘要翻译: 用于识别语音的装置和方法。 该装置包括:特征向量提取单元,将输入的语音信号划分为预定的单位区域;提取与每个单位区域对应的特征向量; 参考一个或多个节点的网络,预测提取与对应于所提取的特征向量的对第一节点的行进的第二节点的列表的预测节点提取单元; 单个波形相似度计算单元,通过将提取的特征向量代入构成对应于第二节点的语音信号的单波形概率分布来计算第一节点和列表的第二节点的单波形相似度的度数; 多波形相似度计算单元,通过将所提取的特征向量代入构成单个波形概率分布的多个波形概率分布来计算多个波形相似度,以计算预设范围内的单一波形相似度; 以及输出单元,输出与多波形概率分布相对应的功能执行信号,能够计算所计算出的多重波形相似度的最高值。

    Apparatus and method for recognizing voice
    9.
    发明申请
    Apparatus and method for recognizing voice 有权
    用于识别语音的装置和方法

    公开(公告)号:US20070083371A1

    公开(公告)日:2007-04-12

    申请号:US11475963

    申请日:2006-06-28

    IPC分类号: G10L15/14

    CPC分类号: G10L15/142

    摘要: An apparatus and method for recognizing voice. The apparatus includes a feature vector extraction unit dividing an input voice signal into predetermined unit regions, and extracting feature vectors corresponding to each of the unit regions; a predicted node extraction unit extracting a list of second nodes whose travels to a first node corresponding to the extracted feature vectors are predicted, with reference to a network of one or more nodes; a single waveform similarity calculation unit calculating degrees of single waveform similarity of the first node and the second nodes of the list by substituting the extracted feature vectors into single waveform probability distributions that constitute voice signals corresponding to the second nodes; a multiple waveform similarity calculation unit calculating degrees of multiple waveform similarity by substituting the extracted feature vectors into multiple waveform probability distributions that constitute single waveform probability distributions usable to calculate the degrees of single waveform similarity in a preset range; and an output unit outputting a function-performing signal corresponding to a multiple waveform probability distribution that enables calculation of a highest of the calculated degrees of multiple waveform similarity.

    摘要翻译: 用于识别语音的装置和方法。 该装置包括:特征向量提取单元,将输入的语音信号划分为预定的单位区域;提取与每个单位区域对应的特征向量; 参考一个或多个节点的网络,预测提取与对应于所提取的特征向量的对第一节点的行进的第二节点的列表的预测节点提取单元; 单个波形相似度计算单元,通过将提取的特征向量代入构成对应于第二节点的语音信号的单波形概率分布来计算第一节点和列表的第二节点的单波形相似度的度数; 多波形相似度计算单元,通过将所提取的特征向量代入构成单个波形概率分布的多个波形概率分布来计算多个波形相似度,以计算预设范围内的单一波形相似度; 以及输出单元,输出与多波形概率分布相对应的功能执行信号,能够计算所计算出的多重波形相似度的最高值。

    Apparatus for positioning screen sound source, method of generating loudspeaker set information, and method of reproducing positioned screen sound source
    10.
    发明授权
    Apparatus for positioning screen sound source, method of generating loudspeaker set information, and method of reproducing positioned screen sound source 有权
    用于定位屏幕声源的装置,产生扬声器组信息的方法,以及再现定位的屏幕声源的方法

    公开(公告)号:US08208663B2

    公开(公告)日:2012-06-26

    申请号:US12482883

    申请日:2009-06-11

    IPC分类号: H04R5/02

    CPC分类号: H04R5/04

    摘要: An apparatus for positioning a screen sound source, a method of generating loudspeaker set information for screen sound source positioning, and a method of reproducing a positioned screen sound source are provided. The apparatus and methods relate to a screen sound source positioning technique. A plurality of loudspeakers, each configured to have approximately the same gain, are each disposed proximate to the edge of a display, and a loudspeaker set including at least two of the loudspeakers is selected to position a virtual sound source substantially synchronized with a visual object displayed at a position on the screen of the display. Accordingly, a virtual sound source may be positioned at a certain specific position on the screen of a display without sound source distortion.

    摘要翻译: 提供一种用于定位屏幕声源的装置,一种产生用于屏幕声源定位的扬声器组信息的方法以及再现定位的屏幕声源的方法。 该装置和方法涉及屏幕声源定位技术。 每个配置成具有近似相同增益的多个扬声器各自设置在显示器的边缘附近,并且选择包括至少两个扬声器的扬声器组,以将基本上与视觉对象同步的虚拟声源定位 显示在显示屏的屏幕上的位置。 因此,虚拟声源可以位于显示器的屏幕上的某个特定位置,而没有声源失真。