SPEECH RECOGNITION USER INTERFACE
    3.
    发明申请
    SPEECH RECOGNITION USER INTERFACE 审中-公开
    语音识别用户界面

    公开(公告)号:US20120089392A1

    公开(公告)日:2012-04-12

    申请号:US12900004

    申请日:2010-10-07

    IPC分类号: G10L15/00 G10L21/00

    摘要: Speech recognition techniques are disclosed herein. In one embodiment, a novice mode is available such that when the user is unfamiliar with the speech recognition system, a voice user interface (VUI) may be provided to guide them. The VUI may display one or more speech commands that are presently available. The VUI may also provide feedback to train the user. After the user becomes more familiar with speech recognition, the user may enter speech commands without the aid of the novice mode. In this “experienced mode,” the VUI need not be displayed. Therefore, the user interface is not cluttered.

    摘要翻译: 本文公开了语音识别技术。 在一个实施例中,新手模式是可用的,使得当用户不熟悉语音识别系统时,可以提供语音用户界面(VUI)来引导它们。 VUI可以显示当前可用的一个或多个语音命令。 VUI还可以提供反馈来训练用户。 在用户更熟悉语音识别之后,用户可以在没有新手模式的帮助下输入语音命令。 在这种“有经验的模式”中,VUI不需要显示。 因此,用户界面不会凌乱。

    SPEECH RECOGNITION ANALYSIS VIA IDENTIFICATION INFORMATION
    4.
    发明申请
    SPEECH RECOGNITION ANALYSIS VIA IDENTIFICATION INFORMATION 有权
    通过识别信息进行语音识别分析

    公开(公告)号:US20110184735A1

    公开(公告)日:2011-07-28

    申请号:US12692538

    申请日:2010-01-22

    摘要: Embodiments are disclosed that relate to the use of identity information to help avoid the occurrence of false positive speech recognition events in a speech recognition system. One embodiment provides a method comprising receiving speech recognition data comprising a recognized speech segment, acoustic locational data related to a location of origin of the recognized speech segment as determined via signals from the microphone array, and confidence data comprising a recognition confidence value, and also receiving image data comprising visual locational information related to a location of each person in an image. The acoustic locational data is compared to the visual locational data to determine whether the recognized speech segment originated from a person in the field of view of the image sensor, and the confidence data is adjusted depending on this determination.

    摘要翻译: 公开了涉及使用身份信息来帮助避免在语音识别系统中发生假阳性语音识别事件的实施例。 一个实施例提供了一种方法,包括接收包括识别的语音段的语音识别数据,与通过来自麦克风阵列的信号确定的识别的语音段的原点的位置有关的声学位置数据,以及包括识别置信度的置信度数据,以及 接收图像数据,其包括与图像中的每个人的位置相关的视觉位置信息。 将声学位置数据与视觉位置数据进行比较,以确定识别的语音片段是否源于图像传感器的视野中的人,并且根据该确定来调整置信度数据。

    Self-service checkout terminal
    5.
    发明授权
    Self-service checkout terminal 失效
    自助结帐终端

    公开(公告)号:US6167381A

    公开(公告)日:2000-12-26

    申请号:US20056

    申请日:1998-02-06

    IPC分类号: A47F9/04 G06F17/60

    摘要: A self-service checkout terminal includes a base having a bagwell defined therein. The terminal also includes a first counter supported on the base. The first counter has a first surface which is positioned at a first height. The terminal further includes a scanner secured at a first end of the first counter. The terminal yet further includes an automated teller machine secured at a second end of the first counter. Moreover, the terminal includes an arcuate shaped second counter secured to the first counter, the second counter having a second surface which is positioned at a second height. The first counter has a bagwell opening defined therein at a location interposed between the scanner and the automated teller machine. The bagwell opening is aligned with the bagwell. The first height is less than the second height.

    摘要翻译: 自助结账终端包括其中限定有袋状的基座。 终端还包括在基座上支撑的第一计数器。 第一计数器具有位于第一高度的第一表面。 终端还包括固定在第一计数器的第一端的扫描仪。 终端还包括固定在第一计数器的第二端的自动取款机。 此外,端子包括固定到第一计数器的弓形形状的第二计数器,第二计数器具有位于第二高度的第二表面。 第一个计数器在介于扫描仪和自动柜员机之间的位置处具有限定在其中的袋状开口。 袋状开口与袋状孔对齐。 第一个高度小于第二个高度。

    Speech recognition analysis via identification information
    7.
    发明授权
    Speech recognition analysis via identification information 有权
    通过识别信息进行语音识别分析

    公开(公告)号:US08676581B2

    公开(公告)日:2014-03-18

    申请号:US12692538

    申请日:2010-01-22

    IPC分类号: G10L15/00

    摘要: Embodiments are disclosed that relate to the use of identity information to help avoid the occurrence of false positive speech recognition events in a speech recognition system. One embodiment provides a method comprising receiving speech recognition data comprising a recognized speech segment, acoustic locational data related to a location of origin of the recognized speech segment as determined via signals from the microphone array, and confidence data comprising a recognition confidence value, and also receiving image data comprising visual locational information related to a location of each person in an image. The acoustic locational data is compared to the visual locational data to determine whether the recognized speech segment originated from a person in the field of view of the image sensor, and the confidence data is adjusted depending on this determination.

    摘要翻译: 公开了涉及使用身份信息来帮助避免在语音识别系统中发生假阳性语音识别事件的实施例。 一个实施例提供了一种方法,包括接收包括识别的语音段的语音识别数据,与通过来自麦克风阵列的信号确定的识别的语音段的原点的位置有关的声学位置数据,以及包括识别置信度的置信度数据,以及 接收图像数据,其包括与图像中的每个人的位置相关的视觉位置信息。 将声学位置数据与视觉位置数据进行比较,以确定识别的语音片段是否源于图像传感器的视野中的人,并且根据该确定来调整置信度数据。

    Method of monitoring item shuffling in a post-scan area of a self-service checkout terminal
    9.
    再颁专利
    Method of monitoring item shuffling in a post-scan area of a self-service checkout terminal 有权
    在自助结账终端的后扫描区域中监视项目改组的方法

    公开(公告)号:USRE41093E1

    公开(公告)日:2010-02-02

    申请号:US10038377

    申请日:2001-10-19

    IPC分类号: A63F9/02

    摘要: A method of monitoring item shuffling in a post-scan area of a self-service checkout terminal having a post-scan shelf, a bagwell with a grocery container positioned therein, and a weight scale positioned so as to detect weight of items positioned both on the post-scan shelf and in the grocery container, includes the step of detecting removal of a first number of items from the post-scan shelf with the weight scale and generating a first weight decrease value in response thereto which corresponds to the weight of the first number of items. The method also includes the step of detecting placement of a second number of items into the grocery container with the weight scale and generating a first weight increase value in response thereto which corresponds to the weight of the second number of items. The method further includes the step of comparing the first weight decrease value to the first weight increase value and generating a first match control signal in response thereto if the first weight decrease value matches the first weight increase value.

    摘要翻译: 一种在具有扫描后架子的自助服务检验终端的后扫描区域中监视项目改组的方法,其中定位有杂货容器的袋状空间和定位成检测位于两者上的物品的重量的重量 扫描后货架和杂货容器中的步骤包括以下步骤:检测具有重量标尺的扫描后货架中的第一数量物品的移除,并产生响应于该重量的第一减重值,其对应于 第一个数量的项目。 该方法还包括以下步骤:检测第二数量的物品到重量级的杂货容器中的位置,并响应于此产生对应于第二数量物品的重量的第一增重值。 该方法还包括以下步骤:如果第一减重值与第一加权增加值匹配,则比较第一减重值和第一加权增加值,并响应于此产生第一匹配控制信号。

    Compound gesture-speech commands
    10.
    发明授权
    Compound gesture-speech commands 有权
    复合手势 - 语音命令

    公开(公告)号:US08296151B2

    公开(公告)日:2012-10-23

    申请号:US12818898

    申请日:2010-06-18

    IPC分类号: G10L15/00 G10L21/00

    摘要: A multimedia entertainment system combines both gestures and voice commands to provide an enhanced control scheme. A user's body position or motion may be recognized as a gesture, and may be used to provide context to recognize user generated sounds, such as speech input. Likewise, speech input may be recognized as a voice command, and may be used to provide context to recognize a body position or motion as a gesture. Weights may be assigned to the inputs to facilitate processing. When a gesture is recognized, a limited set of voice commands associated with the recognized gesture are loaded for use. Further, additional sets of voice commands may be structured in a hierarchical manner such that speaking a voice command from one set of voice commands leads to the system loading a next set of voice commands.

    摘要翻译: 多媒体娱乐系统结合了手势和语音命令,以提供增强的控制方案。 用户的身体位置或运动可以被识别为手势,并且可以用于提供上下文以识别用户生成的声音,例如语音输入。 类似地,语音输入可以被识别为语音命令,并且可以用于提供上下文以将身体位置或运动识别为手势。 可以将权重分配给输入以便于处理。 当识别到手势时,加载与识别的手势相关联的一组有限的语音命令以供使用。 此外,附加语音命令集可以以分层方式构造,使得从一组语音命令发出语音命令导致系统加载下一组语音命令。