Physical interaction zone for gesture-based user interfaces
    1.
    Granted patent
    Physical interaction zone for gesture-based user interfaces (status: in force)

    Publication number: US08659658B2

    Publication date: 2014-02-25

    Application number: US12703143

    Filing date: 2010-02-09

    IPC classification: H04N7/18

    Abstract: In a motion capture system having a depth camera, a physical interaction zone of a user is defined based on a size of the user and other factors. The zone is a volume in which the user performs hand gestures to provide inputs to an application. The shape and location of the zone can be customized for the user. The zone is anchored to the user so that the gestures can be performed from any location in the field of view. Also, the zone is kept between the user and the depth camera even as the user rotates his or her body so that the user is not facing the camera. A display provides feedback based on a mapping from a coordinate system of the zone to a coordinate system of the display. The user can move a cursor on the display or control an avatar.
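
    The zone-to-display mapping described in this abstract can be illustrated with a minimal sketch. It assumes a flat, axis-aligned 2D zone anchored at the user's shoulder and scaled by arm length; the names Zone, zone_for_user, and zone_to_display and the scale factors are hypothetical and do not come from the patent, which describes a volume whose shape can be customized.

```python
from dataclasses import dataclass


@dataclass
class Zone:
    """Axis-aligned interaction zone in camera space (meters). Hypothetical layout."""
    x_min: float
    x_max: float
    y_min: float
    y_max: float


def zone_for_user(shoulder_x, shoulder_y, arm_length):
    """Anchor a zone to the user's shoulder and scale it by arm length.

    The 1.0 and 0.75 scale factors are illustrative assumptions only.
    """
    half_w = 1.0 * arm_length / 2
    half_h = 0.75 * arm_length / 2
    return Zone(shoulder_x - half_w, shoulder_x + half_w,
                shoulder_y - half_h, shoulder_y + half_h)


def zone_to_display(zone, hand_x, hand_y, width_px, height_px):
    """Map a hand position in camera space to a display pixel via the zone."""
    # Normalize to [0, 1] inside the zone; hands outside the zone are clamped.
    u = min(max((hand_x - zone.x_min) / (zone.x_max - zone.x_min), 0.0), 1.0)
    v = min(max((hand_y - zone.y_min) / (zone.y_max - zone.y_min), 0.0), 1.0)
    # Camera-space y grows upward while display y grows downward, so flip v.
    px = round(u * (width_px - 1))
    py = round((1.0 - v) * (height_px - 1))
    return px, py
```

    Because the zone is rebuilt from the user's own position, a hand held at the shoulder maps to the same pixel wherever the user stands: zone_to_display(zone_for_user(0.0, 1.0, 0.5), 0.0, 1.0, 101, 101) and zone_to_display(zone_for_user(2.0, 1.0, 0.5), 2.0, 1.0, 101, 101) give the same result.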

    PHYSICAL INTERACTION ZONE FOR GESTURE-BASED USER INTERFACES
    2.
    Patent application
    PHYSICAL INTERACTION ZONE FOR GESTURE-BASED USER INTERFACES (status: in force)

    Publication number: US20110193939A1

    Publication date: 2011-08-11

    Application number: US12703143

    Filing date: 2010-02-09

    IPC classification: H04N5/225 H04N13/02

    Abstract: In a motion capture system having a depth camera, a physical interaction zone of a user is defined based on a size of the user and other factors. The zone is a volume in which the user performs hand gestures to provide inputs to an application. The shape and location of the zone can be customized for the user. The zone is anchored to the user so that the gestures can be performed from any location in the field of view. Also, the zone is kept between the user and the depth camera even as the user rotates his or her body so that the user is not facing the camera. A display provides feedback based on a mapping from a coordinate system of the zone to a coordinate system of the display. The user can move a cursor on the display or control an avatar.

    SPEECH RECOGNITION USER INTERFACE
    3.
    Patent application
    SPEECH RECOGNITION USER INTERFACE (status: under examination, published)

    Publication number: US20120089392A1

    Publication date: 2012-04-12

    Application number: US12900004

    Filing date: 2010-10-07

    IPC classification: G10L15/00 G10L21/00

    Abstract: Speech recognition techniques are disclosed herein. In one embodiment, a novice mode is available such that when the user is unfamiliar with the speech recognition system, a voice user interface (VUI) may be provided to guide them. The VUI may display one or more speech commands that are presently available. The VUI may also provide feedback to train the user. After the user becomes more familiar with speech recognition, the user may enter speech commands without the aid of the novice mode. In this “experienced mode,” the VUI need not be displayed. Therefore, the user interface is not cluttered.

    SPEECH RECOGNITION ANALYSIS VIA IDENTIFICATION INFORMATION
    4.
    Patent application
    SPEECH RECOGNITION ANALYSIS VIA IDENTIFICATION INFORMATION (status: in force)

    Publication number: US20110184735A1

    Publication date: 2011-07-28

    Application number: US12692538

    Filing date: 2010-01-22

    Abstract: Embodiments are disclosed that relate to the use of identity information to help avoid the occurrence of false positive speech recognition events in a speech recognition system. One embodiment provides a method comprising receiving speech recognition data comprising a recognized speech segment, acoustic locational data related to a location of origin of the recognized speech segment as determined via signals from the microphone array, and confidence data comprising a recognition confidence value, and also receiving image data comprising visual locational information related to a location of each person in an image. The acoustic locational data is compared to the visual locational data to determine whether the recognized speech segment originated from a person in the field of view of the image sensor, and the confidence data is adjusted depending on this determination.
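
    The comparison of acoustic and visual location data described in this abstract can be sketched as follows. This is a minimal illustration, assuming both locations are reduced to a single horizontal angle in degrees; the function name adjust_confidence, the tolerance, and the boost and penalty factors are hypothetical and are not taken from the patent.

```python
def adjust_confidence(confidence, sound_angle_deg, person_angles_deg,
                      tolerance_deg=10.0, boost=1.2, penalty=0.5):
    """Adjust a recognition confidence using visual location data.

    confidence: recognition confidence value in [0, 1].
    sound_angle_deg: direction of the speech estimated from the microphone array.
    person_angles_deg: directions of each person detected in the image.
    All thresholds and factors are illustrative assumptions.
    """
    # The speech is attributed to a visible person if its direction is
    # within the tolerance of at least one detected person.
    matched = any(abs(sound_angle_deg - angle) <= tolerance_deg
                  for angle in person_angles_deg)
    # Raise confidence for a match, lower it otherwise, and cap at 1.0.
    factor = boost if matched else penalty
    return min(1.0, confidence * factor), matched
```

    For example, adjust_confidence(0.6, 80.0, [10.0]) halves the confidence because no visible person stands near the direction the sound came from, which is the false-positive case the abstract aims to suppress.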

    Speech recognition analysis via identification information
    5.
    Granted patent
    Speech recognition analysis via identification information (status: in force)

    Publication number: US08676581B2

    Publication date: 2014-03-18

    Application number: US12692538

    Filing date: 2010-01-22

    IPC classification: G10L15/00

    Abstract: Embodiments are disclosed that relate to the use of identity information to help avoid the occurrence of false positive speech recognition events in a speech recognition system. One embodiment provides a method comprising receiving speech recognition data comprising a recognized speech segment, acoustic locational data related to a location of origin of the recognized speech segment as determined via signals from the microphone array, and confidence data comprising a recognition confidence value, and also receiving image data comprising visual locational information related to a location of each person in an image. The acoustic locational data is compared to the visual locational data to determine whether the recognized speech segment originated from a person in the field of view of the image sensor, and the confidence data is adjusted depending on this determination.

    Compound gesture-speech commands
    6.
    Granted patent
    Compound gesture-speech commands (status: in force)

    Publication number: US08296151B2

    Publication date: 2012-10-23

    Application number: US12818898

    Filing date: 2010-06-18

    IPC classification: G10L15/00 G10L21/00

    Abstract: A multimedia entertainment system combines both gestures and voice commands to provide an enhanced control scheme. A user's body position or motion may be recognized as a gesture, and may be used to provide context to recognize user generated sounds, such as speech input. Likewise, speech input may be recognized as a voice command, and may be used to provide context to recognize a body position or motion as a gesture. Weights may be assigned to the inputs to facilitate processing. When a gesture is recognized, a limited set of voice commands associated with the recognized gesture are loaded for use. Further, additional sets of voice commands may be structured in a hierarchical manner such that speaking a voice command from one set of voice commands leads to the system loading a next set of voice commands.
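
    The gesture-scoped, hierarchical command sets described in this abstract can be sketched as a nested dictionary. This is a minimal illustration only: the gesture names, command words, and the class CompoundCommandSession are hypothetical, and the weighting of inputs mentioned in the abstract is not modeled.

```python
# Hypothetical gestures and command trees, for illustration only.
# Each command maps to the next (possibly empty) set of commands.
COMMANDS_BY_GESTURE = {
    "raise_hand": {
        "play": {"movie": {}, "music": {}},
        "pause": {},
    },
    "point_at_screen": {
        "select": {},
        "back": {},
    },
}


class CompoundCommandSession:
    """Tracks which voice commands are currently loaded."""

    def __init__(self, commands_by_gesture):
        self._commands_by_gesture = commands_by_gesture
        self._active = {}  # no voice commands are loaded before a gesture

    def on_gesture(self, gesture):
        """Load the limited command set tied to a recognized gesture."""
        self._active = self._commands_by_gesture.get(gesture, {})
        return sorted(self._active)

    def on_speech(self, word):
        """Accept a word only if it is in the loaded set, then load the next set."""
        if word not in self._active:
            return None  # not a currently available command; ignore it
        self._active = self._active[word]
        return sorted(self._active)
```

    In this sketch, speaking "play" before any gesture is ignored; after on_gesture("raise_hand") the same word is accepted and the next set, "movie" and "music", is loaded.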

    COMPOUND GESTURE-SPEECH COMMANDS
    7.
    Patent application
    COMPOUND GESTURE-SPEECH COMMANDS (status: in force)

    Publication number: US20110313768A1

    Publication date: 2011-12-22

    Application number: US12818898

    Filing date: 2010-06-18

    IPC classification: G10L15/00 G06F3/16 G06K9/46

    Abstract: A multimedia entertainment system combines both gestures and voice commands to provide an enhanced control scheme. A user's body position or motion may be recognized as a gesture, and may be used to provide context to recognize user generated sounds, such as speech input. Likewise, speech input may be recognized as a voice command, and may be used to provide context to recognize a body position or motion as a gesture. Weights may be assigned to the inputs to facilitate processing. When a gesture is recognized, a limited set of voice commands associated with the recognized gesture are loaded for use. Further, additional sets of voice commands may be structured in a hierarchical manner such that speaking a voice command from one set of voice commands leads to the system loading a next set of voice commands.

    Self-service checkout terminal
    9.
    Granted patent
    Self-service checkout terminal (status: lapsed)

    Publication number: US6167381A

    Publication date: 2000-12-26

    Application number: US20056

    Filing date: 1998-02-06

    IPC classification: A47F9/04 G06F17/60

    Abstract: A self-service checkout terminal includes a base having a bagwell defined therein. The terminal also includes a first counter supported on the base. The first counter has a first surface which is positioned at a first height. The terminal further includes a scanner secured at a first end of the first counter. The terminal yet further includes an automated teller machine secured at a second end of the first counter. Moreover, the terminal includes an arcuate shaped second counter secured to the first counter, the second counter having a second surface which is positioned at a second height. The first counter has a bagwell opening defined therein at a location interposed between the scanner and the automated teller machine. The bagwell opening is aligned with the bagwell. The first height is less than the second height.
