-
公开(公告)号:US08659658B2
公开(公告)日:2014-02-25
申请号:US12703143
申请日:2010-02-09
IPC分类号: H04N7/18
CPC分类号: G06F3/011 , A63F2300/1093 , A63F2300/8011 , G06F3/017 , G06F3/0304 , G06K9/00369 , G06T7/251 , G06T2207/10028 , G06T2207/30196
摘要: In a motion capture system having a depth camera, a physical interaction zone of a user is defined based on a size of the user and other factors. The zone is a volume in which the user performs hand gestures to provide inputs to an application. The shape and location of the zone can be customized for the user. The zone is anchored to the user so that the gestures can be performed from any location in the field of view. Also, the zone is kept between the user and the depth camera even as the user rotates his or her body so that the user is not facing the camera. A display provides feedback based on a mapping from a coordinate system of the zone to a coordinate system of the display. The user can move a cursor on the display or control an avatar.
摘要翻译: 在具有深度相机的运动捕捉系统中,基于用户的大小和其他因素来定义用户的物理交互区域。 该区域是用户执行手势以向应用程序提供输入的卷。 可以为用户定制区域的形状和位置。 该区域被锚定到用户,使得可以在视野中的任何位置执行手势。 此外,即使用户旋转他或她的身体,使得用户不面对相机,该区域也保持在用户和深度相机之间。 显示器基于从区域的坐标系到显示器的坐标系的映射来提供反馈。 用户可以在显示器上移动光标或控制化身。
-
2.
公开(公告)号:US20110193939A1
公开(公告)日:2011-08-11
申请号:US12703143
申请日:2010-02-09
CPC分类号: G06F3/011 , A63F2300/1093 , A63F2300/8011 , G06F3/017 , G06F3/0304 , G06K9/00369 , G06T7/251 , G06T2207/10028 , G06T2207/30196
摘要: In a motion capture system having a depth camera, a physical interaction zone of a user is defined based on a size of the user and other factors. The zone is a volume in which the user performs hand gestures to provide inputs to an application. The shape and location of the zone can be customized for the user. The zone is anchored to the user so that the gestures can be performed from any location in the field of view. Also, the zone is kept between the user and the depth camera even as the user rotates his or her body so that the user is not facing the camera. A display provides feedback based on a mapping from a coordinate system of the zone to a coordinate system of the display. The user can move a cursor on the display or control an avatar.
摘要翻译: 在具有深度相机的运动捕捉系统中,基于用户的大小和其他因素来定义用户的物理交互区域。 该区域是用户执行手势以向应用程序提供输入的卷。 可以为用户定制区域的形状和位置。 该区域被锚定到用户,使得可以在视野中的任何位置执行手势。 此外,即使用户旋转他或她的身体,使得用户不面对相机,该区域也保持在用户和深度相机之间。 显示器基于从区域的坐标系到显示器的坐标系的映射来提供反馈。 用户可以在显示器上移动光标或控制化身。
-
公开(公告)号:US20120089392A1
公开(公告)日:2012-04-12
申请号:US12900004
申请日:2010-10-07
CPC分类号: G10L15/22 , G06F3/167 , G10L15/063
摘要: Speech recognition techniques are disclosed herein. In one embodiment, a novice mode is available such that when the user is unfamiliar with the speech recognition system, a voice user interface (VUI) may be provided to guide them. The VUI may display one or more speech commands that are presently available. The VUI may also provide feedback to train the user. After the user becomes more familiar with speech recognition, the user may enter speech commands without the aid of the novice mode. In this “experienced mode,” the VUI need not be displayed. Therefore, the user interface is not cluttered.
摘要翻译: 本文公开了语音识别技术。 在一个实施例中,新手模式是可用的,使得当用户不熟悉语音识别系统时,可以提供语音用户界面(VUI)来引导它们。 VUI可以显示当前可用的一个或多个语音命令。 VUI还可以提供反馈来训练用户。 在用户更熟悉语音识别之后,用户可以在没有新手模式的帮助下输入语音命令。 在这种“有经验的模式”中,VUI不需要显示。 因此,用户界面不会凌乱。
-
公开(公告)号:US20110184735A1
公开(公告)日:2011-07-28
申请号:US12692538
申请日:2010-01-22
申请人: Jason Flaks , Dax Hawkins , Christian Klein , Mitchell Stephen Dernis , Tommer Leyvand , Ali M. Vassigh , Duncan McKay
发明人: Jason Flaks , Dax Hawkins , Christian Klein , Mitchell Stephen Dernis , Tommer Leyvand , Ali M. Vassigh , Duncan McKay
CPC分类号: G10L15/24 , A63F2300/1081 , A63F2300/1087 , A63F2300/6072 , G06K9/0057 , G10L17/00 , G10L2015/228 , G10L2021/02166
摘要: Embodiments are disclosed that relate to the use of identity information to help avoid the occurrence of false positive speech recognition events in a speech recognition system. One embodiment provides a method comprising receiving speech recognition data comprising a recognized speech segment, acoustic locational data related to a location of origin of the recognized speech segment as determined via signals from the microphone array, and confidence data comprising a recognition confidence value, and also receiving image data comprising visual locational information related to a location of each person in an image. The acoustic locational data is compared to the visual locational data to determine whether the recognized speech segment originated from a person in the field of view of the image sensor, and the confidence data is adjusted depending on this determination.
摘要翻译: 公开了涉及使用身份信息来帮助避免在语音识别系统中发生假阳性语音识别事件的实施例。 一个实施例提供了一种方法,包括接收包括识别的语音段的语音识别数据,与通过来自麦克风阵列的信号确定的识别的语音段的原点的位置有关的声学位置数据,以及包括识别置信度的置信度数据,以及 接收图像数据,其包括与图像中的每个人的位置相关的视觉位置信息。 将声学位置数据与视觉位置数据进行比较,以确定识别的语音片段是否源于图像传感器的视野中的人,并且根据该确定来调整置信度数据。
-
公开(公告)号:US08676581B2
公开(公告)日:2014-03-18
申请号:US12692538
申请日:2010-01-22
申请人: Jason Flaks , Dax Hawkins , Christian Klein , Mitchell Stephen Dernis , Tommer Leyvand , Ali M. Vassigh , Duncan McKay
发明人: Jason Flaks , Dax Hawkins , Christian Klein , Mitchell Stephen Dernis , Tommer Leyvand , Ali M. Vassigh , Duncan McKay
IPC分类号: G10L15/00
CPC分类号: G10L15/24 , A63F2300/1081 , A63F2300/1087 , A63F2300/6072 , G06K9/0057 , G10L17/00 , G10L2015/228 , G10L2021/02166
摘要: Embodiments are disclosed that relate to the use of identity information to help avoid the occurrence of false positive speech recognition events in a speech recognition system. One embodiment provides a method comprising receiving speech recognition data comprising a recognized speech segment, acoustic locational data related to a location of origin of the recognized speech segment as determined via signals from the microphone array, and confidence data comprising a recognition confidence value, and also receiving image data comprising visual locational information related to a location of each person in an image. The acoustic locational data is compared to the visual locational data to determine whether the recognized speech segment originated from a person in the field of view of the image sensor, and the confidence data is adjusted depending on this determination.
摘要翻译: 公开了涉及使用身份信息来帮助避免在语音识别系统中发生假阳性语音识别事件的实施例。 一个实施例提供了一种方法,包括接收包括识别的语音段的语音识别数据,与通过来自麦克风阵列的信号确定的识别的语音段的原点的位置有关的声学位置数据,以及包括识别置信度的置信度数据,以及 接收图像数据,其包括与图像中的每个人的位置相关的视觉位置信息。 将声学位置数据与视觉位置数据进行比较,以确定识别的语音片段是否源于图像传感器的视野中的人,并且根据该确定来调整置信度数据。
-
公开(公告)号:US08296151B2
公开(公告)日:2012-10-23
申请号:US12818898
申请日:2010-06-18
CPC分类号: G06F3/017 , G06F3/038 , G06F3/167 , G06F2203/0381
摘要: A multimedia entertainment system combines both gestures and voice commands to provide an enhanced control scheme. A user's body position or motion may be recognized as a gesture, and may be used to provide context to recognize user generated sounds, such as speech input. Likewise, speech input may be recognized as a voice command, and may be used to provide context to recognize a body position or motion as a gesture. Weights may be assigned to the inputs to facilitate processing. When a gesture is recognized, a limited set of voice commands associated with the recognized gesture are loaded for use. Further, additional sets of voice commands may be structured in a hierarchical manner such that speaking a voice command from one set of voice commands leads to the system loading a next set of voice commands.
摘要翻译: 多媒体娱乐系统结合了手势和语音命令,以提供增强的控制方案。 用户的身体位置或运动可以被识别为手势,并且可以用于提供上下文以识别用户生成的声音,例如语音输入。 类似地,语音输入可以被识别为语音命令,并且可以用于提供上下文以将身体位置或运动识别为手势。 可以将权重分配给输入以便于处理。 当识别到手势时,加载与识别的手势相关联的一组有限的语音命令以供使用。 此外,附加语音命令集可以以分层方式构造,使得从一组语音命令发出语音命令导致系统加载下一组语音命令。
-
公开(公告)号:US20110313768A1
公开(公告)日:2011-12-22
申请号:US12818898
申请日:2010-06-18
CPC分类号: G06F3/017 , G06F3/038 , G06F3/167 , G06F2203/0381
摘要: A multimedia entertainment system combines both gestures and voice commands to provide an enhanced control scheme. A user's body position or motion may be recognized as a gesture, and may be used to provide context to recognize user generated sounds, such as speech input. Likewise, speech input may be recognized as a voice command, and may be used to provide context to recognize a body position or motion as a gesture. Weights may be assigned to the inputs to facilitate processing. When a gesture is recognized, a limited set of voice commands associated with the recognized gesture are loaded for use. Further, additional sets of voice commands may be structured in a hierarchical manner such that speaking a voice command from one set of voice commands leads to the system loading a next set of voice commands.
摘要翻译: 多媒体娱乐系统结合了手势和语音命令,以提供增强的控制方案。 用户的身体位置或运动可以被识别为手势,并且可以用于提供上下文以识别用户生成的声音,例如语音输入。 类似地,语音输入可以被识别为语音命令,并且可以用于提供上下文以将身体位置或运动识别为手势。 可以将权重分配给输入以便于处理。 当识别到手势时,加载与识别的手势相关联的一组有限的语音命令以供使用。 此外,附加语音命令集可以以分层方式构造,使得从一组语音命令发出语音命令导致系统加载下一组语音命令。
-
公开(公告)号:USD641373S1
公开(公告)日:2011-07-12
申请号:US29363656
申请日:2010-06-11
申请人: David E. Gardner , Ali M. Vassigh , Cyrus Kanga
设计人: David E. Gardner , Ali M. Vassigh , Cyrus Kanga
-
公开(公告)号:US6167381A
公开(公告)日:2000-12-26
申请号:US20056
申请日:1998-02-06
CPC分类号: A47F9/047 , G06Q20/1085 , G06Q20/20 , G06Q20/204 , G07G1/0036
摘要: A self-service checkout terminal includes a base having a bagwell defined therein. The terminal also includes a first counter supported on the base. The first counter has a first surface which is positioned at a first height. The terminal further includes a scanner secured at a first end of the first counter. The terminal yet further includes an automated teller machine secured at a second end of the first counter. Moreover, the terminal includes an arcuate shaped second counter secured to the first counter, the second counter having a second surface which is positioned at a second height. The first counter has a bagwell opening defined therein at a location interposed between the scanner and the automated teller machine. The bagwell opening is aligned with the bagwell. The first height is less than the second height.
摘要翻译: 自助结账终端包括其中限定有袋状的基座。 终端还包括在基座上支撑的第一计数器。 第一计数器具有位于第一高度的第一表面。 终端还包括固定在第一计数器的第一端的扫描仪。 终端还包括固定在第一计数器的第二端的自动取款机。 此外,端子包括固定到第一计数器的弓形形状的第二计数器,第二计数器具有位于第二高度的第二表面。 第一个计数器在介于扫描仪和自动柜员机之间的位置处具有限定在其中的袋状开口。 袋状开口与袋状孔对齐。 第一个高度小于第二个高度。
-
公开(公告)号:US08954330B2
公开(公告)日:2015-02-10
申请号:US13305429
申请日:2011-11-28
申请人: Michael F. Koenig , Oscar Enrique Murillo , Ira Lynn Snyder, Jr. , Andrew D. Wilson , Kenneth P. Hinckley , Ali M. Vassigh
发明人: Michael F. Koenig , Oscar Enrique Murillo , Ira Lynn Snyder, Jr. , Andrew D. Wilson , Kenneth P. Hinckley , Ali M. Vassigh
CPC分类号: G10L21/10 , G06F17/27 , G06F17/271 , G06Q10/10 , G10L21/00
摘要: The subject disclosure is directed towards detecting symbolic activity within a given environment using a context-dependent grammar. In response to receiving sets of input data corresponding to one or more input modalities, a context-aware interactive system processes a model associated with interpreting the symbolic activity using context data for the given environment. Based on the model, related sets of input data are determined. The context-aware interactive system uses the input data to interpret user intent with respect to the input and thereby, identify one or more commands for a target output mechanism.
摘要翻译: 本发明涉及使用上下文相关语法在给定环境内检测符号活动。 响应于对应于一个或多个输入模态的接收输入数据集,上下文感知交互系统使用用于给定环境的上下文数据来处理与解释符号活动相关联的模型。 基于该模型,确定相关的输入数据集。 上下文感知交互系统使用输入数据来解释用户对输入的意图,从而识别用于目标输出机制的一个或多个命令。
-
-
-
-
-
-
-
-
-