System and method for segmentation and recognition of speech signals
    21.
    发明授权
    System and method for segmentation and recognition of speech signals 有权
    用于语音信号的分割和识别的系统和方法

    公开(公告)号:US06278972B1

    公开(公告)日:2001-08-21

    申请号:US09225891

    申请日:1999-01-04

    CPC classification number: G10L15/04

    Abstract: A system and method for forming a segmented speech signal from an input speech signal having a plurality of frames. The input speech signal is converted from a time domain signal to a frequency domain signal having a plurality of speech frames, wherein each speech frame in the frequency domain signal is represented by at least one spectral value associated with the speech frame. A spectral difference value is then determined for each pair of adjacent frames in the frequency domain signal, wherein the spectral difference value for each pair of adjacent frames is representative of a difference between the at least one spectral value associated with each frame in the pair of adjacent frames. An initial cluster boundary is set between each pair of adjacent frames in the frequency domain signal, and a variance value is assigned to each cluster in the frequency domain signal, wherein the variance value for each cluster is equal to one of the determined spectral difference values. Next, a plurality of cluster merge parameters is calculated, wherein each of the cluster merge parameters is associated with a pair of adjacent clusters in the frequency domain signal. A minimum cluster merge parameter is selected from the plurality of cluster merge parameters. A merged cluster is then formed by canceling a cluster boundary between the clusters associated with the minimum merge parameter and assigning a merged variance value to the merged cluster, wherein the merged variance value is representative of the variance values assigned to the clusters associated with the minimum merge parameter. The process is repeated in order to form a plurality of merged clusters, and the segmented speech signal is formed in accordance with the plurality of merged clusters.

    Abstract translation: 一种用于从具有多个帧的输入语音信号形成分段语音信号的系统和方法。 输入语音信号从时域信号转换为具有多个语音帧的频域信号,其中频域信号中的每个语音帧由与语音帧相关联的至少一个频谱值表示。 然后对频域信号中的每对相邻帧确定频谱差值,其中每对相邻帧的频谱差值表示与该对相邻帧中的每个帧相关联的至少一个频谱值之间的差异 相邻帧。 在频域信号中的每对相邻帧之间设置初始簇边界,并且将频域值分配给频域信号中的每个簇,其中每个簇的方差值等于所确定的光谱差值之一 。 接下来,计算多个集群合并参数,其中每个集群合并参数与频域信号中的一对相邻集群相关联。 从多个集群合并参数中选择最小集群合并参数。 然后通过消除与最小合并参数相关联的集群之间的集群边界并将合并的方差值分配给合并的集群来形成合并的集群,其中合并的方差值表示分配给与最小合并参数相关联的集群的方差值 合并参数。 重复该过程以形成多个合并的群集,并且根据多个合并的群集形成分段语音信号。

    System and method to display content based on viewing orientation
    23.
    发明授权
    System and method to display content based on viewing orientation 有权
    基于观看方向显示内容的系统和方法

    公开(公告)号:US09285883B2

    公开(公告)日:2016-03-15

    申请号:US13038166

    申请日:2011-03-01

    Abstract: An apparatus and method for displaying content is disclosed. A particular method includes determining a viewing orientation of a user relative to a display and providing a portion of content to the display based on the viewing orientation. The portion includes at least a first viewable element of the content and does not include at least one second viewable element of the content. The method also includes determining an updated viewing orientation of the user and updating the portion of the content based on the updated viewing orientation. The updated portion includes at least the second viewable element. A display difference between the portion and the updated portion is non-linearly related to an orientation difference between the viewing orientation and the updated viewing orientation.

    Abstract translation: 公开了一种用于显示内容的装置和方法。 一种特定的方法包括基于观看方向来确定用户相对于显示器的观看方向并且向显示器提供内容的一部分。 该部分至少包括内容的第一可见元素,并且不包括内容的至少一个第二可见元素。 该方法还包括确定用户的更新的观看方向并基于更新的观看方向来更新内容的部分。 更新部分至少包括第二可见元素。 该部分和更新部分之间的显示差异与观看方向和更新的观看方向之间的取向差异非线性相关。

    Virtual keyboards and methods of providing the same
    25.
    发明授权
    Virtual keyboards and methods of providing the same 有权
    虚拟键盘和提供相同的方法

    公开(公告)号:US08928589B2

    公开(公告)日:2015-01-06

    申请号:US13090497

    申请日:2011-04-20

    Applicant: Ning Bi

    Inventor: Ning Bi

    Abstract: The present disclosure provides systems, methods and apparatus, including computer programs encoded on computer storage media, for providing virtual keyboards. In one aspect, a system includes a camera, a display, a video feature extraction module and a gesture pattern matching module. The camera captures a sequence of images containing a finger of a user, and the display displays each image combined with a virtual keyboard having a plurality of virtual keys. The video feature extraction module detects motion of the finger in the sequence of images relative to virtual sensors of the virtual keys, and determines sensor actuation data based on the detected motion relative to the virtual sensors. The gesture pattern matching module uses the sensor actuation data to recognize a gesture.

    Abstract translation: 本公开提供了系统,方法和装置,包括在计算机存储介质上编码的用于提供虚拟键盘的计算机程序。 一方面,系统包括相机,显示器,视频特征提取模块和手势模式匹配模块。 相机拍摄包含用户手指的一系列图像,并且显示器显示与具有多个虚拟键的虚拟键盘组合的每个图像。 视频特征提取模块相对于虚拟键的虚拟传感器检测图像序列中的手指的运动,并且基于相对于虚拟传感器的检测到的运动来确定传感器致动数据。 手势模式匹配模块使用传感器致动数据来识别手势。

    STEREOSCOPIC CONVERSION FOR SHADER BASED GRAPHICS CONTENT
    26.
    发明申请
    STEREOSCOPIC CONVERSION FOR SHADER BASED GRAPHICS CONTENT 有权
    基于阴影的图形内容的立体转换

    公开(公告)号:US20120235999A1

    公开(公告)日:2012-09-20

    申请号:US13350467

    申请日:2012-01-13

    Abstract: The example techniques of this disclosure are directed to generating a stereoscopic view from an application designed to generate a mono view. For example, the techniques may modify source code of a vertex shader to cause the modified vertex shader, when executed, to generate graphics content for the images of the stereoscopic view. As another example, the techniques may modify a command that defines a viewport for the mono view to commands that define the viewports for the images of the stereoscopic view.

    Abstract translation: 本公开的示例性技术涉及从被设计成生成单视图的应用产生立体视图。 例如,这些技术可以修改顶点着色器的源代码,以便在被执行时使经修改的顶点着色器生成立体视图的图像的图形内容。 作为另一示例,这些技术可以修改将单声道视图的视口定义为定义立体视图的图像的视口的命令的命令。

    VIEWPOINT DETECTOR BASED ON SKIN COLOR AREA AND FACE AREA
    27.
    发明申请
    VIEWPOINT DETECTOR BASED ON SKIN COLOR AREA AND FACE AREA 有权
    基于皮肤颜色区域和面部的视点检测器

    公开(公告)号:US20110262001A1

    公开(公告)日:2011-10-27

    申请号:US12765292

    申请日:2010-04-22

    CPC classification number: G06K9/00234

    Abstract: In a particular illustrative embodiment, a method of determining a viewpoint of a person based on skin color area and face area is disclosed. The method includes receiving image data corresponding to an image captured by a camera, the image including at least one object to be displayed at a device coupled to the camera. The method further includes determining a viewpoint of the person relative to a display of the device coupled to the camera. The viewpoint of the person may be determined by determining a face area of the person based on a determined skin color area of the person and tracking a face location of the person based on the face area. One or more objects displayed at the display may be moved in response to the determined viewpoint of the person.

    Abstract translation: 在具体的说明性实施例中,公开了一种基于皮肤颜色区域和面部区域来确定人的视点的方法。 该方法包括接收与由相机拍摄的图像相对应的图像数据,所述图像包括要耦合到相机的设备中要显示的至少一个对象。 该方法还包括确定人相对于耦合到相机的设备的显示的视点。 可以通过基于人的确定的皮肤颜色区域来确定人的脸部区域并且基于面部区域来跟踪人的面部位置来确定人的观点。 可以响应于所确定的人的观点而移动在显示器处显示的一个或多个对象。

    combined engine system and method for voice recognition
    28.
    发明授权
    combined engine system and method for voice recognition 有权
    组合发动机系统和语音识别方法

    公开(公告)号:US06671669B1

    公开(公告)日:2003-12-30

    申请号:US09618177

    申请日:2000-07-18

    CPC classification number: G10L15/32

    Abstract: A method and system that combines voice recognition engines and resolves any differences between the results of individual voice recognition engines. A speaker independent (SI) Hidden Markov Model (HMM) engine, a speaker independent Dynamic Time Warping (DTW-SI) engine and a speaker dependent Dynamic Time Warping (DTW-SD) engine are combined. Combining and resolving the results of these engines results in a system with better recognition accuracy and lower rejection rates than using the results of only one engine.

    Abstract translation: 一种组合语音识别引擎并解决各个语音识别引擎结果之间差异的方法和系统。 独立于扬声器(SI)隐马尔可夫模型(HMM)引擎,独立于扬声器的动态时间扭曲(DTW-SI)引擎和与扬声器相关的动态时间扭曲(DTW-SD)引擎。 结合和解决这些发动机的结果导致与使用仅一个发动机的结果相比,具有更好的识别精度和更低的排除率的系统。

    CONTENT-ADAPTIVE SYSTEMS, METHODS AND APPARATUS FOR DETERMINING OPTICAL FLOW
    29.
    发明申请
    CONTENT-ADAPTIVE SYSTEMS, METHODS AND APPARATUS FOR DETERMINING OPTICAL FLOW 有权
    内容自适应系统,用于确定光学流量的方法和装置

    公开(公告)号:US20120321139A1

    公开(公告)日:2012-12-20

    申请号:US13160457

    申请日:2011-06-14

    CPC classification number: G06K9/00536 G06T7/20 G06T2207/20012

    Abstract: Embodiments include methods and systems which determine pixel displacement between frames based on a respective weighting-value for each pixel or a group of pixels. The weighting-values provide an indication as to which pixels are more pertinent to optical flow computations. Computational resources and effort can be focused on pixels with higher weights, which are generally more pertinent to optical flow determinations.

    Abstract translation: 实施例包括基于每个像素或一组像素的相应加权值来确定帧之间的像素位移的方法和系统。 加权值提供关于哪些像素与光流计算更相关的指示。 计算资源和努力可以集中在具有较高权重的像素上,这通常与光流测定更相关。

    FINGERTIP TRACKING FOR TOUCHLESS USER INTERFACE
    30.
    发明申请
    FINGERTIP TRACKING FOR TOUCHLESS USER INTERFACE 有权
    用于无连接用户界面的指纹跟踪

    公开(公告)号:US20120113241A1

    公开(公告)日:2012-05-10

    申请号:US13082295

    申请日:2011-04-07

    Abstract: In general, this disclosure describes techniques for providing a gesture-based user interface. For example, according to some aspects of the disclosure, a user interface generally includes a camera and a computing device that identifies and tracks the motion of one or more fingertips of a user. In some examples, the user interface is configured to identify predefined gestures (e.g., patterns of motion) associated with certain motions of the user's fingertips. In another example, the user interface is configured to identify hand postures (e.g., patterns of showing up of fingertips). Accordingly, the user can interact with the computing device by performing the gestures.

    Abstract translation: 通常,本公开描述了用于提供基于手势的用户界面的技术。 例如,根据本公开的一些方面,用户界面通常包括识别和跟踪用户的一个或多个指尖的运动的照相机和计算设备。 在一些示例中,用户界面被配置为识别与用户的指尖的某些运动相关联的预定手势(例如,运动模式)。 在另一示例中,用户界面被配置为识别手势(例如,指尖的显示图案)。 因此,用户可以通过执行手势与计算设备交互。

Patent Agency Ranking