System and method for segmentation and recognition of speech signals
    21.
    发明授权
    System and method for segmentation and recognition of speech signals 有权
    用于语音信号的分割和识别的系统和方法

    公开(公告)号:US06278972B1

    公开(公告)日:2001-08-21

    申请号:US09225891

    申请日:1999-01-04

    IPC分类号: G01L1504

    CPC分类号: G10L15/04

    摘要: A system and method for forming a segmented speech signal from an input speech signal having a plurality of frames. The input speech signal is converted from a time domain signal to a frequency domain signal having a plurality of speech frames, wherein each speech frame in the frequency domain signal is represented by at least one spectral value associated with the speech frame. A spectral difference value is then determined for each pair of adjacent frames in the frequency domain signal, wherein the spectral difference value for each pair of adjacent frames is representative of a difference between the at least one spectral value associated with each frame in the pair of adjacent frames. An initial cluster boundary is set between each pair of adjacent frames in the frequency domain signal, and a variance value is assigned to each cluster in the frequency domain signal, wherein the variance value for each cluster is equal to one of the determined spectral difference values. Next, a plurality of cluster merge parameters is calculated, wherein each of the cluster merge parameters is associated with a pair of adjacent clusters in the frequency domain signal. A minimum cluster merge parameter is selected from the plurality of cluster merge parameters. A merged cluster is then formed by canceling a cluster boundary between the clusters associated with the minimum merge parameter and assigning a merged variance value to the merged cluster, wherein the merged variance value is representative of the variance values assigned to the clusters associated with the minimum merge parameter. The process is repeated in order to form a plurality of merged clusters, and the segmented speech signal is formed in accordance with the plurality of merged clusters.

    摘要翻译: 一种用于从具有多个帧的输入语音信号形成分段语音信号的系统和方法。 输入语音信号从时域信号转换为具有多个语音帧的频域信号,其中频域信号中的每个语音帧由与语音帧相关联的至少一个频谱值表示。 然后对频域信号中的每对相邻帧确定频谱差值,其中每对相邻帧的频谱差值表示与该对相邻帧中的每个帧相关联的至少一个频谱值之间的差异 相邻帧。 在频域信号中的每对相邻帧之间设置初始簇边界,并且将频域值分配给频域信号中的每个簇,其中每个簇的方差值等于所确定的光谱差值之一 。 接下来,计算多个集群合并参数,其中每个集群合并参数与频域信号中的一对相邻集群相关联。 从多个集群合并参数中选择最小集群合并参数。 然后通过消除与最小合并参数相关联的集群之间的集群边界并将合并的方差值分配给合并的集群来形成合并的集群,其中合并的方差值表示分配给与最小合并参数相关联的集群的方差值 合并参数。 重复该过程以形成多个合并的群集,并且根据多个合并的群集形成分段语音信号。

    MULTI-STAGE TESSELLATION FOR GRAPHICS RENDERING
    23.
    发明申请
    MULTI-STAGE TESSELLATION FOR GRAPHICS RENDERING 有权
    用于图形渲染的多阶段测量

    公开(公告)号:US20090237401A1

    公开(公告)日:2009-09-24

    申请号:US12052628

    申请日:2008-03-20

    IPC分类号: G06T17/00

    CPC分类号: G06T11/203

    摘要: This disclosure describes a multi-stage tessellation technique for tessellating a curve during graphics rendering. In particular, a first tessellation stage tessellates the curve into a first set of line segments that each represents a portion of the curve. A second tessellation stage further tessellates the portion of the curve represented by each of the line segments of the first set into additional line segments that more finely represent the shape of the curve. In this manner, each portion of the curve that was represented by only one line segment after the first tessellation stage is represented by more than one line segment after the second tessellation stage. In some instances, more than two tessellation stages may be performed to tessellate the curve.

    摘要翻译: 本公开描述了用于在图形渲染期间细分曲线的多阶段镶嵌技术。 特别地,第一细分阶段将曲线细分为第一组线段,每组线段表示曲线的一部分。 第二细分阶段进一步将由第一组的每个线段表示的曲线的部分细分为更精细地表示曲线形状的附加线段。 以这种方式,在第一细分阶段之后仅由一个线段表示的曲线的每个部分在第二细分阶段之后被多于一个线段表示。 在一些情况下,可以执行多于两个的细分阶段来细分曲线。

    Noise-compensated speech recognition templates
    24.
    发明授权
    Noise-compensated speech recognition templates 失效
    噪声补偿语音识别模板

    公开(公告)号:US06381569B1

    公开(公告)日:2002-04-30

    申请号:US09018257

    申请日:1998-02-04

    IPC分类号: G10L1520

    CPC分类号: G10L15/20 G10L21/0216

    摘要: The speech recognition training unit is modified to store digitized speech samples into a speech database that can be accessed at recognition time. The improved recognition unit comprises a noise analysis, modeling, and synthesis unit which continually analyzes the noise characteristics present in the audio environment and produces an estimated noise signal with similar characteristics. The recognition unit then constructs a noise-compensated template database by adding the estimated noise signal to each of the speech samples in the speech database and performing parameter determination on the resulting sums. This procedure accounts for the presence of noise in the recognition phase by retraining all the templates using an estimated noise signal with similar characteristics as the actual noise signal that corrupted the word to be recognized. This method improves the likelihood of a good template match, which increases the recognition accuracy.

    摘要翻译: 修改语音识别训练单元以将数字化语音样本存储到可在识别时被访问的语音数据库中。 改进的识别单元包括噪声分析,建模和合成单元,其连续分析存在于音频环境中的噪声特性并产生具有相似特性的估计噪声信号。 然后,识别单元通过将估计的噪声信号加到语音数据库中的每个语音样本上并对所得到的和进行参数确定来构建噪声补偿模板数据库。 该过程通过使用具有与损坏要识别的字的实际噪声信号相似的特性的估计噪声信号重新训练所有模板来解决识别阶段中的噪声的存在。 该方法提高了模板匹配的可能性,从而提高了识别精度。

    VIRTUAL KEYBOARDS AND METHODS OF PROVIDING THE SAME
    25.
    发明申请
    VIRTUAL KEYBOARDS AND METHODS OF PROVIDING THE SAME 有权
    虚拟键盘及其提供方法

    公开(公告)号:US20120268376A1

    公开(公告)日:2012-10-25

    申请号:US13090497

    申请日:2011-04-20

    申请人: Ning Bi

    发明人: Ning Bi

    IPC分类号: G06F3/00

    摘要: The present disclosure provides systems, methods and apparatus, including computer programs encoded on computer storage media, for providing virtual keyboards. In one aspect, a system includes a camera, a display, a video feature extraction module and a gesture pattern matching module. The camera captures a sequence of images containing a finger of a user, and the display displays each image combined with a virtual keyboard having a plurality of virtual keys. The video feature extraction module detects motion of the finger in the sequence of images relative to virtual sensors of the virtual keys, and determines sensor actuation data based on the detected motion relative to the virtual sensors. The gesture pattern matching module uses the sensor actuation data to recognize a gesture.

    摘要翻译: 本公开提供了系统,方法和装置,包括在计算机存储介质上编码的用于提供虚拟键盘的计算机程序。 一方面,系统包括相机,显示器,视频特征提取模块和手势模式匹配模块。 相机拍摄包含用户手指的一系列图像,并且显示器显示与具有多个虚拟键的虚拟键盘组合的每个图像。 视频特征提取模块相对于虚拟键的虚拟传感器检测图像序列中的手指的运动,并且基于相对于虚拟传感器的检测到的运动来确定传感器致动数据。 手势模式匹配模块使用传感器致动数据来识别手势。

    SYSTEM AND METHOD TO DISPLAY CONTENT
    27.
    发明申请
    SYSTEM AND METHOD TO DISPLAY CONTENT 有权
    系统和方法来显示内容

    公开(公告)号:US20120223884A1

    公开(公告)日:2012-09-06

    申请号:US13038166

    申请日:2011-03-01

    IPC分类号: G06F3/033

    摘要: An apparatus and method for displaying content is disclosed. A particular method includes determining a viewing orientation of a user relative to a display and providing a portion of content to the display based on the viewing orientation. The portion includes at least a first viewable element of the content and does not include at least one second viewable element of the content. The method also includes determining an updated viewing orientation of the user and updating the portion of the content based on the updated viewing orientation. The updated portion includes at least the second viewable element. A display difference between the portion and the updated portion is non-linearly related to an orientation difference between the viewing orientation and the updated viewing orientation.

    摘要翻译: 公开了一种用于显示内容的装置和方法。 一种特定的方法包括基于观看方向来确定用户相对于显示器的观看方向并且向显示器提供内容的一部分。 该部分至少包括内容的第一可见元素,并且不包括内容的至少一个第二可见元素。 该方法还包括确定用户的更新的观看方向并基于更新的观看方向来更新内容的部分。 更新部分至少包括第二可见元素。 该部分和更新部分之间的显示差异与观看方向和更新的观看方向之间的取向差异非线性相关。

    Method and device for performing user-defined clipping in object space
    28.
    发明授权
    Method and device for performing user-defined clipping in object space 有权
    用于在对象空间中执行用户定义剪辑的方法和设备

    公开(公告)号:US08237739B2

    公开(公告)日:2012-08-07

    申请号:US11531205

    申请日:2006-09-12

    IPC分类号: G09G5/00

    CPC分类号: G06T15/30

    摘要: A method and device for performing and processing user-defined clipping in object space to reduce the number of computations needed for the clipping operation. The method and device also combine the modelview transformation of the vertex coordinates with projection transform. The user-defined clipping in object space provides a higher performance and less power consumption by avoiding generation of eye coordinates if there is no lighting. The device includes a driver for the user-defined clipping in the object space to perform dual mode user-defined clipping in object space when a lighting function is disabled and in eye space when the lighting function is enabled.

    摘要翻译: 一种用于在对象空间中执行和处理用户定义的限幅以减少剪切操作所需的计算次数的方法和装置。 该方法和装置还将顶点坐标的modelview变换与投影变换相结合。 对象空间中的用户定义的剪辑通过避免在没有照明的情况下产生眼睛坐标来提供更高的性能和更低的功耗。 该设备包括用于在对象空间中的用户定义的剪切的驱动器,以在禁用照明功能时在对象空间中执行双模式用户定义的剪辑,并且在启用照明功能时在眼睛空间中执行双模式用户定义的剪辑。

    System and method to display content based on viewing orientation
    30.
    发明授权
    System and method to display content based on viewing orientation 有权
    基于观看方向显示内容的系统和方法

    公开(公告)号:US09285883B2

    公开(公告)日:2016-03-15

    申请号:US13038166

    申请日:2011-03-01

    摘要: An apparatus and method for displaying content is disclosed. A particular method includes determining a viewing orientation of a user relative to a display and providing a portion of content to the display based on the viewing orientation. The portion includes at least a first viewable element of the content and does not include at least one second viewable element of the content. The method also includes determining an updated viewing orientation of the user and updating the portion of the content based on the updated viewing orientation. The updated portion includes at least the second viewable element. A display difference between the portion and the updated portion is non-linearly related to an orientation difference between the viewing orientation and the updated viewing orientation.

    摘要翻译: 公开了一种用于显示内容的装置和方法。 一种特定的方法包括基于观看方向来确定用户相对于显示器的观看方向并且向显示器提供内容的一部分。 该部分至少包括内容的第一可见元素,并且不包括内容的至少一个第二可见元素。 该方法还包括确定用户的更新的观看方向并基于更新的观看方向来更新内容的部分。 更新部分至少包括第二可见元素。 该部分和更新部分之间的显示差异与观看方向和更新的观看方向之间的取向差异非线性相关。