Enhanced image/video quality through artifact evaluation
    11.
    Granted invention patent
    Legal status: in force

    Publication (announcement) No.: US07873224B2

    Publication (announcement) date: 2011-01-18

    Application No.: US11366787

    Filing date: 2006-03-01

    IPC classification: G06K9/36

    Abstract: In an image/video encoding and decoding system employing an artifact evaluator, a method and/or apparatus to process video blocks comprises a decoder operable to synthesize an un-filtered reconstructed video block or frame, and an artifact filter operable to receive the un-filtered reconstructed video block or frame and generate a filtered reconstructed video block or frame. A memory buffer is operable to store either the filtered or the un-filtered reconstructed video block or frame, and an artifact evaluator is operable to update the memory buffer after evaluating and determining which of the filtered video block or frame or the un-filtered video block or frame yields better image/video quality.
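
The selection step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the quality metric (mean squared error against the original block), the toy filter `smooth_rows`, and the dictionary used as the memory buffer are all assumptions, since the abstract does not specify them.

```python
def mse(block_a, block_b):
    """Mean squared error between two equally sized 2-D pixel blocks."""
    total = 0
    count = 0
    for row_a, row_b in zip(block_a, block_b):
        for a, b in zip(row_a, row_b):
            total += (a - b) ** 2
            count += 1
    return total / count


def smooth_rows(block):
    """Hypothetical stand-in for an artifact filter: 3-tap horizontal average."""
    out = []
    for row in block:
        new_row = []
        for i in range(len(row)):
            left = row[max(i - 1, 0)]
            right = row[min(i + 1, len(row) - 1)]
            new_row.append((left + 2 * row[i] + right) // 4)
        out.append(new_row)
    return out


def evaluate_and_store(original, unfiltered, artifact_filter, memory_buffer, key):
    """Store whichever reconstruction is closer to the original (assumed metric: MSE)."""
    filtered = artifact_filter(unfiltered)
    if mse(original, filtered) < mse(original, unfiltered):
        memory_buffer[key] = filtered
        return "filtered"
    memory_buffer[key] = unfiltered
    return "unfiltered"
```

With this toy filter, a noisy flat block is stored in its filtered form, while a block containing a sharp edge that the filter would blur is stored un-filtered.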

    Motion estimation techniques for video encoding
    12.
    Granted invention patent
    Legal status: in force

    Publication (announcement) No.: US07817717B2

    Publication (announcement) date: 2010-10-19

    Application No.: US10176028

    Filing date: 2002-06-18

    IPC classification: H04N7/18

    Abstract: Video encoding techniques are described. In one example, a video encoding technique includes identifying a pixel location associated with a video block in a search space based on motion vectors associated with a set of video blocks within a video frame to be encoded, wherein the video blocks in the set are spatially located at defined locations relative to a current video block of the video frame to be encoded. A motion estimation routine can then be initialized for the current video block at the identified pixel location. By identifying a pixel location associated with a video block in a search space based on motion vectors associated with a set of video blocks within a video frame, the phenomenon of spatial redundancy can be more readily exploited to accelerate and improve the encoding process.
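
One way to picture the initialization described above is the sketch below. It assumes the starting location is the component-wise median of the neighboring blocks' motion vectors and that the motion estimation routine is a small full search scored by sum of absolute differences (SAD); neither choice is stated in the abstract, and all function names are hypothetical.

```python
def median_predictor(neighbor_mvs):
    """Component-wise median of the neighbors' motion vectors (dx, dy)."""
    xs = sorted(mv[0] for mv in neighbor_mvs)
    ys = sorted(mv[1] for mv in neighbor_mvs)
    mid = len(neighbor_mvs) // 2
    return xs[mid], ys[mid]


def sad(cur, ref, bx, by, dx, dy, size):
    """SAD between the current block and the reference block displaced by (dx, dy)."""
    total = 0
    for y in range(size):
        for x in range(size):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def motion_search(cur, ref, bx, by, size, neighbor_mvs, radius=1):
    """Search a small window centred on the displacement predicted from neighbors."""
    px, py = median_predictor(neighbor_mvs)
    height, width = len(ref), len(ref[0])
    best_mv, best_cost = None, None
    for dy in range(py - radius, py + radius + 1):
        for dx in range(px - radius, px + radius + 1):
            # Skip candidates that fall outside the reference frame.
            if not (0 <= bx + dx and bx + dx + size <= width
                    and 0 <= by + dy and by + dy + size <= height):
                continue
            cost = sad(cur, ref, bx, by, dx, dy, size)
            if best_cost is None or cost < best_cost:
                best_mv, best_cost = (dx, dy), cost
    return best_mv, best_cost
```

Because neighboring blocks tend to move together, starting near their motion vectors lets a small search radius find a good match that would otherwise need a much larger window.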

    Enhanced image/video quality through artifact evaluation
    13.
    Invention patent application
    Legal status: in force

    Publication (announcement) No.: US20070206871A1

    Publication (announcement) date: 2007-09-06

    Application No.: US11366787

    Filing date: 2006-03-01

    IPC classification: G06K9/36

    Abstract: In an image/video encoding and decoding system employing an artifact evaluator, a method and/or apparatus to process video blocks comprises a decoder operable to synthesize an un-filtered reconstructed video block or frame, and an artifact filter operable to receive the un-filtered reconstructed video block or frame and generate a filtered reconstructed video block or frame. A memory buffer is operable to store either the filtered or the un-filtered reconstructed video block or frame, and an artifact evaluator is operable to update the memory buffer after evaluating and determining which of the filtered video block or frame or the un-filtered video block or frame yields better image/video quality.

    System and method for segmentation and recognition of speech signals
    14.
    Granted invention patent
    Legal status: in force

    Publication (announcement) No.: US06278972B1

    Publication (announcement) date: 2001-08-21

    Application No.: US09225891

    Filing date: 1999-01-04

    IPC classification: G01L1504

    CPC classification: G10L15/04

    Abstract: A system and method for forming a segmented speech signal from an input speech signal having a plurality of frames. The input speech signal is converted from a time domain signal to a frequency domain signal having a plurality of speech frames, wherein each speech frame in the frequency domain signal is represented by at least one spectral value associated with the speech frame. A spectral difference value is then determined for each pair of adjacent frames in the frequency domain signal, wherein the spectral difference value for each pair of adjacent frames is representative of a difference between the at least one spectral value associated with each frame in the pair of adjacent frames. An initial cluster boundary is set between each pair of adjacent frames in the frequency domain signal, and a variance value is assigned to each cluster in the frequency domain signal, wherein the variance value for each cluster is equal to one of the determined spectral difference values. Next, a plurality of cluster merge parameters is calculated, wherein each of the cluster merge parameters is associated with a pair of adjacent clusters in the frequency domain signal. A minimum cluster merge parameter is selected from the plurality of cluster merge parameters. A merged cluster is then formed by canceling a cluster boundary between the clusters associated with the minimum merge parameter and assigning a merged variance value to the merged cluster, wherein the merged variance value is representative of the variance values assigned to the clusters associated with the minimum merge parameter. The process is repeated in order to form a plurality of merged clusters, and the segmented speech signal is formed in accordance with the plurality of merged clusters.
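
The merge loop described above resembles bottom-up clustering restricted to adjacent clusters. The sketch below is a simplified illustration under stated assumptions: each frame is reduced to a single scalar spectral value, and the merge parameter is taken to be the increase in within-cluster squared error caused by a merge. The abstract does not give the actual variance or merge-parameter formulas, so `merge_cost` is a hypothetical substitute.

```python
def within_cluster_error(values):
    """Sum of squared deviations from the cluster mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def merge_cost(left, right):
    """Assumed merge parameter: extra squared error introduced by merging two clusters."""
    return (within_cluster_error(left + right)
            - within_cluster_error(left)
            - within_cluster_error(right))


def segment(spectral_values, target_clusters):
    """Start with one cluster per frame, then repeatedly merge the cheapest adjacent pair."""
    clusters = [[v] for v in spectral_values]
    while len(clusters) > target_clusters:
        costs = [merge_cost(clusters[i], clusters[i + 1])
                 for i in range(len(clusters) - 1)]
        i = costs.index(min(costs))
        # Cancel the boundary between cluster i and cluster i + 1.
        clusters[i:i + 2] = [clusters[i] + clusters[i + 1]]
    return clusters
```

Only adjacent clusters are ever merged, so each resulting cluster is a contiguous run of frames, which is what makes the output usable as a time segmentation.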

    Distributed voice recognition system
    15.
    Granted invention patent
    Legal status: in force

    Publication (announcement) No.: US06411926B1

    Publication (announcement) date: 2002-06-25

    Application No.: US09246413

    Filing date: 1999-02-08

    Applicant: Chienchung Chang

    Inventor: Chienchung Chang

    IPC classification: G01L1924

    CPC classification: G10L15/30

    Abstract: A distributed voice recognition system includes a digital signal processor (DSP), a nonvolatile storage medium, and a microprocessor. The DSP is configured to extract parameters from digitized input speech samples and provide the extracted parameters to the microprocessor. The nonvolatile storage medium contains a database of speech templates. The microprocessor is configured to read the contents of the nonvolatile storage medium, compare the parameters with the contents, and select a speech template based upon the comparison. The nonvolatile storage medium may be a flash memory. The DSP may be a vocoder. If the DSP is a vocoder, the parameters may be diagnostic data generated by the vocoder. The distributed voice recognition system may reside on an application specific integrated circuit (ASIC).
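
The division of labor in the abstract can be illustrated with the sketch below. Everything concrete in it is an assumption: the "parameters" are simply mean absolute amplitudes per frame, the template database is a dictionary standing in for the nonvolatile storage, and the comparison is a nearest-neighbor match by Euclidean distance.

```python
import math


def extract_parameters(samples, frame_size):
    """Hypothetical DSP front end: mean absolute amplitude of each full frame."""
    frames = [samples[i:i + frame_size]
              for i in range(0, len(samples) - frame_size + 1, frame_size)]
    return [sum(abs(s) for s in frame) / frame_size for frame in frames]


def select_template(parameters, templates):
    """Hypothetical microprocessor side: pick the stored template closest to the parameters."""
    best_label, best_distance = None, math.inf
    for label, template in templates.items():
        distance = math.dist(parameters, template)
        if distance < best_distance:
            best_label, best_distance = label, distance
    return best_label
```

For example, `select_template(extract_parameters(samples, 4), templates)` returns the label of the stored template nearest to the extracted parameter vector, provided every template has the same length as that vector.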

    USER INTERFACE FOR MOBILE DEVICES
    17.
    Invention patent application
    Legal status: in force

    Publication (announcement) No.: US20100174421A1

    Publication (announcement) date: 2010-07-08

    Application No.: US12401758

    Filing date: 2009-03-11

    IPC classification: G05D1/02 H04M1/00 G06F3/048

    Abstract: A mobile user interface suitable for mobile computing devices uses device position/orientation in real space to select a portion of content that is displayed. Content (e.g., documents, files or a desktop) is presumed fixed in virtual space with the mobile user interface displaying a portion of the content as if viewed through a camera or magnifying glass. Data from motion, distance or position sensors are used to determine the relative position/orientation of the device with respect to the content to select the portion for display. Content elements can be selected by centering the display on the desired portion, obviating the need for cursors and pointing devices (e.g., mouse or touchscreen). Magnification can be manipulated by moving the device away from or towards the user. 3-D content viewing may be enabled by sensing the device orientation and displaying content that is above or below the display in 3-D virtual space.
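
The mapping from device position to displayed content could look something like the sketch below. It assumes the device position has already been converted to content coordinates, that magnification grows linearly with distance from the user, and that the view is clamped to the content bounds. The abstract fixes none of these details, and the default sizes and `reference_distance` are hypothetical.

```python
def viewport(device_x, device_y, distance, content_w, content_h,
             base_w=200.0, base_h=100.0, reference_distance=0.5):
    """Return (left, top, width, height) of the content region to display.

    Assumption: moving the device away from the user (larger `distance`)
    increases magnification, so a smaller region of content is shown.
    """
    magnification = distance / reference_distance
    width = min(base_w / magnification, content_w)
    height = min(base_h / magnification, content_h)
    # Centre the view on the device position, then clamp it to the content.
    left = min(max(device_x - width / 2, 0.0), content_w - width)
    top = min(max(device_y - height / 2, 0.0), content_h - height)
    return left, top, width, height
```

Under this mapping the element at the centre of the returned rectangle is the one the user is pointing at, which is how centering the display can replace a cursor.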

    Video encoding techniques
    18.
    Granted invention patent
    Legal status: lapsed

    Publication (announcement) No.: US07039246B2

    Publication (announcement) date: 2006-05-02

    Application No.: US10139772

    Filing date: 2002-05-03

    IPC classification: G06K9/36

    Abstract: This disclosure is directed to encoding techniques that can be used to improve encoding of digital video data. The techniques can be implemented by an encoder of a digital video device in order to reduce the number of computations and possibly reduce power consumption during video encoding. More specifically, video encoding techniques are described that use one or more programmable thresholds to terminate the execution of various computations when the computations would be unlikely to improve the encoding. By terminating computations prematurely, the amount of processing required for video encoding can be reduced, and power can be conserved.
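
Threshold-based termination of this kind is often illustrated with block matching, as in the sketch below. It is an assumption that the computations in question are sum-of-absolute-differences (SAD) comparisons; the abstract does not say which computations are terminated. Two hypothetical thresholds are used: a running limit that aborts a single SAD once it can no longer beat the best match, and a programmable `good_enough` value that ends the whole search.

```python
def sad_with_early_exit(block_a, block_b, limit):
    """Accumulate SAD row by row; stop as soon as the running total exceeds `limit`.

    Returns (total_so_far, completed).
    """
    total = 0
    for row_a, row_b in zip(block_a, block_b):
        total += sum(abs(a - b) for a, b in zip(row_a, row_b))
        if total > limit:
            return total, False
    return total, True


def best_candidate(current, candidates, good_enough):
    """Return (index, sad) of the best match, stopping once a match is good enough."""
    best_index, best_sad = None, None
    for index, candidate in enumerate(candidates):
        limit = best_sad if best_sad is not None else float("inf")
        cost, completed = sad_with_early_exit(current, candidate, limit)
        if completed and (best_sad is None or cost < best_sad):
            best_index, best_sad = index, cost
            if best_sad <= good_enough:
                break  # Further candidates are unlikely to improve the encoding.
    return best_index, best_sad
```

Raising `good_enough` trades match quality for fewer computations, which is the knob a power-constrained encoder would tune.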

    Method and apparatus for eighth-rate random number generation for speech coders
    19.
    Granted invention patent
    Legal status: in force

    Publication (announcement) No.: US06226607B1

    Publication (announcement) date: 2001-05-01

    Application No.: US09248516

    Filing date: 1999-02-08

    IPC classification: G10L1912

    CPC classification: G10L19/012 G10L19/24

    Abstract: A method and apparatus for eighth-rate random number generation for speech coders includes a random number generator configured to generate values of a first random variable. A lookup table is used to store values of a second random variable. The lookup table is addressed with the values of the first random variable. The second random variable is an inverse transform of a cumulative distribution function of the first random variable. A codec encodes input silence frames with the values of the first and second random variables, and regenerates the silence frames with the values of the first and second random variables. The speech coder may be an enhanced variable rate coder, and the silence frames may be encoded at eighth rate. The random variables are advantageously Gaussian random variables with values that are uniformly distributed between zero and one.
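
The lookup-table idea can be sketched with inverse transform sampling, as below. This is an illustrative reading rather than the patented design: it assumes the first random variable is uniform on [0, 1), that the table holds the inverse Gaussian cumulative distribution function sampled at bin midpoints, and that encoder and decoder share a seed. `TABLE_SIZE` and the function name are hypothetical.

```python
import random
from statistics import NormalDist

TABLE_SIZE = 256  # Assumed table length; the abstract does not give one.

# Entry i holds the standard-normal value whose CDF equals the midpoint of bin i.
GAUSSIAN_TABLE = [NormalDist().inv_cdf((i + 0.5) / TABLE_SIZE)
                  for i in range(TABLE_SIZE)]


def gaussian_noise(seed, count):
    """Generate `count` approximately Gaussian values from a seeded uniform generator."""
    rng = random.Random(seed)
    values = []
    for _ in range(count):
        u = rng.random()             # First random variable: uniform on [0, 1).
        index = int(u * TABLE_SIZE)  # Address the lookup table with it.
        values.append(GAUSSIAN_TABLE[index])
    return values
```

Because the table replaces the inverse CDF computation with a single indexed read, a decoder seeded identically to the encoder can regenerate the same noise sequence cheaply.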
