Sound source localization using phase spectrum
    1.
    发明授权
    Sound source localization using phase spectrum 有权
    声源定位使用相位谱

    公开(公告)号:US09435873B2

    公开(公告)日:2016-09-06

    申请号:US13182449

    申请日:2011-07-14

    CPC classification number: G01S3/8083 G01S3/8006 G01S3/82 H04R3/005

    Abstract: An array of microphones placed on a mobile robot provides multiple channels of audio signals. A received set of audio signals is called an audio segment, which is divided into multiple frames. A phase analysis is performed on a frame of the signals from each pair of microphones. If both microphones are in an active state during the frame, a candidate angle is generated for each such pair of microphones. The result is a list of candidate angles for the frame. This list is processed to select a final candidate angle for the frame. The list of candidate angles is tracked over time to assist in the process of selecting the final candidate angle for an audio segment.

    Abstract translation: 放置在移动机器人上的麦克风阵列提供多个音频信号通道。 接收到的一组音频信号被称为音频片段,其被分成多个帧。 对来自每对麦克风的信号的帧进行相位分析。 如果在帧期间两个麦克风处于活动状态,则为每个这样的麦克风生成候选角度。 结果是帧的候选角度列表。 处理该列表以选择帧的最终候选角度。 候选角度的列表随着时间被跟踪以帮助选择音频片段的最终候选角度的过程。

    Entropy coding efficiency enhancement utilizing energy distribution remapping
    2.
    发明授权
    Entropy coding efficiency enhancement utilizing energy distribution remapping 有权
    使用能量分配重新映射的熵编码效率增强

    公开(公告)号:US09398314B2

    公开(公告)日:2016-07-19

    申请号:US12026534

    申请日:2008-02-05

    CPC classification number: H04N19/85

    Abstract: Architecture for enhancing the compression (e.g., luma, chroma) of a video signal and improving the perceptual quality of the video compression schemes. The architecture operates to reshape the normal multimodal energy distribution of the input video signal to a new energy distribution. In the context of luma, the algorithm maps the black and white (or contrast) information of a picture to a new energy distribution. For example, the contrast can be enhanced in the middle range of the luma spectrum, thereby improving the contrast between a light foreground object and a dark background. At the same time, the algorithm reduces the bit-rate requirements at a particular quantization step size. The algorithm can be utilized also in post-processing to improve the quality of decoded video.

    Abstract translation: 用于增强视频信号的压缩(例如,亮度,色度)并提高视频压缩方案的感知质量的体系结构。 该架构用于将输入视频信号的正常多模态能量分布重新形成新的能量分布。 在亮度的上下文中,该算法将图像的黑白(或对比)信息映射到新的能量分布。 例如,可以在亮度光谱的中间范围内增强对比度,从而改善光前景物体和暗背景之间的对比度。 同时,该算法降低了特定量化步长的比特率要求。 该算法也可用于后处理,以提高解码视频的质量。

    Flexible range reduction
    3.
    发明授权
    Flexible range reduction 有权
    弹性范围缩小

    公开(公告)号:US08855202B2

    公开(公告)日:2014-10-07

    申请号:US13191335

    申请日:2011-07-26

    Abstract: Techniques and tools are described for flexible range reduction of samples of video. For example, an encoder signals a first set of one or more syntax elements for range reduction of luma samples and signals a second set of one or more syntax elements for range reduction of chroma samples. The encoder selectively scales down the luma samples and chroma samples in a manner consistent with the first syntax element(s) and second syntax element(s), respectively. Or, an encoder signals range reduction syntax element(s) in an entry point header for an entry point segment, where the syntax element(s) apply to pictures in the entry point segment. If range reduction is used for the pictures, the encoder scales down samples of the pictures. Otherwise, the encoder skips the scaling down. A decoder performs corresponding parsing and scaling up operations.

    Abstract translation: 描述了技术和工具,用于灵活地缩减视频样本。 例如,编码器用信号通知一个或多个语法元素的第一组,用于亮度样本的范围减小,并且为第二组一个或多个语法元素用于色度样本的范围缩小。 编码器分别以与第一语法元素和第二语法元素一致的方式选择性地缩小亮度样本和色度样本。 或者,编码器对入口点段的入口点标题中的范围缩小语法元素进行信号,其中语法元素适用于入口点段中的图像。 如果图像使用范围缩小,则编码器缩小图像的样本。 否则,编码器跳过缩小比例。 解码器执行相应的解析和放大操作。

    Motion-compensated prediction of inter-layer residuals
    4.
    发明授权
    Motion-compensated prediction of inter-layer residuals 有权
    层间残差运动补偿预测

    公开(公告)号:US08711948B2

    公开(公告)日:2014-04-29

    申请号:US12077768

    申请日:2008-03-21

    CPC classification number: H04N19/00442 H04N19/30 H04N19/34 H04N19/61 H04N19/63

    Abstract: Techniques and tools are described for scalable video encoding and decoding. In some embodiments, an encoding tool encodes base layer video and outputs encoded base layer video in a base layer bit stream. The encoding tool encodes inter-layer residual video (representing differences between input video and reconstructed base layer video) using motion compensation relative to previously reconstructed inter-layer residual video. For the inter-layer residual video, the encoding tool outputs motion information and motion-compensated prediction residuals in an enhancement layer bit stream. A decoding tool receives the base layer bit stream and enhancement layer bit stream, reconstructs base layer video, reconstructs inter-layer residual video, and combines the reconstructed base layer video and reconstructed inter-layer residual video. Using motion compensation for the inter-layer residual video facilitates the use of separate motion vectors and separate codecs for the base layer video and inter-layer residual video.

    Abstract translation: 描述了可扩展视频编码和解码的技术和工具。 在一些实施例中,编码工具对基层视频进行编码并在基层比特流中输出经编码的基层视频。 编码工具使用相对于先前重建的层间残留视频的运动补偿来编码层间残差视频(表示输入视频和重建的基本层视频之间的差异)。 对于层间残差视频,编码工具在增强层比特流中输出运动信息和运动补偿预测残差。 解码工具接收基层比特流和增强层比特流,重构基层视频,重建层间残留视频,并组合重建的基层视频和重建层间残留视频。 对于层间残留视频使用运动补偿有助于使用单独的运动矢量和单独的编解码器用于基本层视频和层间残留视频。

    MULTI-LEVEL REPRESENTATION OF REORDERED TRANSFORM COEFFICIENTS
    5.
    发明申请
    MULTI-LEVEL REPRESENTATION OF REORDERED TRANSFORM COEFFICIENTS 有权
    多级变换系数的多级表示

    公开(公告)号:US20120243615A1

    公开(公告)日:2012-09-27

    申请号:US13458774

    申请日:2012-04-27

    Abstract: Techniques and tools for encoding and decoding a block of frequency coefficients are presented. An encoder selects a scan order from multiple available scan orders and then applies the selected scan order to a two-dimensional matrix of transform coefficients, grouping non-zero values of the frequency coefficients together in a one-dimensional string. The encoder entropy encodes the one-dimensional string of coefficient values according to a multi-level nested set representation. In decoding, a decoder entropy decodes the one-dimensional string of coefficient values from the multi-level nested set representation. The decoder selects the scan order from among multiple available scan orders and then reorders the coefficients back into a two-dimensional matrix using the selected scan order.

    Abstract translation: 提出了用于编码和解码频率系数块的技术和工具。 编码器从多个可用扫描顺序中选择扫描顺序,然后将所选择的扫描顺序应用于变换系数的二维矩阵,将频率系数的非零值在一维串中分组。 编码器根据多级嵌套集合表示对一维系列值串进行编码。 在解码中,解码器熵从多级嵌套集合表示解码系数值的一维串。 解码器从多个可用扫描顺序中选择扫描顺序,然后使用所选择的扫描顺序将系数重新排序成二维矩阵。

    Skip modes for inter-layer residual video coding and decoding
    6.
    发明授权
    Skip modes for inter-layer residual video coding and decoding 有权
    跳过模式进行层间残留视频编码和解码

    公开(公告)号:US08213503B2

    公开(公告)日:2012-07-03

    申请号:US12205689

    申请日:2008-09-05

    Abstract: Techniques and tools for skip modes in encoding and decoding of inter-layer residual video are described. For example, an encoder encodes multiple macroblocks of a picture of inter-layer residual video. For a current macroblock that is skipped, the encoder selects a skip mode from among multiple available skip modes and uses the selected skip mode when encoding the current macroblock. The skip modes can include intra skip mode and predicted-motion skip mode. A corresponding decoder, for the current macroblock, selects and uses the skip mode for the current macroblock during decoding. As another example, an encoder encodes multiple channels of a picture of inter-layer residual video. For each channel, the encoder determines whether to skip the channel. The encoder signals channel skip information to indicate which channels are skipped. A corresponding decoder parses the channel skip information and determines on a channel-by-channel basis whether to skip the respective channels.

    Abstract translation: 描述了层间残差视频编码和解码中跳过模式的技术和工具。 例如,编码器编码层间残留视频的图像的多个宏块。 对于跳过的当前宏块,编码器从多个可用跳过模式中选择跳过模式,并且在对当前宏块进行编码时使用所选择的跳过模式。 跳过模式可以包括帧内跳过模式和预测运动跳过模式。 对于当前宏块,对应的解码器在解码期间选择并使用当前宏块的跳过模式。 作为另一示例,编码器对层间残留视频的图像的多个信道进行编码。 对于每个通道,编码器确定是否跳过通道。 编码器信号通道跳过信息以指示哪些通道被跳过。 相应的解码器解析信道跳过信息,并且逐个信道地确定是否跳过相应的信道。

    ESTIMATING SAMPLE-DOMAIN DISTORTION IN THE TRANSFORM DOMAIN WITH ROUNDING COMPENSATION
    8.
    发明申请
    ESTIMATING SAMPLE-DOMAIN DISTORTION IN THE TRANSFORM DOMAIN WITH ROUNDING COMPENSATION 有权
    估算变形域中的样本域失真与圆周补偿

    公开(公告)号:US20120020409A1

    公开(公告)日:2012-01-26

    申请号:US13248784

    申请日:2011-09-29

    CPC classification number: H04N19/625 H04N19/126 H04N19/61

    Abstract: Techniques and tools are described for compensating for rounding when estimating sample-domain distortion in the transform domain. For example, a video encoder estimates pixel-domain distortion in the transform domain for a block of transform coefficients after compensating for rounding in the DC coefficient of the block. In this way, the video encoder improves the accuracy of pixel-domain distortion estimation but retains the computational advantages of performing the estimation in the transform domain. Rounding compensation includes, for example, looking up an index (from a de-quantized transform coefficient) in a rounding offset table to determine a rounding offset, then adjusting the coefficient by the offset. Other techniques and tools described herein are directed to creating rounding offset tables and encoders that make encoding decisions after considering rounding effects that occur after an inverse frequency transform on de-quantized transform coefficient values.

    Abstract translation: 描述了在估计变换域中的样本域失真时补偿舍入的技术和工具。 例如,视频编码器在补偿块的DC系数中的舍入后估计变换系数块的变换域中的像素域失真。 以这种方式,视频编码器提高了像素域失真估计的精度,但保留了在变换域中执行估计的计算优点。 舍入补偿包括例如在舍入偏移表中查找索引(来自去量化的变换系数)以确定舍入偏移,然后将系数调整为偏移。 本文描述的其他技术和工具旨在创建舍入偏移表和编码器,其在考虑在对于量化后的变换系数值进行逆频率变换之后出现的舍入效应时进行编码决定。

    NOISE ROBUST SPEECH CLASSIFIER ENSEMBLE
    9.
    发明申请
    NOISE ROBUST SPEECH CLASSIFIER ENSEMBLE 有权
    噪音强大的语音分类器ENSEMBLE

    公开(公告)号:US20100280827A1

    公开(公告)日:2010-11-04

    申请号:US12433143

    申请日:2009-04-30

    Abstract: Embodiments for implementing a speech recognition system that includes a speech classifier ensemble are disclosed. In accordance with one embodiment, the speech recognition system includes a classifier ensemble to convert feature vectors that represent a speech vector into log probability sets. The classifier ensemble includes a plurality of classifiers. The speech recognition system includes a decoder ensemble to transform the log probability sets into output symbol sequences. The speech recognition system further includes a query component to retrieve one or more speech utterances from a speech database using the output symbol sequences.

    Abstract translation: 公开了实现包括语音分类器集合的语音识别系统的实施例。 根据一个实施例,语音识别系统包括将表示语音向量的特征向量转换为对数概率集的分类器集合。 分类器集合包括多个分类器。 语音识别系统包括将对数概率集合变换为输出符号序列的解码器集合。 该语音识别系统还包括一个查询组件,用于使用输出符号序列从语音数据库中检索一个或多个语音话语。

    SKIP MODES FOR INTER-LAYER RESIDUAL VIDEO CODING AND DECODING
    10.
    发明申请
    SKIP MODES FOR INTER-LAYER RESIDUAL VIDEO CODING AND DECODING 有权
    用于层间残留视频编码和解码的跳过模式

    公开(公告)号:US20100061447A1

    公开(公告)日:2010-03-11

    申请号:US12205689

    申请日:2008-09-05

    Abstract: Techniques and tools for skip modes in encoding and decoding of inter-layer residual video are described. For example, an encoder encodes multiple macroblocks of a picture of inter-layer residual video. For a current macroblock that is skipped, the encoder selects a skip mode from among multiple available skip modes and uses the selected skip mode when encoding the current macroblock. The skip modes can include intra skip mode and predicted-motion skip mode. A corresponding decoder, for the current macroblock, selects and uses the skip mode for the current macroblock during decoding. As another example, an encoder encodes multiple channels of a picture of inter-layer residual video. For each channel, the encoder determines whether to skip the channel. The encoder signals channel skip information to indicate which channels are skipped. A corresponding decoder parses the channel skip information and determines on a channel-by-channel basis whether to skip the respective channels.

    Abstract translation: 描述了层间残差视频编码和解码中跳过模式的技术和工具。 例如,编码器编码层间残留视频的图像的多个宏块。 对于跳过的当前宏块,编码器从多个可用跳过模式中选择跳过模式,并且在对当前宏块进行编码时使用所选择的跳过模式。 跳过模式可以包括帧内跳过模式和预测运动跳过模式。 对于当前宏块,对应的解码器在解码期间选择并使用当前宏块的跳过模式。 作为另一示例,编码器对层间残留视频的图像的多个信道进行编码。 对于每个通道,编码器确定是否跳过通道。 编码器信号通道跳过信息以指示哪些通道被跳过。 相应的解码器解析信道跳过信息,并且逐个信道地确定是否跳过相应的信道。

Patent Agency Ranking