INTRA-MODE PREDICTION FOR A VIDEO TRANSCODER
    1.
    发明申请
    INTRA-MODE PREDICTION FOR A VIDEO TRANSCODER 有权
    视频转码器的内部模式预测

    公开(公告)号:US20120082220A1

    公开(公告)日:2012-04-05

    申请号:US13163853

    申请日:2011-06-20

    IPC分类号: H04N7/32

    摘要: A video transcoder for converting an encoded input video bit-stream having one spatial resolution into an encoded output video bit-stream having a lower spatial resolution, wherein learned statistics of intra-mode transcoding are used to constrain the search of intra modes for the output video bit-stream. The statistics of intra-mode transcoding can be gathered, e.g., by applying brute-force downsizing to a training set of video frames and then analyzing the observed intra-mode transcoding patterns to determine a transition-probability matrix for use during normal operation of the transcoder. The transition-probability matrix enables the transcoder to select appropriate intra modes for the output video bit-stream without performing a corresponding exhaustive full search, which advantageously reduces the computational complexity and processor load compared to those of a comparably performing prior-art video transcoder.

    摘要翻译: 一种用于将具有一个空间分辨率的编码输入视频比特流转换为具有较低空间分辨率的编码输出视频比特流的视频代码转换器,其中使用帧内模式代码转换的学习统计量来限制用于输出的帧内模式搜索 视频位流。 可以收集帧内模式转码的统计数据,例如,通过对视频帧的训练集应用强力小型化,然后分析观察到的模式内转码模式,以确定在正常操作期间使用的转换概率矩阵 代码转换器 转换概率矩阵使得代码转换器能够选择用于输出视频比特流的适当的帧内模式,而不执行相应的穷尽全搜索,这有助于降低计算复杂度和处理器负载,与相对于现有技术的现有技术的视频转码器相比。

    Intra-mode prediction for a video transcoder
    2.
    发明授权
    Intra-mode prediction for a video transcoder 有权
    视频转码器的模式间预测

    公开(公告)号:US08780983B2

    公开(公告)日:2014-07-15

    申请号:US13163853

    申请日:2011-06-20

    IPC分类号: H04N7/32 H04N7/26 H04N7/46

    摘要: A video transcoder for converting an encoded input video bit-stream having one spatial resolution into an encoded output video bit-stream having a lower spatial resolution, wherein learned statistics of intra-mode transcoding are used to constrain the search of intra modes for the output video bit-stream. The statistics of intra-mode transcoding can be gathered, e.g., by applying brute-force downsizing to a training set of video frames and then analyzing the observed intra-mode transcoding patterns to determine a transition-probability matrix for use during normal operation of the transcoder. The transition-probability matrix enables the transcoder to select appropriate intra modes for the output video bit-stream without performing a corresponding exhaustive full search, which advantageously reduces the computational complexity and processor load compared to those of a comparably performing prior-art video transcoder.

    摘要翻译: 一种用于将具有一个空间分辨率的编码输入视频比特流转换为具有较低空间分辨率的编码输出视频比特流的视频代码转换器,其中使用帧内模式代码转换的学习统计量来限制用于输出的帧内模式搜索 视频位流。 可以收集帧内模式转码的统计数据,例如,通过对视频帧的训练集应用强力小型化,然后分析观察到的模式内转码模式,以确定在正常操作期间使用的转换概率矩阵 代码转换器 转换概率矩阵使得代码转换器能够选择用于输出视频比特流的适当的帧内模式,而不执行相应的穷尽全搜索,这有助于降低计算复杂度和处理器负载,与相对于现有技术的现有技术的视频转码器相比。

    Motion Estimation for a Video Transcoder
    3.
    发明申请
    Motion Estimation for a Video Transcoder 有权
    视频转码器的运动估计

    公开(公告)号:US20120106642A1

    公开(公告)日:2012-05-03

    申请号:US13165056

    申请日:2011-06-21

    IPC分类号: H04N7/26

    CPC分类号: H04N19/40 H04N19/56 H04N19/59

    摘要: A video transcoder for converting an encoded input video bit-stream having one spatial resolution into an encoded output video bit-stream having a lower spatial resolution, wherein motion-vector dispersion observed at the higher spatial resolution is quantified and used to configure the motion-vector search at the lower spatial resolution. For example, for video-frame areas characterized by relatively low motion-vector dispersion values, the motion-vector search may be performed over a relatively small vector space and with the use of fewer search patterns and/or hierarchical search levels. These constraints enable the transcoder to find appropriate motion vectors for inter-prediction coding without having to perform an exhaustive motion-vector search for these video-frame areas, which advantageously reduces the computational complexity and processor load compared to those of a comparably performing prior-art video transcoder.

    摘要翻译: 一种用于将具有一个空间分辨率的编码输入视频比特流转换为具有较低空间分辨率的编码输出视频比特流的视频代码转换器,其中以较高空间分辨率观察到的运动矢量色散被量化并用于配置运动 - 矢量搜索在较低的空间分辨率。 例如,对于以相对低的运动矢量色散值为特征的视频帧区域,运动矢量搜索可以在相对较小的向量空间上并且使用较少的搜索模式和/或分级搜索级别来执行。 这些约束使得代码转换器能够找到用于帧间预测编码的适当的运动矢量,而不必对这些视频帧区域执行穷尽的运动矢量搜索,这有助于降低计算复杂度和处理器负载, 艺术视频转码器。

    Motion estimation for a video transcoder
    4.
    发明授权
    Motion estimation for a video transcoder 有权
    视频代码转换器的运动估计

    公开(公告)号:US08711941B2

    公开(公告)日:2014-04-29

    申请号:US13165056

    申请日:2011-06-21

    IPC分类号: H04N7/12 H04N11/02 G06K9/32

    CPC分类号: H04N19/40 H04N19/56 H04N19/59

    摘要: A video transcoder for converting an encoded input video bit-stream having one spatial resolution into an encoded output video bit-stream having a lower spatial resolution, wherein motion-vector dispersion observed at the higher spatial resolution is quantified and used to configure the motion-vector search at the lower spatial resolution. For example, for video-frame areas characterized by relatively low motion-vector dispersion values, the motion-vector search may be performed over a relatively small vector space and with the use of fewer search patterns and/or hierarchical search levels. These constraints enable the transcoder to find appropriate motion vectors for inter-prediction coding without having to perform an exhaustive motion-vector search for these video-frame areas, which advantageously reduces the computational complexity and processor load compared to those of a comparably performing prior-art video transcoder.

    摘要翻译: 一种用于将具有一个空间分辨率的编码输入视频比特流转换为具有较低空间分辨率的编码输出视频比特流的视频代码转换器,其中以较高空间分辨率观察到的运动矢量色散被量化并用于配置运动 - 矢量搜索在较低的空间分辨率。 例如,对于以相对低的运动矢量色散值为特征的视频帧区域,运动矢量搜索可以在相对较小的向量空间上并且使用较少的搜索模式和/或分级搜索级别来执行。 这些约束使得代码转换器能够找到用于帧间预测编码的适当的运动矢量,而不必对这些视频帧区域执行穷尽的运动矢量搜索,这有助于降低计算复杂度和处理器负载, 艺术视频转码器。

    VIDEO TRANSCODER WITH FLEXIBLE QUALITY AND COMPLEXITY MANAGEMENT
    5.
    发明申请
    VIDEO TRANSCODER WITH FLEXIBLE QUALITY AND COMPLEXITY MANAGEMENT 有权
    视频转码器具有灵活的质量和复杂性管理

    公开(公告)号:US20120051440A1

    公开(公告)日:2012-03-01

    申请号:US13069552

    申请日:2011-03-23

    IPC分类号: H04N7/12

    摘要: A video transcoder for converting a compressed input video bit-stream having one spatial resolution into a compressed output video bit-stream having a different spatial resolution in a manner that enables the transcoder to dynamically change the amount of computational resources allocated to the conversion process. In one embodiment, the video transcoder has a plurality of configurable processing paths whose configuration determines the amount of allocated computational resources. Exemplary processing-path configuration changes may include, but are not limited to engaging or disengaging a processing path, redirecting a data flow from flowing through one processing path to flowing through another processing path, and attaching or detaching one or more processing modules to an engaged processing path. The capability to make these and other configuration changes enables the video transcoder to adjust the computational complexity and picture quality on the fly, without interrupting the video sequence in the output video bit-stream.

    摘要翻译: 一种视频代码转换器,用于将具有一个空间分辨率的压缩输入视频比特流转换成具有不同空间分辨率的压缩输出视频比特流,使得代码转换器能够动态地改变分配给转换处理的计算资源量。 在一个实施例中,视频代码转换器具有多个可配置的处理路径,其配置确定分配的计算资源的数量。 示例性的处理路径配置改变可以包括但不限于接合或分离处理路径,将数据流从流经一个处理路径重定向到流过另一个处理路径,以及将一个或多个处理模块附接到一个或多个处理模块 处理路径 进行这些和其他配置更改的能力使视频代码转换器能够即时调整计算复杂度和图像质量,而不会中断输出视频比特流中的视频序列。

    Video transcoder with flexible quality and complexity management
    6.
    发明授权
    Video transcoder with flexible quality and complexity management 有权
    视频转码器具有灵活的质量和复杂性管理

    公开(公告)号:US08731068B2

    公开(公告)日:2014-05-20

    申请号:US13069552

    申请日:2011-03-23

    IPC分类号: H04N7/50

    摘要: A video transcoder for converting a compressed input video bit-stream having one spatial resolution into a compressed output video bit-stream having a different spatial resolution in a manner that enables the transcoder to dynamically change the amount of computational resources allocated to the conversion process. In one embodiment, the video transcoder has a plurality of configurable processing paths whose configuration determines the amount of allocated computational resources. Exemplary processing-path configuration changes may include, but are not limited to engaging or disengaging a processing path, redirecting a data flow from flowing through one processing path to flowing through another processing path, and attaching or detaching one or more processing modules to an engaged processing path. The capability to make these and other configuration changes enables the video transcoder to adjust the computational complexity and picture quality on the fly, without interrupting the video sequence in the output video bit-stream.

    摘要翻译: 一种视频代码转换器,用于将具有一个空间分辨率的压缩输入视频比特流转换成具有不同空间分辨率的压缩输出视频比特流,使得代码转换器能够动态地改变分配给转换处理的计算资源量。 在一个实施例中,视频代码转换器具有多个可配置的处理路径,其配置确定分配的计算资源的数量。 示例性的处理路径配置改变可以包括但不限于接合或分离处理路径,将数据流从流经一个处理路径重定向到流过另一个处理路径,以及将一个或多个处理模块附接到一个或多个处理模块 处理路径 进行这些和其他配置更改的能力使视频代码转换器能够即时调整计算复杂度和图像质量,而不会中断输出视频比特流中的视频序列。

    MIXED-MODE RESIZING FOR A VIDEO TRANSCODER
    7.
    发明申请
    MIXED-MODE RESIZING FOR A VIDEO TRANSCODER 审中-公开
    视频转码器的混合模式调整

    公开(公告)号:US20120051427A1

    公开(公告)日:2012-03-01

    申请号:US13069731

    申请日:2011-03-23

    IPC分类号: H04N7/46

    摘要: A video transcoder for converting a compressed input video bit-stream having one spatial resolution into a compressed output video bit-stream having a different spatial resolution using a plurality of resizing channels. The transcoder has a kernel that partially decodes the compressed input video bit-stream to generate partially decoded video data. The data segments corresponding to picture portions that have both intra- and inter-predicted blocks in close spatial proximity to one another are applied to a mixed-mode resizing channel that is specifically designed for processing such data segments. For each received data segment, the control logic of the channel selects, from a bank of pre-configured resizers, a resizer that is deemed to be most suitable for resizing the image portion represented by that data segment in a computationally efficient manner. The data segment is processed in the selected resizer to generate the corresponding resized-image data. The resized-image data generated by the mixed-mode resizing channel are combined with the resized data generated by other resizing channels of the transcoder and then re-encoded to generate the compressed output video bit-stream.

    摘要翻译: 一种视频代码转换器,用于使用多个调整大小的信道将具有一个空间分辨率的压缩输入视频比特流转换为具有不同空间分辨率的压缩输出视频比特流。 代码转换器具有对压缩的输入视频比特流进行部分解码以生成部分解码的视频数据的内核。 对应于具有彼此紧密地空间接近的帧内和帧间预测的块的图像部分的数据段被应用于专门设计用于处理这样的数据段的混合模式调整大小信道。 对于每个接收的数据段,信道的控制逻辑从预先配置的重定位器组中选择被认为最适合于以计算有效的方式调整由该数据段表示的图像部分的大小的调整器。 在所选择的调整器中处理数据段以产生相应的调整大小的图像数据。 由混合模式调整大小信道生成的经调整大小的图像数据与由代码转换器的其它调整大小信道生成的调整大小的数据组合,然后被重新编码以产生压缩的输出视频比特流。

    TIME-DOMAIN ACOUSTIC ECHO CONTROL
    8.
    发明申请
    TIME-DOMAIN ACOUSTIC ECHO CONTROL 有权
    时域声控ECHO控制

    公开(公告)号:US20120201370A1

    公开(公告)日:2012-08-09

    申请号:US13222050

    申请日:2011-08-31

    IPC分类号: H04B3/23 H04M9/08

    CPC分类号: H04M9/082 H04B3/23

    摘要: In one embodiment, an acoustic echo control (AEC) module receives an outgoing signal and an incoming signal, which, at various times, contains acoustic echo corresponding to the outgoing signal. The AEC module has a delay estimation block that estimates, in the time domain, the echo delay using an adaptive filtering technique. This delay estimation is used to align samples of the incoming signal having acoustic echo with the corresponding samples of the outgoing signal from which the acoustic echo originated. The AEC module determines whether or not samples of the incoming signal contain acoustic echo based on the aligned outgoing signal, and the determinations are applied to a hangover counter. The AEC module then suppresses acoustic echo in the incoming signal and adds comfort noise to the incoming signal. The amount of echo suppression performed is gradually increased or decreased based on comparisons of the counter to a hangover threshold.

    摘要翻译: 在一个实施例中,声学回声控制(AEC)模块接收输出信号和输入信号,其在不同时间包含对应于输出信号的声学回声。 AEC模块具有延迟估计块,其在时域中使用自适应滤波技术来估计回波延迟。 该延迟估计用于将具有声学回波的输入信号的样本与来自声学回声的输出信号的对应样本对准。 AEC模块确定输入信号的采样是否基于对齐的输出信号包含声学回波,并且确定应用于宿醉计数器。 AEC模块然后抑制输入信号中的声学回声,并为输入信号增加舒适噪声。 基于计数器与宿醉阈值的比较,进行的回波抑制量逐渐增加或减少。

    Time-domain acoustic echo control
    9.
    发明授权
    Time-domain acoustic echo control 有权
    时域声学回声控制

    公开(公告)号:US08824667B2

    公开(公告)日:2014-09-02

    申请号:US13222050

    申请日:2011-08-31

    IPC分类号: H04M9/08 H04B3/20 H04B3/23

    CPC分类号: H04M9/082 H04B3/23

    摘要: In one embodiment, an acoustic echo control (AEC) module receives an outgoing signal and an incoming signal, which, at various times, contains acoustic echo corresponding to the outgoing signal. The AEC module has a delay estimation block that estimates, in the time domain, the echo delay using an adaptive filtering technique. This delay estimation is used to align samples of the incoming signal having acoustic echo with the corresponding samples of the outgoing signal from which the acoustic echo originated. The AEC module determines whether or not samples of the incoming signal contain acoustic echo based on the aligned outgoing signal, and the determinations are applied to a hangover counter. The AEC module then suppresses acoustic echo in the incoming signal and adds comfort noise to the incoming signal. The amount of echo suppression performed is gradually increased or decreased based on comparisons of the counter to a hangover threshold.

    摘要翻译: 在一个实施例中,声学回声控制(AEC)模块接收输出信号和输入信号,其在不同时间包含对应于输出信号的声学回声。 AEC模块具有延迟估计块,其在时域中使用自适应滤波技术来估计回波延迟。 该延迟估计用于将具有声学回波的输入信号的样本与来自声学回声的输出信号的对应样本对准。 AEC模块确定输入信号的采样是否基于对齐的输出信号包含声学回波,并且确定应用于宿醉计数器。 AEC模块然后抑制输入信号中的声学回声,并为输入信号增加舒适噪声。 基于计数器与宿醉阈值的比较,进行的回波抑制量逐渐增加或减少。

    High-performance tone detection using a digital signal processor (DSP) having multiple arithmetic logic units (ALUs)
    10.
    发明授权
    High-performance tone detection using a digital signal processor (DSP) having multiple arithmetic logic units (ALUs) 失效
    使用具有多个算术逻辑单元(ALU)的数字信号处理器(DSP)的高性能音调检测

    公开(公告)号:US08761916B2

    公开(公告)日:2014-06-24

    申请号:US13124474

    申请日:2009-05-28

    IPC分类号: G06F17/00

    CPC分类号: H04Q1/453 G10L25/90

    摘要: In one embodiment, a DSP having four arithmetic logic units (ALUs) and able to have two read/write operations per clock cycle performs silence detection and tone detection for data frames containing samples of an audio signal. The ALUs are used together in parallel to process the samples in the data frames received by the DSP. A received data frame is filtered by the silence detection so that substantially silent frames are dropped and non-silent frames are further processed. In the tone detection, a filtered data frame is processed, four samples at a time, to determine the power of the signal at a given frequency, where the power determination is used to determine whether a given tone (i.e., a signal at a given frequency) is present in the data frame.

    摘要翻译: 在一个实施例中,具有四个算术逻辑单元(ALU)并且能够在每个时钟周期中具有两个读/写操作的DSP对包含音频信号的样本的数据帧执行静音检测和音调检测。 ALU并行使用,以处理由DSP接收的数据帧中的样本。 接收到的数据帧通过静音检测进行过滤,使得基本静默的帧被丢弃,非静音帧被进一步处理。 在音调检测中,处理滤波后的数据帧,一次四个采样,以确定给定频率处的信号功率,其中功率确定用于确定给定的音调(即,给定的信号 频率)存在于数据帧中。