-
1.
公开(公告)号:US20120051440A1
公开(公告)日:2012-03-01
申请号:US13069552
申请日:2011-03-23
申请人: Denis Vassilevich Parfenov , Pavel Aleksandrovich Aliseychik , Aleksey Alexandrovich Letunovskiy , Alexander Markovic , Ivan Leonidovich Mazurenko , Denis Vladimirovich Parkhomenko
发明人: Denis Vassilevich Parfenov , Pavel Aleksandrovich Aliseychik , Aleksey Alexandrovich Letunovskiy , Alexander Markovic , Ivan Leonidovich Mazurenko , Denis Vladimirovich Parkhomenko
IPC分类号: H04N7/12
CPC分类号: H04N19/59 , H04N19/102 , H04N19/159 , H04N19/176 , H04N19/40 , H04N19/61
摘要: A video transcoder for converting a compressed input video bit-stream having one spatial resolution into a compressed output video bit-stream having a different spatial resolution in a manner that enables the transcoder to dynamically change the amount of computational resources allocated to the conversion process. In one embodiment, the video transcoder has a plurality of configurable processing paths whose configuration determines the amount of allocated computational resources. Exemplary processing-path configuration changes may include, but are not limited to engaging or disengaging a processing path, redirecting a data flow from flowing through one processing path to flowing through another processing path, and attaching or detaching one or more processing modules to an engaged processing path. The capability to make these and other configuration changes enables the video transcoder to adjust the computational complexity and picture quality on the fly, without interrupting the video sequence in the output video bit-stream.
摘要翻译: 一种视频代码转换器,用于将具有一个空间分辨率的压缩输入视频比特流转换成具有不同空间分辨率的压缩输出视频比特流,使得代码转换器能够动态地改变分配给转换处理的计算资源量。 在一个实施例中,视频代码转换器具有多个可配置的处理路径,其配置确定分配的计算资源的数量。 示例性的处理路径配置改变可以包括但不限于接合或分离处理路径,将数据流从流经一个处理路径重定向到流过另一个处理路径,以及将一个或多个处理模块附接到一个或多个处理模块 处理路径 进行这些和其他配置更改的能力使视频代码转换器能够即时调整计算复杂度和图像质量,而不会中断输出视频比特流中的视频序列。
-
2.
公开(公告)号:US08731068B2
公开(公告)日:2014-05-20
申请号:US13069552
申请日:2011-03-23
申请人: Denis Vassilevich Parfenov , Pavel Aleksandrovich Aliseychik , Aleksey Alexandrovich Letunovskiy , Alexander Markovic , Ivan Leonidovich Mazurenko , Denis Vladimirovich Parkhomenko
发明人: Denis Vassilevich Parfenov , Pavel Aleksandrovich Aliseychik , Aleksey Alexandrovich Letunovskiy , Alexander Markovic , Ivan Leonidovich Mazurenko , Denis Vladimirovich Parkhomenko
IPC分类号: H04N7/50
CPC分类号: H04N19/59 , H04N19/102 , H04N19/159 , H04N19/176 , H04N19/40 , H04N19/61
摘要: A video transcoder for converting a compressed input video bit-stream having one spatial resolution into a compressed output video bit-stream having a different spatial resolution in a manner that enables the transcoder to dynamically change the amount of computational resources allocated to the conversion process. In one embodiment, the video transcoder has a plurality of configurable processing paths whose configuration determines the amount of allocated computational resources. Exemplary processing-path configuration changes may include, but are not limited to engaging or disengaging a processing path, redirecting a data flow from flowing through one processing path to flowing through another processing path, and attaching or detaching one or more processing modules to an engaged processing path. The capability to make these and other configuration changes enables the video transcoder to adjust the computational complexity and picture quality on the fly, without interrupting the video sequence in the output video bit-stream.
摘要翻译: 一种视频代码转换器,用于将具有一个空间分辨率的压缩输入视频比特流转换成具有不同空间分辨率的压缩输出视频比特流,使得代码转换器能够动态地改变分配给转换处理的计算资源量。 在一个实施例中,视频代码转换器具有多个可配置的处理路径,其配置确定分配的计算资源的数量。 示例性的处理路径配置改变可以包括但不限于接合或分离处理路径,将数据流从流经一个处理路径重定向到流过另一个处理路径,以及将一个或多个处理模块附接到一个或多个处理模块 处理路径 进行这些和其他配置更改的能力使视频代码转换器能够即时调整计算复杂度和图像质量,而不会中断输出视频比特流中的视频序列。
-
公开(公告)号:US20120051427A1
公开(公告)日:2012-03-01
申请号:US13069731
申请日:2011-03-23
申请人: Denis Vassilevich Parfenov , Pavel Aleksandrovich Aliseychik , Aleksey Alexandrovich Letunovskiy , Alexander Markovic , Ivan Leonidovich Mazurenko , Denis Vladimirovich Parkhomenko
发明人: Denis Vassilevich Parfenov , Pavel Aleksandrovich Aliseychik , Aleksey Alexandrovich Letunovskiy , Alexander Markovic , Ivan Leonidovich Mazurenko , Denis Vladimirovich Parkhomenko
IPC分类号: H04N7/46
CPC分类号: H04N19/59 , H04N19/102 , H04N19/159 , H04N19/176 , H04N19/40 , H04N19/61
摘要: A video transcoder for converting a compressed input video bit-stream having one spatial resolution into a compressed output video bit-stream having a different spatial resolution using a plurality of resizing channels. The transcoder has a kernel that partially decodes the compressed input video bit-stream to generate partially decoded video data. The data segments corresponding to picture portions that have both intra- and inter-predicted blocks in close spatial proximity to one another are applied to a mixed-mode resizing channel that is specifically designed for processing such data segments. For each received data segment, the control logic of the channel selects, from a bank of pre-configured resizers, a resizer that is deemed to be most suitable for resizing the image portion represented by that data segment in a computationally efficient manner. The data segment is processed in the selected resizer to generate the corresponding resized-image data. The resized-image data generated by the mixed-mode resizing channel are combined with the resized data generated by other resizing channels of the transcoder and then re-encoded to generate the compressed output video bit-stream.
摘要翻译: 一种视频代码转换器,用于使用多个调整大小的信道将具有一个空间分辨率的压缩输入视频比特流转换为具有不同空间分辨率的压缩输出视频比特流。 代码转换器具有对压缩的输入视频比特流进行部分解码以生成部分解码的视频数据的内核。 对应于具有彼此紧密地空间接近的帧内和帧间预测的块的图像部分的数据段被应用于专门设计用于处理这样的数据段的混合模式调整大小信道。 对于每个接收的数据段,信道的控制逻辑从预先配置的重定位器组中选择被认为最适合于以计算有效的方式调整由该数据段表示的图像部分的大小的调整器。 在所选择的调整器中处理数据段以产生相应的调整大小的图像数据。 由混合模式调整大小信道生成的经调整大小的图像数据与由代码转换器的其它调整大小信道生成的调整大小的数据组合,然后被重新编码以产生压缩的输出视频比特流。
-
公开(公告)号:US20130028317A1
公开(公告)日:2013-01-31
申请号:US13370670
申请日:2012-02-10
申请人: Denis Vassilevich Parfenov , Ivan Leonidovich Mazurenko , Aleksey Alexandrovich Letunovskiy , Pavel Aleksandrovich Aliseychik , Denis Vladimirovich Parkhomenko
发明人: Denis Vassilevich Parfenov , Ivan Leonidovich Mazurenko , Aleksey Alexandrovich Letunovskiy , Pavel Aleksandrovich Aliseychik , Denis Vladimirovich Parkhomenko
IPC分类号: H04N7/32
CPC分类号: H04N19/154 , H04N19/11 , H04N19/176 , H04N19/196
摘要: A search method for identifying an intra mode that can produce acceptable video-encoding quality for a pixel block while striking a proper balance between the quality and processor load. In a representative embodiment, the search method relies on a set of mode-selection rules for iteratively identifying candidate intra modes. Each identified candidate is evaluated based on a comparison of its sum of absolute differences (SAD) with the smallest SAD in the set of the previously identified candidates. The mode-selection rules use the comparison results as conditions that efficiently guide the search method toward an intra mode that is suitable for encoding the pixel block with acceptable video quality. On average, a representative embodiment of the search method disclosed herein is advantageously capable of finding a suitable intra mode in fewer iterations than a comparable prior-art search method.
摘要翻译: 一种用于识别能够在质量和处理器负载之间达到适当平衡的情况下为像素块产生可接受的视频编码质量的帧内模式的搜索方法。 在代表性实施例中,搜索方法依赖于用于迭代地识别候选帧内模式的一组模式选择规则。 基于其绝对差(SAD)与先前识别的候选组中的最小SAD的比较来评估每个识别的候选。 模式选择规则使用比较结果作为将搜索方法有效地引向适合于以可接受的视频质量对像素块进行编码的帧内模式的条件。 平均而言,本文公开的搜索方法的代表性实施例有利地能够以比可比较的现有技术搜索方法更少的迭代找到合适的帧内模式。
-
公开(公告)号:US20120155655A1
公开(公告)日:2012-06-21
申请号:US13205856
申请日:2011-08-09
申请人: Denis Vladimirovich Parkhomenko , Pavel Aleksandrovich Aliseychik , Dmitry Nikolaevich Babin , Alexander Markovic , Ivan Leonidovich Mazurenko
发明人: Denis Vladimirovich Parkhomenko , Pavel Aleksandrovich Aliseychik , Dmitry Nikolaevich Babin , Alexander Markovic , Ivan Leonidovich Mazurenko
IPC分类号: H04R29/00
CPC分类号: H04R3/00
摘要: In one embodiment, a pause-based music detection (MD) module detects music by analyzing pauses in a received audio signal. The energy of each frame of the signal is compared to an energy threshold to determine whether the frame corresponds to background noise only (i.e., a pause) or sound such as speech or music. A window having a number of frames is analyzed to determine whether there is a pause within the window. If no pauses are detected in the window, then the current frame is presumed to correspond to music. If a pause is detected, then the current frame is presumed to correspond to speech. In another embodiment, the pause-based MD module output is applied to Boolean “OR” logic along with a tone-based MD module output to generate a final MD decision. The tone-based MD module detects music by analyzing tones in the signal using any suitable tone-based MD algorithm.
摘要翻译: 在一个实施例中,基于暂停的音乐检测(MD)模块通过分析所接收的音频信号中的暂停来检测音乐。 将信号的每帧的能量与能量阈值进行比较,以确定帧是否仅对应于背景噪声(即,暂停)或诸如语音或音乐的声音。 分析具有多个帧的窗口以确定窗口内是否存在暂停。 如果在窗口中没有检测到暂停,则假设当前帧对应于音乐。 如果检测到暂停,则认为当前帧对应于语音。 在另一个实施例中,将基于暂停的MD模块输出与基于音调的MD模块输出一起应用于布尔“OR”逻辑以产生最终的MD决定。 基于音调的MD模块通过使用任何合适的基于音调的MD算法分析信号中的音调来检测音乐。
-
公开(公告)号:US20120224684A1
公开(公告)日:2012-09-06
申请号:US13222132
申请日:2011-08-31
申请人: Alexander Alexandrovich Petyushko , Dmitry Nikolaevich Babin , Alexander Markovic , Ivan Leonidovich Mazurenko , Denis Vladimirovich Parkhomenko
发明人: Alexander Alexandrovich Petyushko , Dmitry Nikolaevich Babin , Alexander Markovic , Ivan Leonidovich Mazurenko , Denis Vladimirovich Parkhomenko
摘要: In one embodiment, a high-level compensation (HLC) module receives samples of an input signal and determines whether a magnitude of each sample, represented in a linear domain, is relatively low or relatively high by comparing the magnitude to a threshold. If a sample is less than or equal to the threshold, then it is considered to have a relatively low magnitude and the sample is not attenuated. If a sample is greater than the threshold, then it is considered to have a relatively high magnitude and the HLC module attenuates the sample according to a “soft” non-linear function. The “soft” non-linear function is characterized by at least two of the following characteristics: the non-linear function (i) increases monotonically, (ii) forms a convex upwards curve, (iii) has a first derivative at the threshold equal to one, and (iv) has a first derivative at a maximum possible magnitude value equal to zero.
摘要翻译: 在一个实施例中,高电平补偿(HLC)模块接收输入信号的采样并且通过将幅度与阈值进行比较来确定在线性域中表示的每个采样的幅度是相对较低还是相对较高。 如果样品小于或等于阈值,则认为其具有相对较低的量值,并且样品不被衰减。 如果样本大于阈值,则认为其具有相对较高的幅度,并且HLC模块根据“软”非线性函数衰减样本。 “软”非线性函数的特征在于以下特征中的至少两个:非线性函数(i)单调增加,(ii)形成凸起的上升曲线,(iii)具有阈值下的一阶导数等于 至(iv)具有等于零的最大可能幅度值的一阶导数。
-
公开(公告)号:US20120158401A1
公开(公告)日:2012-06-21
申请号:US13205882
申请日:2011-08-09
申请人: Ivan Leonidovich Mazurenko , Dmitry Nikolaevich Babin , Alexander Markovic , Denis Vladimirovich Parkhomenko , Alexander Alexandrovich Petyushko
发明人: Ivan Leonidovich Mazurenko , Dmitry Nikolaevich Babin , Alexander Markovic , Denis Vladimirovich Parkhomenko , Alexander Alexandrovich Petyushko
IPC分类号: G10L11/06
CPC分类号: G10L25/81
摘要: In one embodiment, a music detection (MD) module accumulates sets of one or more frames and performs FFT processing on each set to recover a set of coefficients, each corresponding to a different frequency k. For each frame, the module identifies candidate musical tones by searching for peak values in the set of coefficients. If a coefficient corresponds to a peak, then a variable TONE[k] corresponding to the coefficient is set equal to one. Otherwise, the variable is set equal to zero. For each variable TONE[k] having a value of one, a corresponding accumulator A[k] is increased. Candidate musical tones that are short in duration are filtered out by comparing each accumulator A[k] to a minimum duration threshold. A determination is made as to whether or not music is present based on a number of candidate musical tones and a sum of candidate musical tone durations using a state machine.
摘要翻译: 在一个实施例中,音乐检测(MD)模块累积一个或多个帧的集合,并对每个集合执行FFT处理以恢复一组系数,每个系数对应于不同的频率k。 对于每个帧,模块通过搜索该组系数中的峰值来识别候选音调。 如果系数对应于峰值,那么对应于系数的变量TONE [k]设置为等于1。 否则,变量设置为零。 对于值为1的每个变量TONE [k],相应的累加器A [k]增加。 通过将每个累加器A [k]与最小持续时间阈值进行比较来滤除持续时间短的候选音调。 基于使用状态机的候选音乐音调的数量和候选音乐持续时间的总和,确定音乐是否存在。
-
-
-
-
-
-