-
公开(公告)号:US07085320B2
公开(公告)日:2006-08-01
申请号:US09953053
申请日:2001-09-14
申请人: He Ouyang , Li Sha , Shuhua Xiang , Yaojun Luo , Weimin Zeng , Jun Ding
发明人: He Ouyang , Li Sha , Shuhua Xiang , Yaojun Luo , Weimin Zeng , Jun Ding
IPC分类号: H04B7/12
CPC分类号: H04N21/440218 , H04N19/12 , H04N19/40 , H04N19/42 , H04N19/61
摘要: A video compression scheme enables the user to select one of many video compression formats, including the widely-used standard video formats such as MPEG-1, MPEG-2, MPEG-4 and H.263. In one embodiment, the scheme is implemented as a hardware-software combination, with the hardware portion, preferably implemented as an ASIC chip, performing the core compression and the software portion dealing with the detailed formatting. In another embodiment, a 32-bit aligned transitional data format is used.
摘要翻译: 视频压缩方案使得用户能够选择许多视频压缩格式之一,包括广泛使用的标准视频格式,如MPEG-1,MPEG-2,MPEG-4和H.263。 在一个实施例中,该方案被实现为硬件 - 软件组合,硬件部分优选地实现为ASIC芯片,执行核心压缩以及处理详细格式化的软件部分。 在另一个实施例中,使用32位对齐的过渡数据格式。
-
公开(公告)号:US20050226324A1
公开(公告)日:2005-10-13
申请号:US09953053
申请日:2001-09-14
申请人: He Ouyang , Li Sha , Shuhua Xiang , Yaojun Luo , Weimin Zeng , Jun Ding
发明人: He Ouyang , Li Sha , Shuhua Xiang , Yaojun Luo , Weimin Zeng , Jun Ding
CPC分类号: H04N21/440218 , H04N19/12 , H04N19/40 , H04N19/42 , H04N19/61
摘要: A video compression scheme enables the user to select one of many video compression formats, including the widely-used standard video formats such as MPEG-1, MPEG-2, MPEG-4 and H.263. In one embodiment, the scheme is implemented as a hardware-software combination, with the hardware portion, preferably implemented as an ASIC chip, performing the core compression and the software portion dealing with the detailed formatting. In another embodiment, a 32-bit aligned transitional data format is used.
摘要翻译: 视频压缩方案使得用户能够选择许多视频压缩格式之一,包括广泛使用的标准视频格式,如MPEG-1,MPEG-2,MPEG-4和H.263。 在一个实施例中,该方案被实现为硬件 - 软件组合,硬件部分优选地实现为ASIC芯片,执行核心压缩以及处理详细格式化的软件部分。 在另一个实施例中,使用32位对齐的过渡数据格式。
-
公开(公告)号:US20050207488A1
公开(公告)日:2005-09-22
申请号:US09924140
申请日:2001-08-07
申请人: He Ouyang , Li Sha , Shuhua Xiang , Ping Zhu , Yaojun Luo
发明人: He Ouyang , Li Sha , Shuhua Xiang , Ping Zhu , Yaojun Luo
CPC分类号: G06F17/147 , H04N19/423 , H04N19/43 , H04N19/61
摘要: A method, apparatus, computer medium, and other embodiments for discrete cosine transform and inverse discrete cosine transform (DCT/IDCT) of image signals are described. A DCT/IDCT module includes a plurality of different cores. One embodiment of a core includes two sets of lookup tables to provide multiplication and add operations for the DCT and IDCT functions. Another embodiment of a core include one set of lookup tables, while another embodiment of a core includes no lookup table. The DCT/IDCT module provides forward DCT and IDCT functionality without the use of additional multipliers.
摘要翻译: 描述了用于图像信号的离散余弦变换和逆离散余弦变换(DCT / IDCT)的方法,装置,计算机介质和其他实施例。 DCT / IDCT模块包括多个不同的核。 核心的一个实施例包括两组查找表,用于为DCT和IDCT功能提供乘法和加法运算。 核心的另一实施例包括一组查找表,而核心的另一实施例不包括查找表。 DCT / IDCT模块提供前向DCT和IDCT功能,而不需要使用额外的乘法器。
-
公开(公告)号:US07142251B2
公开(公告)日:2006-11-28
申请号:US10210254
申请日:2002-07-31
申请人: Li Sha , Shuhua Xiang , Yaojun Luo , He Ouyang
发明人: Li Sha , Shuhua Xiang , Yaojun Luo , He Ouyang
IPC分类号: H04N9/46
CPC分类号: H04N19/61
摘要: A video input processor is provided to process different input video format, including RGB, RGB Bayer, YUV 4:2:2 interlaced and progressive video data. The video input processor also uses an advanced algorithm to efficiently convert video data in RGB color space to YUV color space. The video input processor further enables multi-functions such as video image scaling, video image filtering before the video data are output for further video compression.
摘要翻译: 提供视频输入处理器来处理不同的输入视频格式,包括RGB,RGB拜耳,YUV 4:2:2隔行和逐行视频数据。 视频输入处理器还使用高级算法将RGB色彩空间中的视频数据有效地转换为YUV色彩空间。 视频输入处理器进一步实现诸如视频图像缩放,视频数据输出之前的视频图像滤波等多功能,用于进一步的视频压缩。
-
公开(公告)号:US20050206784A1
公开(公告)日:2005-09-22
申请号:US10210254
申请日:2002-07-31
申请人: Sha Li , Shuhua Xiang , Yaojun Luo , He Ouyang
发明人: Sha Li , Shuhua Xiang , Yaojun Luo , He Ouyang
CPC分类号: H04N19/61
摘要: A video input processor is provided to process different input video format, including RGB, RGB Bayer, YUV 4:2:2 interlaced and progressive video data. The video input processor also uses an advanced algorithm to efficiently convert video data in RGB color space to YUV color space. The video input processor further enables multi-functions such as video image scaling, video image filtering before the video data are output for further video compression.
摘要翻译: 提供视频输入处理器来处理不同的输入视频格式,包括RGB,RGB拜耳,YUV 4:2:2隔行和逐行视频数据。 视频输入处理器还使用高级算法将RGB色彩空间中的视频数据有效地转换为YUV色彩空间。 视频输入处理器进一步实现诸如视频图像缩放,视频数据输出之前的视频图像滤波等多功能,用于进一步的视频压缩。
-
公开(公告)号:US20070027677A1
公开(公告)日:2007-02-01
申请号:US11458143
申请日:2006-07-18
申请人: He Ouyang , Binghui Wu , Yi Zhou , Lin Luo , Kai Wan
发明人: He Ouyang , Binghui Wu , Yi Zhou , Lin Luo , Kai Wan
IPC分类号: G10L19/00
CPC分类号: G10L19/032 , G10L19/002
摘要: This invention discloses an implementation of audio codec, which has low computational complexity, small memory footprint and high coding efficiency. It can be used in handheld devices, SoC or ASIC products and embedded systems. At the encoder side: first, apply time-to-frequency transform to audio signals, obtaining un-quantized spectrum data; second, based on the un-quantized spectrum data and target bit count, calculate the corresponding information of optimal scale factor, frequency band group, code table index and quantized spectrum by iteration; third, calculate and format bit-stream; fourth, output formatted bit-stream. At the decoder side: parse the formatted bit-stream, apply decoding and inverse quantization to the spectrum of each frame, reconstruct temporal audio data by frequency-to-time transform, and reconstruct the time-domain signals of each channel.
摘要翻译: 本发明公开了一种具有低计算复杂度,小内存占用和高编码效率的音频编解码器的实现。 它可以用于手持设备,SoC或ASIC产品和嵌入式系统。 在编码器侧:首先,对音频信号进行时间 - 频率变换,获得未量化的频谱数据; 第二,基于未量化的频谱数据和目标比特数,通过迭代计算最佳比例因子,频带组,码表索引和量化频谱的对应信息; 第三,计算和格式比特流; 第四,输出格式化的比特流。 在解码器侧:解码格式化的比特流,对每个帧的频谱应用解码和反量化,通过频率 - 时间变换重建时间音频数据,并重构每个信道的时域信号。
-
公开(公告)号:US20070033011A1
公开(公告)日:2007-02-08
申请号:US11458207
申请日:2006-07-18
申请人: He Ouyang , Binghui Wu , Yi Zhou , Lin Luo , Kai Wan
发明人: He Ouyang , Binghui Wu , Yi Zhou , Lin Luo , Kai Wan
IPC分类号: G10L19/14
CPC分类号: G10L19/0204 , G10L19/24
摘要: This invention discloses a method of frequency band group partition for wideband audio codec. It can determine the initial frequency band group partition within the whole effective range of frequency bands. It further subdivides frequency band groups based on the initial partition. Instead of the iteration-based algorithm, this invention applies the 1-from-2 and 1-from-3 criterions to accomplish the fast partition with at most 3 subdivisions. This invention implements the fast partition for frequency band group without the loss of the coding efficiency. By applying this fast partition method, one can greatly reduce the computational complexity and significantly improve the coding performance.
摘要翻译: 本发明公开了一种用于宽带音频编解码器的频带分组方法。 它可以确定频带整个有效范围内的初始频带组分区。 它会根据初始分区进一步细分频段组。 本发明代替基于迭代的算法,应用1从2和1从3的标准来完成具有最多3个细分的快速分区。 本发明实现了对于频带组的快速划分,而不损失编码效率。 通过应用这种快速分割方法,可以大大降低计算复杂度并显着提高编码性能。
-
公开(公告)号:US20070033022A1
公开(公告)日:2007-02-08
申请号:US11458179
申请日:2006-07-18
申请人: He Ouyang , Yi Zhou , Binghui Wu , Lin Luo , Kai Wan
发明人: He Ouyang , Yi Zhou , Binghui Wu , Lin Luo , Kai Wan
IPC分类号: G10L19/02
CPC分类号: G10L19/035 , G10L19/002 , G10L19/0204 , G10L19/24 , G10L25/18
摘要: This invention discloses a method of bit-rate control and adjustment for audio coding, which comprises following steps: obtain the spectrum of the current audio frame and compute the maximum absolute value of each Bark (Bark: in the unit of critical band) frequency band; calculate the initial value of the minimum scale factor threshold and set the scale factor for each Bark band; Scale the spectrum of each audio frame with different scale factor, encode the quantized spectrum and calculate the coded bit of the current frame; Determine whether or not the coded bits of current frame is within the expected range of the bits, if yes, the bitstream is formatted and outputted, otherwise the minimum scale factor threshold is adjusted and repeat the above steps until the requirement is met. This method can significantly improve the encoding speed and reduce the coding loss of audio.
摘要翻译: 本发明公开了一种音频编码比特率控制和调整方法,包括以下步骤:获取当前音频帧的频谱,并计算每个树皮的最大绝对值(Bark:以临界频带为单位)频带 ; 计算最小比例因子阈值的初始值,并设置每个Bark带的比例因子; 用不同的比例因子缩放每个音频帧的频谱,对量化的频谱进行编码,并计算当前帧的编码比特; 确定当前帧的编码比特是否在比特的预期范围内,如果是,则比特流被格式化并输出,否则调整最小比例因子阈值并重复上述步骤直到满足要求。 这种方法可以显着提高编码速度,减少音频编码损耗。
-
-
-
-
-
-
-