Method and system for optimal video transcoding based on utility function descriptors
    11.
    Granted patent
    Status: In force

    Publication No.: US08218617B2

    Publication date: 2012-07-10

    Application No.: US10965040

    Filing date: 2004-10-14

    IPC classification: H04N7/12 H04N11/02 H04N11/04

    Abstract: Techniques for generating utility-based descriptors from compressed multimedia information are disclosed. A preferred method includes the steps of receiving at least a segment of compressed multimedia information; determining two or more portions of utility-based descriptor information based on one or more adaptation operations, each corresponding to a unique target rate; adapting the compressed multimedia segment by each of the portions of utility-based descriptor information to generate adapted multimedia segments; using a quality management method to generate a quality measurement for each adapted multimedia segment; and generating a utility-based descriptor based on the portions of utility-based descriptor information and the corresponding quality measurements.
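    The steps in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the segment is modeled as a list of decoded sample values rather than a compressed bitstream, requantization stands in for the unspecified adaptation operations, PSNR stands in for the unspecified quality measure, and the names `UtilityPoint`, `requantize`, and `build_descriptor` and the rate values are all hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Samples = Sequence[float]
# (operation name, target rate in kbit/s, adaptation function): all hypothetical
Operation = Tuple[str, float, Callable[[Samples], List[float]]]


@dataclass
class UtilityPoint:
    """One entry of the descriptor: an operation, its target rate, its quality."""
    operation: str
    target_rate: float
    quality: float


def psnr(reference: Samples, adapted: Samples, peak: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB (assumed quality measure)."""
    mse = sum((r - a) ** 2 for r, a in zip(reference, adapted)) / len(reference)
    return math.inf if mse == 0 else 10 * math.log10(peak ** 2 / mse)


def requantize(step: int) -> Callable[[Samples], List[float]]:
    """Stand-in adaptation operation: a coarser step models a lower rate."""
    def adapt(samples: Samples) -> List[float]:
        return [round(s / step) * step for s in samples]
    return adapt


def build_descriptor(segment: Samples, operations: List[Operation]) -> List[UtilityPoint]:
    """Adapt the segment once per operation, measure quality, collect the points."""
    points = [
        UtilityPoint(name, rate, psnr(segment, adapt(segment)))
        for name, rate, adapt in operations
    ]
    # Sorting by rate gives a rate-versus-quality (utility) curve.
    return sorted(points, key=lambda p: p.target_rate)
```

    A transcoder could then consult such a list to pick the operation with the best measured quality at or below an available bandwidth.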


    APPARATUS AND METHOD FOR GENERATING AN MPEG-2 TRANSPORT PACKET HAVING A VARIABLE PACKET LENGTH
    12.
    Patent application
    Status: Lapsed

    Publication No.: US20120120969A1

    Publication date: 2012-05-17

    Application No.: US13380656

    Filing date: 2010-06-18

    IPC classification: H04L29/00

    CPC classification: H04N21/236 H04N21/434

    Abstract: Provided is a transport packet generating apparatus that generates a transport packet having a variable length, where the length of the transport packet is indicated by a field included in a header of the transport packet or in a synchronization area of the transport packet. Also provided is a transport packet depacketizing apparatus that depacketizes the variable-length transport packet, either by decoding the field indicating the length of the transport packet or by detecting a starting point of the transport packet based on a predetermined rule with respect to the synchronization area, in order to decode the transport packet.
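    The header-field variant described in this abstract can be illustrated with a short sketch. The layout below is an assumption made for illustration only: a one-byte sync value followed by a 16-bit big-endian field holding the total packet length. The abstract does not specify field widths or positions, and `packetize` and `depacketize` are hypothetical names.

```python
import struct
from typing import List

SYNC_BYTE = 0x47                # sync value of standard MPEG-2 TS packets, reused here by assumption
HEADER = struct.Struct(">BH")   # hypothetical header: sync byte + 16-bit total packet length


def packetize(payload: bytes) -> bytes:
    """Build one variable-length packet whose header carries its own length."""
    total = HEADER.size + len(payload)
    if total > 0xFFFF:
        raise ValueError("packet too long for a 16-bit length field")
    return HEADER.pack(SYNC_BYTE, total) + payload


def depacketize(stream: bytes) -> List[bytes]:
    """Split a byte stream into payloads by decoding each packet's length field."""
    payloads = []
    offset = 0
    while offset < len(stream):
        if len(stream) - offset < HEADER.size:
            raise ValueError("truncated header")
        sync, total = HEADER.unpack_from(stream, offset)
        if sync != SYNC_BYTE:
            raise ValueError(f"bad sync byte at offset {offset}")
        if total < HEADER.size or offset + total > len(stream):
            raise ValueError("invalid or truncated packet")
        payloads.append(stream[offset + HEADER.size:offset + total])
        offset += total  # the length field tells us where the next packet starts
    return payloads
```

    The alternative named in the abstract, locating packet starts from a rule on the synchronization area, is not shown here because the abstract does not describe that rule.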


    APPARATUS FOR ENCODING AND DECODING OF INTEGRATED SPEECH AND AUDIO
    14.
    Patent application
    Status: In force

    Publication No.: US20110119055A1

    Publication date: 2011-05-19

    Application No.: US13003979

    Filing date: 2009-07-14

    IPC classification: G10L19/14

    Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and an audio signal. The apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to downmix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristic signal; an audio signal encoder to encode the input signal using an audio encoding module when the input signal is an audio characteristic signal; and a bitstream generator to generate a bitstream.


    3D audio signal processing system using rigid sphere and method thereof
    17.
    Patent application
    Status: Lapsed

    Publication No.: US20050141723A1

    Publication date: 2005-06-30

    Application No.: US10972029

    Filing date: 2004-10-22

    Abstract: Provided are a three-dimensional audio signal processing system using a rigid sphere and a method thereof. The three-dimensional audio signal processing system of the present research simplifies the shape of a human head into a rigid sphere, acquires three-dimensional audio signals through microphones set up on the rigid sphere, and applies the acquired three-dimensional audio signals to diverse existing reproduction systems. The system includes a three-dimensional audio signal acquiring unit for acquiring audio signals by using a predetermined number of microphones set up on the rigid sphere; and a three-dimensional audio signal post-processing unit for converting the acquired audio signals for reproduction in diverse reproduction environments such as five-channel, four-channel, headphone, stereo, and stereo dipole reproduction environments.


    Apparatus and method for reproducing a sound field with a loudspeaker array controlled via a control volume
    18.
    Granted patent
    Status: In force

    Publication No.: US09124996B2

    Publication date: 2015-09-01

    Application No.: US13122252

    Filing date: 2009-10-01

    IPC classification: H04R5/00 H04S3/00

    Abstract: A method, and an apparatus for implementing the method, comprising determining control signal data for an array of loudspeakers, the control signal data being such as to control the loudspeakers to produce a desired sound field associated with an audio signal. The method comprises determining control signal data for different frequency components of the desired sound field in respect of respective different positions in a listening volume of the loudspeaker array, wherein determination of the control signal data comprises sampling the desired sound field at the surface of a control volume (V).


    Apparatus for encoding and decoding of integrated speech and audio
    19.
    Granted patent
    Status: In force

    Publication No.: US08903720B2

    Publication date: 2014-12-02

    Application No.: US13003979

    Filing date: 2009-07-14

    Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and an audio signal. The apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to downmix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristic signal; an audio signal encoder to encode the input signal using an audio encoding module when the input signal is an audio characteristic signal; and a bitstream generator to generate a bitstream.


    Apparatus for generating and playing object based audio contents
    20.
    Granted patent
    Status: In force

    Publication No.: US08351612B2

    Publication date: 2013-01-08

    Application No.: US12628317

    Filing date: 2009-12-01

    IPC classification: H04R5/00 H04R5/02 G10L19/00

    Abstract: Disclosed is an object-based audio contents generating/playing apparatus. The apparatus may include an object audio signal obtaining unit to obtain a plurality of object audio signals by recording a plurality of sound source signals; a recording space information obtaining unit to obtain recording space information with respect to a recording space of the plurality of sound source signals; a sound source location information obtaining unit to obtain sound location information of the plurality of sound source signals; and an encoding unit to generate object-based audio contents by encoding at least one of the plurality of object audio signals, the recording space information, and the sound source location information. This enables the object-based audio contents to be played using at least one of a WFS scheme and a multi-channel surround scheme, regardless of the reproducing environment of the audience.
