TRANSMISSION APPARATUS, TRANSMISSION METHOD, RECEPTION APPARATUS, AND RECEPTION METHOD

    Publication No.: US20180103082A1

    Publication date: 2018-04-12

    Application No.: US15568899

    Filing date: 2016-05-10

    CPC classification number: H04L65/607 H04N21/233 H04N21/236 H04N21/436

    Abstract: The aim is to enable a receiving side to obtain predetermined information easily and appropriately in a case where the predetermined information is divided across a predetermined number of audio frames and transmitted. The predetermined information is inserted into an audio compressed data stream. The audio compressed data stream into which the predetermined information is inserted is transmitted. Each of the pieces of divided information obtained by dividing the predetermined information can be inserted into the predetermined number of audio frames of the audio compressed data stream. Information indicating the overall size of the predetermined information is added to the first piece of divided information. On the basis of this overall size, space for storing the predetermined information can be reserved in a storage medium at the time point where the first piece of divided information is obtained.
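The divide-and-reassemble idea in this abstract can be illustrated with a minimal Python sketch. The 4-byte big-endian size header, the even split, and the names `split_info` and `Reassembler` are assumptions made for illustration; the abstract does not specify a field layout.

```python
import struct


def split_info(payload, num_frames):
    """Split payload into num_frames pieces, one per audio frame.

    Hypothetical layout: only the first piece carries a 4-byte header
    holding the overall size of the undivided payload.
    """
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    chunk = -(-len(payload) // num_frames)  # ceiling division
    pieces = [payload[i * chunk:(i + 1) * chunk] for i in range(num_frames)]
    pieces[0] = struct.pack(">I", len(payload)) + pieces[0]
    return pieces


class Reassembler:
    """Receiver side: reserves storage as soon as the first piece arrives."""

    def __init__(self):
        self.buffer = None
        self.offset = 0

    def push(self, piece, first):
        """Store one piece; return the full payload once complete, else None."""
        if first:
            (total,) = struct.unpack(">I", piece[:4])
            self.buffer = bytearray(total)  # space reserved from the size field
            self.offset = 0
            piece = piece[4:]
        if self.buffer is None:
            raise RuntimeError("first piece has not been received yet")
        end = self.offset + len(piece)
        if end > len(self.buffer):
            raise ValueError("received more data than the announced size")
        self.buffer[self.offset:end] = piece
        self.offset = end
        return bytes(self.buffer) if self.offset == len(self.buffer) else None
```

Because the size travels with the first piece, the receiver can allocate once up front instead of growing a buffer as later frames arrive.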

    INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD

    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20160156944A1

    Publication date: 2016-06-02

    Application No.: US14904232

    Filing date: 2014-07-01

    Abstract: The present disclosure relates to an information processing device and information processing method capable of recognizing an acquisition position of voice data on an image. A web server transmits image frame size information indicating image frame size of image data and audio position information indicating acquisition position of voice data. The present disclosure is applicable to an information processing system or other like system including file generation device, web server, and video playback terminal to perform tiled streaming using a manner compliant with moving picture experts group phase-dynamic adaptive streaming over HTTP (MPEG-DASH).


    SIGNAL PROCESSING APPARATUS AND METHOD, PROGRAM, AND DATA RECORDING MEDIUM

    Type: Patent application
    Status: Granted

    Publication No.: US20150049874A1

    Publication date: 2015-02-19

    Application No.: US14528205

    Filing date: 2014-10-30

    CPC classification number: H03G1/0005 H03G1/00 H03G7/002 H03G7/007 H04R2430/01

    Abstract: The present invention relates to a signal processing apparatus and method, a program, and a data recording medium configured such that the playback level of an audio signal can be easily and effectively enhanced without requiring prior analysis. An analyzer 21 generates mapping control information in the form of the root mean square of samples in a given segment of a supplied audio signal. A mapping processor 22 takes a nonlinear function determined by the mapping control information taken as a mapping function, and conducts amplitude conversion on a supplied audio signal using the mapping function. In this way, by conducting amplitude conversion of an audio signal using a nonlinear function that changes according to the characteristics in respective segments of an audio signal, the playback level of an audio signal can be easily and effectively enhanced without requiring prior analysis. The present invention may be applied to portable playback apparatus.
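The analyzer and mapping processor described in this abstract can be sketched roughly as follows. The abstract only says that the mapping function is a nonlinear function determined by the segment RMS; the power-law curve, the exponent range of 0.5 to 1.0, and the segment length used here are hypothetical choices for illustration.

```python
import math


def rms(segment):
    """Analyzer: root mean square of the samples in one segment."""
    return math.sqrt(sum(x * x for x in segment) / len(segment))


def mapping_function(control):
    """Build a nonlinear amplitude mapping from the mapping control value.

    Assumed form: sign(x) * |x| ** alpha with alpha in [0.5, 1.0].
    Quiet segments (small RMS) get a smaller alpha and so a stronger boost.
    """
    alpha = min(1.0, max(0.5, 0.5 + control))

    def f(x):
        return math.copysign(abs(x) ** alpha, x)

    return f


def enhance(signal, segment_len=4):
    """Mapping processor: apply a per-segment mapping to samples in [-1, 1]."""
    out = []
    for start in range(0, len(signal), segment_len):
        segment = signal[start:start + segment_len]
        f = mapping_function(rms(segment))
        out.extend(f(x) for x in segment)
    return out
```

Each segment is processed using only its own samples, which mirrors the abstract's claim that no prior analysis of the whole signal is required.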


    VOICE PROCESSING APPARATUS, METHOD AND PROGRAM

    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20130191124A1

    Publication date: 2013-07-25

    Application No.: US13722117

    Filing date: 2012-12-20

    CPC classification number: G10L15/02 G10L21/034

    Abstract: Provided is a voice processing apparatus including a feature quantity calculation section extracting a feature quantity from a target frame of an input voice signal, a sound pressure estimation candidate point updating section making each frame of the input voice signal a sound pressure estimation candidate point, retaining the feature quantity of each sound pressure estimation candidate point, and updating the sound pressure estimation candidate point based on the feature quantity of the sound pressure estimation candidate point and the feature quantity of the target frame, a sound pressure estimation section calculating an estimated sound pressure of the input voice signal, based on the feature quantity of the sound pressure estimation candidate point, a gain calculation section calculating a gain applied to the input voice signal based on the estimated sound pressure, and a gain application section performing a gain adjustment of the input voice signal based on the gain.


    NEURAL NETWORK DEVICE

    Type: Patent application

    Publication No.: US20210312231A1

    Publication date: 2021-10-07

    Application No.: US17250777

    Filing date: 2019-08-28

    Abstract: The present technology relates to a neural network device capable of improving recognition performance. The neural network device includes a non-linear transformation layer processing unit that performs a transformation with a non-linear function having a learnable parameter. The present technology can be applied to a neural network.
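A non-linear function with a learnable parameter can take many forms, and the abstract does not say which one is used. As a stand-in, the sketch below uses a parametric ReLU, whose negative-side slope `a` is learned by gradient descent. The function names, learning rate, and squared-error loss are illustrative assumptions.

```python
def prelu(x, a):
    """Parametric ReLU: identity for positive inputs, slope a otherwise."""
    return x if x > 0 else a * x


def prelu_grad_a(x):
    """Derivative of prelu(x, a) with respect to the learnable slope a."""
    return 0.0 if x > 0 else x


def fit_slope(xs, targets, a=0.0, lr=0.1, steps=200):
    """Learn the slope a by gradient descent on the mean squared error."""
    for _ in range(steps):
        grad = 0.0
        for x, t in zip(xs, targets):
            err = prelu(x, a) - t
            grad += 2.0 * err * prelu_grad_a(x)
        a -= lr * grad / len(xs)
    return a
```

In a full network the slope would be updated by backpropagation together with the weights; here it is fitted in isolation to keep the example short.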

    INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM

    Publication No.: US20180196635A1

    Publication date: 2018-07-12

    Application No.: US15741848

    Filing date: 2016-07-19

    Abstract: A device and method capable of performing image following type audio control or image non-following type audio control are implemented. Images in different directions are selectively displayed on the display unit, and an output audio is controlled in accordance with an image display. A data processing unit executes image following type audio control of moving an audio source direction in accordance with movement of the display image of the display unit and image non-following type audio control of not moving the audio source direction in accordance with the movement of an image in units of individual controllable audio elements. The data processing unit acquires audio control information from an MP4 file or a media presentation description (MPD) file and executes either the image following type audio control or the image non-following type audio control in accordance with the acquired audio control information in units of individual controllable audio elements.
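The two control types can be contrasted per audio element with a small Python sketch. The `AudioElement` fields, the degree-based azimuth convention, and the function name are hypothetical; in the abstract the per-element control information comes from an MP4 or MPD file, which is represented here by a simple boolean flag.

```python
from dataclasses import dataclass


@dataclass
class AudioElement:
    name: str
    azimuth_deg: float   # source direction in the content's own coordinates
    follows_image: bool  # stand-in for the per-element audio control information


def rendered_azimuth(element, view_yaw_deg):
    """Direction of the source relative to the listener for the current view.

    Image following: the source moves with the displayed image, so the view
    rotation is subtracted. Image non-following: the direction is unchanged.
    """
    if element.follows_image:
        # Wrap the result into the range [-180, 180).
        return (element.azimuth_deg - view_yaw_deg + 180.0) % 360.0 - 180.0
    return element.azimuth_deg
```

For example, a dialogue source attached to an on-screen speaker would set `follows_image=True`, while narration or background music would typically leave it `False`.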

    INFORMATION PROCESSING APPARATUS AND METHOD, AND PROGRAM

    Publication No.: US20210176582A1

    Publication date: 2021-06-10

    Application No.: US17045450

    Filing date: 2019-03-29

    Abstract: The present technology relates to an information processing apparatus, an information processing method, and a program for achieving reduction of a processing load on a distribution side along with reduction of a transfer volume of information. The information processing apparatus includes an acquisition unit that acquires low accuracy position information having first accuracy and indicating a position of an object within a space where a user is located and acquires additional information for obtaining position information that has second accuracy higher than the first accuracy, indicates the position of the object within the space, and corresponds to a position of the user and a position information calculation unit that obtains the position information on the basis of the low accuracy position information and the additional information. The present technology is applicable to an information processing apparatus.
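One way to picture the relationship between low accuracy position information and additional information is a fixed-point split, sketched below in Python. The bit widths, the normalized coordinate range, and the function names are assumptions for illustration. The sketch also leaves out the part of the abstract where the additional information depends on the user's position.

```python
FINE_BITS = 16    # assumed "second accuracy" (higher)
COARSE_BITS = 8   # assumed "first accuracy" (lower)
SHIFT = FINE_BITS - COARSE_BITS


def split_position(value):
    """Quantize a coordinate in [0, 1) into (coarse code, additional bits)."""
    if not 0.0 <= value < 1.0:
        raise ValueError("value must be in [0, 1)")
    fine = int(value * (1 << FINE_BITS))
    return fine >> SHIFT, fine & ((1 << SHIFT) - 1)


def low_accuracy(coarse):
    """Position recovered from the low accuracy information alone."""
    return coarse / (1 << COARSE_BITS)


def refine(coarse, extra):
    """Position recovered from low accuracy plus additional information."""
    return ((coarse << SHIFT) | extra) / (1 << FINE_BITS)
```

A distribution side could send the coarse code for every object and the additional bits only where higher accuracy is needed, which is where the reduction in transfer volume would come from under these assumptions.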

    FREQUENCY BAND EXTENSION APPARATUS, FREQUENCY BAND EXTENSION METHOD, AND PROGRAM

    Publication No.: US20190043523A1

    Publication date: 2019-02-07

    Application No.: US16158500

    Filing date: 2018-10-12

    CPC classification number: G10L21/0388 G10L25/18

    Abstract: The present technique relates to a frequency band extension apparatus, a frequency band extension method, and a program which are configured to more easily obtain a high quality sound signal. An input signal may be divided into sub-band signals of a plurality of sub-bands, powers of high frequency sub-bands of the input signal may be estimated based on feature values extracted from the input signal to obtain high frequency sub-band power estimation values, the high frequency sub-band powers obtained from the sub-band signals of high-frequency sub-bands of the input signal may be compared with the high frequency sub-band power estimation values, and a high-frequency signal of the input signal may be generated based on a result of the comparison and the sub-band signals.

    ENCODING DEVICE AND METHOD, DECODING DEVICE AND METHOD, AND PROGRAM

    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20160133260A1

    Publication date: 2016-05-12

    Application No.: US14893896

    Filing date: 2014-05-21

    Abstract: The present technology relates to an encoding device and method, a decoding device and method, and a program therefor capable of improving audio signal transmission efficiency. An identification information generation unit determines whether or not an audio signal is to be encoded on the basis of the audio signal, and generates identification information indicating the determination result. An encoding unit encodes only audio signals determined to be encoded. A packing unit generates a bit stream containing the identification information and encoded audio signals. As a result of storing only encoded audio signals in the bit stream and storing the identification information indicating whether or not the respective audio signals are to be encoded in the bit stream in this manner, the transmission efficiency of audio signals can be improved. The present technology can be applied to an encoder and a decoder.
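The roles of the identification information, the encoding unit, and the packing unit can be sketched in Python as follows. The silence test, the one-byte-per-channel flags, and the raw 16-bit sample packing that stands in for a real audio codec are all assumptions; the abstract does not define the bit stream syntax.

```python
import struct


def should_encode(samples, threshold=0):
    """Identification information: encode only if some sample exceeds threshold."""
    return any(abs(s) > threshold for s in samples)


def pack(channels):
    """Pack a channel count, one flag per channel, then only flagged channels."""
    flags = bytes(1 if should_encode(ch) else 0 for ch in channels)
    body = b""
    for flag, ch in zip(flags, channels):
        if flag:
            # Stand-in "encoding": sample count followed by 16-bit samples.
            body += struct.pack(f">H{len(ch)}h", len(ch), *ch)
    return struct.pack(">B", len(channels)) + flags + body


def unpack(stream, frame_len):
    """Decode the stream; channels that were not encoded are filled with silence."""
    n = stream[0]
    flags = stream[1:1 + n]
    pos = 1 + n
    channels = []
    for flag in flags:
        if flag:
            (count,) = struct.unpack_from(">H", stream, pos)
            pos += 2
            channels.append(list(struct.unpack_from(f">{count}h", stream, pos)))
            pos += 2 * count
        else:
            channels.append([0] * frame_len)
    return channels
```

With three channels of three samples each, one of them silent, the stream shrinks from 28 to 20 bytes in this layout, because the silent channel costs only its flag byte.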

