-
公开(公告)号:US10477337B2
公开(公告)日:2019-11-12
申请号:US15110176
申请日:2015-01-06
申请人: SONY CORPORATION
发明人: Minoru Tsuji , Toru Chinen
摘要: The present technology relates to an audio processing device, a method therefor, and a program therefor capable of achieving more flexible audio reproduction.An input unit receives input of an assumed listening position of sound of an object, which is a sound source, and outputs assumed listening position information indicating the assumed listening position. A position information correction unit corrects position information of each object on the basis of the assumed listening position information to obtain corrected position information. A gain/frequency characteristic correction unit performs gain correction and frequency characteristic correction on a waveform signal of an object on the basis of the position information and the corrected position information. A spatial acoustic characteristic addition unit further adds a spatial acoustic characteristic to the waveform signal resulting from the gain correction and the frequency characteristic correction on the basis of the position information of the object and the assumed listening position information. The present technology is applicable to an audio processing device.
-
公开(公告)号:US10431229B2
公开(公告)日:2019-10-01
申请号:US15424741
申请日:2017-02-03
申请人: Sony Corporation
发明人: Mitsuyuki Hatanaka , Toru Chinen
IPC分类号: G10L19/00 , G10L19/02 , G10L19/008 , G10L19/028 , G10L19/093 , G10L21/038 , H03M7/40
摘要: A signal processing device, method, and program that may obtain audio at a higher audio quality when decoding an audio signal. An envelope information generating unit generates envelope information representing an envelope form of high frequency components of an audio signal to be encoded. A sine wave information generating unit extracts a sine wave signal from the high frequency components of the audio signal, and generates a sine wave information representing an emergence start position of the sine wave signal. An encoding stream generating unit multiplexes the envelope information, the sine wave information, and low frequency components of the audio signal that have been encoded, and outputs an encoding stream obtained as the result. The high frequency components included in the sine wave signal may be predicted at a higher accuracy from the envelope information and the sine wave information at the receiving side of the encoding stream.
-
33.
公开(公告)号:US20190180768A1
公开(公告)日:2019-06-13
申请号:US16276936
申请日:2019-02-15
申请人: Sony Corporation
发明人: Yuki Yamamoto , Toru Chinen , Hiroyuki Honma , Yuhki Mitsufuji
IPC分类号: G10L21/0388 , G10L19/02 , G10L21/038 , G10L19/16
摘要: The present invention relates to a signal processing apparatus and a signal processing method, an encoder and an encoding method, a decoder and a decoding method, and a program capable of reproducing music signal having a better sound quality by expansion of frequency band.An encoder sets an interval including 16 frames as interval section to be processed, outputs high band encoded data for obtaining the high band component of an input signal and low band encoded data obtained by encoding the low band signal of the input signal for each section to be processed. In this case, for each frame, a coefficient used in estimation of the high band component is selected and the section to be processed is divided into continuous frame segments including continuous frames from which the coefficient with the same section to be processed is selected. In addition, high band encoded data is produced which includes data including information indicating a length of each continuous frame segment, information indicating the number of continuous frame segments included in the section to be processed and a coefficient index indicating the coefficient selected in each continuous frame segment. The present invention is applicable to the encoder.
-
公开(公告)号:US10304466B2
公开(公告)日:2019-05-28
申请号:US15227024
申请日:2016-08-03
申请人: Sony Corporation
发明人: Mitsuyuki Hatanaka , Toru Chinen
IPC分类号: G10L19/008 , G10L19/16 , H04S3/00
摘要: The present technique relates to a decoding device, a decoding method, an encoding device, an encoding method, and a program which can obtain a high-quality realistic sound.The encoding device stores speaker arrangement information in a comment region in a PCE of an encoded bit stream and stores a synchronous word and identification information in the comment region such that other public comments and the speaker arrangement information stored in the comment region can be distinguished from each other. When an encoded bit stream is decoded, it is determined whether the speaker arrangement information is stored on the basis of the synchronous word and the identification information stored in the comment region. Audio data included in the encoded bit stream is output according to the arrangement of the speakers corresponding to the determination result. The present technique can be applied to an encoding device.
-
公开(公告)号:US20190149935A1
公开(公告)日:2019-05-16
申请号:US16248739
申请日:2019-01-15
申请人: Sony Corporation
发明人: Yuki Yamamoto , Toru Chinen , Runyu Shi , Mitsuyuki Hatanaka
CPC分类号: H04S5/005 , H03G3/301 , H04S7/30 , H04S7/302 , H04S2400/11
摘要: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image.A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.
-
公开(公告)号:US10224054B2
公开(公告)日:2019-03-05
申请号:US15584447
申请日:2017-05-02
申请人: Sony Corporation
发明人: Yuki Yamamoto , Toru Chinen , Hiroyuki Honma , Yuhki Mitsufuji
IPC分类号: G10L21/0388 , G10L19/16 , G10L19/02 , G10L21/038
摘要: Methods and apparatus for performing signal processing. The signal processing comprises demultiplexing input encoded data into data including information for a segment including frames and coefficient information for a coefficient selected in the frames of the segment, and low band encoded data, decoding the low band encoded data to produce a low band signal, selecting a coefficient of a frame to be processed from a plurality of the coefficients based on the data, calculating a high band sub-band power of a high band sub-band signal of each sub-band constituting a high band signal of the frame to be processed based on a low band sub-band signal of each sub-band constituting the low band signal of the frame to be processed and the selected coefficient, and producing the high band signal of the frame to be processed based on the high band sub-band power and the low band sub-band signal.
-
公开(公告)号:US20180286419A1
公开(公告)日:2018-10-04
申请号:US15772310
申请日:2016-10-26
申请人: Sony Corporation
发明人: Mitsuyuki Hatanaka , Toru Chinen , Minoru Tsuji , Hiroyuki Honma
IPC分类号: G10L19/12 , G10L19/032
摘要: The present disclosure relates to a decoding apparatus, a decoding method, and a program that can switch, as quickly as possible, a plurality of audio encoded bit streams with synchronized reproduction timing to thereby decode and output the plurality of audio encoded bit streams.An aspect of the present disclosure provides a decoding apparatus including: an acquisition unit that acquires a plurality of audio encoded bit streams; a selection unit that determines a boundary position for switching output of the plurality of audio encoded bit streams and that selectively supplies one of the plurality of acquired audio encoded bit streams to a decoding processing unit according to the boundary position; and the decoding processing unit that applies a decoding process including IMDCT processing to the one input through the selection unit, in which the decoding processing unit skips overlap-and-add in the IMDCT processing corresponding to each frame before and after the boundary position. The present disclosure can be applied to, for example, a reception apparatus, a reproduction apparatus, and the like.
-
公开(公告)号:US20180184222A1
公开(公告)日:2018-06-28
申请号:US15932368
申请日:2018-02-16
申请人: Sony Corporation
发明人: Yuki Yamamoto , Toru Chinen , Runyu Shi , Mitsuyuki Hatanaka
摘要: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image.A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.
-
公开(公告)号:US09998845B2
公开(公告)日:2018-06-12
申请号:US14905116
申请日:2014-07-11
申请人: Sony Corporation
发明人: Runyu Shi , Toru Chinen , Yuki Yamamoto , Mitsuyuki Hatanaka
CPC分类号: H04S7/303 , H04S5/005 , H04S2400/11 , H04S2400/13 , H04S2400/15
摘要: The present technology relates to an information processing device and method for allowing a sound image to be localized with higher precision, and a program. When a target sound image is outside a mesh, the target sound image is moved in a vertical direction while a position in a horizontal direction of the target sound image remains fixed, so that the target sound image is present on a boundary of the mesh. Specifically, a mesh detection unit detects a mesh including a position in the horizontal direction of the target sound image. A candidate position calculation unit calculates a position that is a movement target of the target sound image, based on loudspeaker positions that are at opposite ends of an arc of the detected mesh that is a destination, and the position in the horizontal direction of the target sound image. As a result, the target sound image can be moved onto a boundary of the mesh. The present technology is applicable to a sound processing device.
-
公开(公告)号:US09805729B2
公开(公告)日:2017-10-31
申请号:US14893909
申请日:2014-05-21
申请人: SONY CORPORATION
发明人: Runyu Shi , Yuki Yamamoto , Toru Chinen , Mitsuyuki Hatanaka
CPC分类号: G10L19/008 , G10L19/167 , G10L19/22 , H04S3/002 , H04S3/02 , H04S5/005 , H04S5/02 , H04S2400/01 , H04S2400/15 , H04S2420/01 , H04S2420/03
摘要: The present technique relates to an encoding device and a method, a decoding device and a method, and a program capable of obtaining higher quality audio. An encoding unit encodes position information and a gain of an object in a current frame in multiple encoding modes. A compressing unit generates, for each combination of encoding modes of each pieces of position information and gains, encoded meta data including encoding mode information indicating the encoding modes and encoded data which are the encoded position information and gains, and compresses the encoding mode information included in the encoding meta data. A determining unit selects encoded meta data of which amount of data is the least from among the encoded meta data generated for each combination, thus determining the encoding mode of each pieces of position information and gains. The present technique can be applied to an encoder and a decoder.
-
-
-
-
-
-
-
-
-