-
公开(公告)号:US10375439B2
公开(公告)日:2019-08-06
申请号:US15311950
申请日:2015-05-22
Applicant: SONY CORPORATION
Inventor: Mitsuhiro Hirabayashi , Toru Chinen , Yuki Yamamoto , Runyu Shi
IPC: G06F17/00 , H04N21/439 , G11B27/00 , G10K15/02 , G10L19/00 , G10L19/008 , H04N21/233 , H04N21/2343 , H04N21/81 , H04N21/845
Abstract: The present disclosure relates to an information processing apparatus and an information processing method which are capable of improving an efficiency of acquiring a predetermined type of audio data among a plurality of types of audio data. Audio data of a predetermined track is acquired in one audio file in which audio data of 3D audio is divided into a plurality of tracks depending on the type of 3D audio and the tracks are arranged, the audio data of each track being successively arranged in the file for a predetermined length of time. The present disclosure is applicable to, for example, an information processing system including a file generation device that generates a file, a Web server that records a file generated by the file generation device, and a video playback terminal that plays back a file.
-
公开(公告)号:US20190164558A1
公开(公告)日:2019-05-30
申请号:US16263356
申请日:2019-01-31
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Mitsuyuki Hatanaka
IPC: G10L19/02 , G10L19/26 , G10L21/038
Abstract: A method, system, and computer program product for processing an encoded audio signal is described. In one exemplary embodiment, the system receives an encoded low-frequency range signal and encoded energy information used to frequency shift the encoded low-frequency range signal. The low-frequency range signal is decoded and an energy depression of the decoded signal is smoothed. The smoothed low-frequency range signal is frequency shifted to generate a high-frequency range signal. The low-frequency range signal and high-frequency range signal are then combined and outputted.
-
公开(公告)号:US10171926B2
公开(公告)日:2019-01-01
申请号:US14785416
申请日:2014-04-11
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Runyu Shi , Mitsuyuki Hatanaka
Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image.A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.
-
94.
公开(公告)号:US20180330746A1
公开(公告)日:2018-11-15
申请号:US16046070
申请日:2018-07-26
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Hiroyuki Honma , Yuhki Mitsufuji
IPC: G10L21/0388 , G10L19/02 , G10L19/16 , G10L21/038
CPC classification number: G10L21/0388 , G10L19/0204 , G10L19/0208 , G10L19/167 , G10L21/038
Abstract: Methods and apparatus for performing signal processing. The signal processing comprises demultiplexing input encoded data into data including information for a segment including frames and coefficient information for a coefficient selected in the frames of the segment, and low band encoded data, decoding the low band encoded data to produce a low band signal, selecting a coefficient of a frame to be processed from a plurality of the coefficients based on the data, calculating a high band sub-band power of a high band sub-band signal of each sub-band constituting a high band signal of the frame to be processed based on a low band sub-band signal of each sub-band constituting the low band signal of the frame to be processed and the selected coefficient, and producing the high band signal of the frame to be processed based on the high band sub-band power and the low band sub-band signal.
-
95.
公开(公告)号:US20180315436A1
公开(公告)日:2018-11-01
申请号:US15735630
申请日:2016-06-03
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Minoru Tsuji
IPC: G10L19/16 , G10L19/008
CPC classification number: G10L19/008 , H04S5/02 , H04S7/00
Abstract: The present technology relates to an encoding apparatus, an encoding method, a decoding apparatus, a decoding method, and a program for obtaining sound of higher quality.An audio signal decoding section decodes encoded audio data to acquire an audio signal of each object. A metadata decoding section decodes encoded metadata to acquire a plurality of metadata about each object in each frame of the audio signal. A gain calculating section calculates VBAP gains of each object in the audio signal for each speaker based on the metadata. An audio signal generating section generates an audio signal to be fed to each speaker by having the audio signal of each object multiplied by the corresponding VBAP gain and by adding up the multiplied audio signals. The present technology may be applied to decoding apparatuses.
-
公开(公告)号:US20180242030A1
公开(公告)日:2018-08-23
申请号:US15516537
申请日:2015-09-28
Applicant: Sony Corporation
Inventor: Minoru Tsuji , Toru Chinen , Runyu Shi , Masayuki Nishiguchi , Yuki Yamamoto
IPC: H04N21/2343 , H04N19/157 , H04N19/80 , H04N19/156
Abstract: The present technology relates to an encoding device, an encoding method, a reproduction device, a reproduction method, and a program enabling each reproduction equipment to reproduce an appropriate content in a simplified manner. A content data decoding unit decodes encoded metadata and outputs zoom area information, which is included in metadata acquired as a result thereof, designating an area to be zoomed. A zoom area selecting unit selects one or a plurality of pieces of zoom area information from among the zoom area information. A video segmenting unit segments a zoom area represented by the selected zoom area information in a video based on video data and outputs zoom video data acquired as a result thereof. An audio converting unit performs an audio converting process according to the selected zoom area information for audio data and outputs zoom audio data acquired as a result thereof. The present technology can be applied to a reproduction device.
-
公开(公告)号:US20180160250A1
公开(公告)日:2018-06-07
申请号:US15737026
申请日:2016-06-09
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Minoru Tsuji
Abstract: The present technology relates to an audio processing apparatus and method and a program that make it possible to obtain sound of higher quality.An acquisition unit acquires an audio signal and metadata of an object. A vector calculation unit calculates, based on a horizontal direction angle and a vertical direction angle included in the metadata of the object and indicative of an extent of a sound image, a spread vector indicative of a position in a region indicative of the extent of the sound image. A gain calculation unit calculates, based on the spread vector, a VBAP gain of the audio signal in regard to each speaker by VBAP. The present technology can be applied to an audio processing apparatus.
-
公开(公告)号:US09922660B2
公开(公告)日:2018-03-20
申请号:US15034947
申请日:2014-11-17
Applicant: SONY CORPORATION
Inventor: Yuki Yamamoto , Toru Chinen
CPC classification number: G10L19/24 , G10L19/0204 , G10L19/26 , G10L21/038 , G10L25/18
Abstract: The present technology relates to a device, a method, and a program for expanding a frequency band, which are capable of obtaining high-quality sound with a small processing amount. A low band extraction band-pass filter processing unit passes a predetermined band of a low band of an input signal and generates a low band sub band signal. A band-pass filter calculation circuit calculates band-pass filter coefficients of band-pass filters having sub bands of high bands as a pass band based on an estimate value of high band sub band power, and an addition unit obtains one filter coefficient by adding the band-pass filter coefficients. A poly-phase configuration level adjustment filter performs up-sampling and level adjustment by performing filtering on a flattened signal obtained from a low band sub band signal using the filter coefficient obtained by the addition unit, and generates a high band signal. An addition unit obtains an output signal by adding the high band signal to the low band signal. The present technology can be applied to a frequency band expanding device.
-
公开(公告)号:US20170352365A1
公开(公告)日:2017-12-07
申请号:US15684340
申请日:2017-08-23
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen
IPC: G10L21/0388 , G10L25/18 , G10L25/21 , G10L19/008 , G10L19/02
CPC classification number: G10L21/0388 , G10L19/008 , G10L19/0208 , G10L25/18 , G10L25/21
Abstract: The present invention relates to an encoding device and method, and a decoding device and method, and a program which enable music signals to be played with higher sound quality by expanding a frequency band.A band pass filter divides an input signal into multiple subband signals, a feature amount calculating circuit calculates feature amount using at least any one of the divided multiple subband signals and the input signal, a high-frequency subband power estimating circuit calculates an estimated value of high-frequency subband power based on the calculated feature amount, and a high-frequency signal generating circuit generates a high-frequency signal component based on the multiple subband signals divided by the band pass filter and the estimated value of the high-frequency subband power calculated by the high-frequency subband power estimating circuit. A frequency band expanding device expands the frequency band of the input signal using the high-frequency signal component generated by the high-frequency signal generating circuit. The present invention may be applied to a frequency band expanding device, encoding device, decoding device, and so forth, for example.
-
公开(公告)号:US20170245086A1
公开(公告)日:2017-08-24
申请号:US15591471
申请日:2017-05-10
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Runyu Shi , Mitsuyuki Hatanaka
CPC classification number: H04S5/005 , H03G3/301 , H04S7/30 , H04S7/302 , H04S2400/11
Abstract: The present technology relates to a sound processing apparatus and method, and a program for enabling more stable localization of a sound image.A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.
-
-
-
-
-
-
-
-
-