Signal processing apparatus and method, and program

    公开(公告)号:US11722832B2

    公开(公告)日:2023-08-08

    申请号:US16762304

    申请日:2018-10-31

    申请人: Sony Corporation

    IPC分类号: H04S7/00 H04S3/00

    摘要: The present technology relates to a signal processing apparatus and method, and a program that can easily determine a localization position of a sound image.
    A signal processing apparatus includes: an acquisition unit configured to acquire information associated with a localization position of a sound image of an audio object in a listening space specified in a state where the listening space viewed from a listening position is displayed; and a generation unit configured to generate a bit stream on the basis of the information associated with the localization position. The present technology can be applied to the signal processing apparatus.

    Sound processing apparatus and sound processing system

    公开(公告)号:US11146904B2

    公开(公告)日:2021-10-12

    申请号:US16444589

    申请日:2019-06-18

    申请人: Sony Corporation

    IPC分类号: H04S5/00 H03G3/30

    摘要: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image.
    A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.

    Audio coding/decoding method and apparatus using excess quantization information

    公开(公告)号:USRE48272E1

    公开(公告)日:2020-10-20

    申请号:US15434964

    申请日:2017-02-16

    申请人: Sony Corporation

    IPC分类号: H04B14/06 G10L19/032

    摘要: There is provided an audio coding device which appropriately sets the quantization bit number by a small calculation amount in each stage when coding an input audio signal by performing multi-stage normalization/quantization. A quantization information calculation section determines total quantization information idwl0, based on normalization information idsf, and allocates the total quantization information idwl0 for quantization information idwl1 and quantization information idwl2. At this time, the quantization information calculation section limits the quantization information idwl1 by a limiter lim1, and allocates the total quantization information idwl0 for quantization information idwl1. If the quantization information idwl1 exceeds the limiter lim1, the excess is allocated for the quantization information idwl2. A first normalization section and a first quantization section normalizes and quantizes a frequency spectrum mdspec1 in the first stage. A second normalization section and a second quantization section normalizes and quantizes a differential frequency spectrum mdspec2 in the second stage.

    ENCODING DEVICE AND METHOD, DECODING DEVICE AND METHOD, AND PROGRAM

    公开(公告)号:US20200265853A1

    公开(公告)日:2020-08-20

    申请号:US16651532

    申请日:2018-09-21

    申请人: Sony Corporation

    IPC分类号: G10L19/16 H03M7/30 H04L29/06

    摘要: The present technology relates to an encoding device and method, a decoding device and method, and a program, which are adapted to be capable of improving convenience.The decoding device is provided with: a decoding unit that decodes audio data including an object audio, the audio data being included in an encoded bit stream, and reads metadata of the object audio from an area in which arbitrary data of the encoded bit stream can be stored; and an output unit that outputs the decoded audio data on the basis of the metadata. The present technology can be applied to the decoding device.

    Decoding device, decoding method, encoding device, encoding method, and program

    公开(公告)号:US10140995B2

    公开(公告)日:2018-11-27

    申请号:US14239568

    申请日:2013-06-24

    申请人: Sony Corporation

    IPC分类号: G10L19/008 G10L19/16 H04S3/00

    摘要: The present technique relates to a decoding device, a decoding method, an encoding device, an encoding method, and a program which can obtain a high-quality realistic sound.The encoding device stores speaker arrangement information in a comment region in a PCE of an encoded bit stream and stores a synchronous word and identification information in the comment region such that other public comments and the speaker arrangement information stored in the comment region can be distinguished from each other. When an encoded bit stream is decoded, it is determined whether the speaker arrangement information is stored on the basis of the synchronous word and the identification information stored in the comment region. Audio data included in the encoded bit stream is output according to the arrangement of the speakers corresponding to the determination result. The present technique can be applied to an encoding device.

    Decoding device, decoding method, encoding device, encoding method, and program

    公开(公告)号:US10083700B2

    公开(公告)日:2018-09-25

    申请号:US14238243

    申请日:2013-06-24

    申请人: Sony Corporation

    IPC分类号: G10L19/008 H04S3/00

    摘要: The present technique relates to a decoding device, a decoding method, an encoding device, an encoding method, and a program which can obtain a high-quality realistic sound.The encoding device stores speaker arrangement information in a comment region in a PCE of an encoded bit stream and stores a synchronous word and identification information in the comment region such that other public comments and the speaker arrangement information stored in the comment region can be distinguished from each other. When an encoded bit stream is decoded, it is determined whether the speaker arrangement information is stored on the basis of the synchronous word and the identification information stored in the comment region. Audio data included in the encoded bit stream is output according to the arrangement of the speakers corresponding to the determination result. The present technique can be applied to an encoding device.

    Encoding device and method, decoding device and method, and program

    公开(公告)号:US09875746B2

    公开(公告)日:2018-01-23

    申请号:US14917825

    申请日:2014-09-05

    申请人: Sony Corporation

    IPC分类号: G10L19/008 G10L19/16 H04S3/00

    摘要: The present invention pertains to an encoding device and method, a decoding device and method, and to a program, with which sound of an appropriate volume level can be obtained with a smaller quantity of codes. A first gain calculation circuit calculates a first gain for volume level correction of an input time series signal, and a second gain calculation circuit calculates a second gain for volume level correction of a downmixed signal obtained by downmixing of the input time series signal. A gain encoding circuit computes the gain differential between the first gain and the second gain, the gain differential between time frames, and the gain differential within time frames, and encodes the first gain and the second gain. The present invention can be applied in encoding devices and decoding devices.