-
公开(公告)号:US11812197B2
公开(公告)日:2023-11-07
申请号:US17250541
申请日:2019-07-31
Applicant: SONY CORPORATION
Inventor: Takuto Motoyama , Yuki Yamamoto , Masahiko Toyoshi , Suguru Aoki
IPC: B60W30/095 , H04N7/18 , G06T7/50 , G06T7/70 , G06T7/40 , G06T11/00 , H04N5/38 , G06V20/58 , B60W60/00 , B60W50/08
CPC classification number: H04N7/183 , G06T7/40 , G06T7/50 , G06T7/70 , G06T11/001 , G06V20/58 , H04N5/38 , B60W50/08 , B60W60/001 , B60W2420/40 , G06T2207/30261
Abstract: Provided is an information processing device that includes an acquisition unit that acquires an image captured by an imaging unit, a recognition unit that recognizes attributes of an object shown in the image captured by the imaging unit, and a generation unit that generates a bird's-eye view map showing the attributes of the object on the basis of the image captured by the imaging unit and information on the attributes of the object recognized by the recognition unit.
-
公开(公告)号:US11574644B2
公开(公告)日:2023-02-07
申请号:US16606276
申请日:2018-04-12
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Minoru Tsuji
IPC: G10L19/20 , G10L25/51 , G10L19/008 , G10L19/02 , G10L25/78
Abstract: The present technology relates to a signal processing device and method, and a program making it possible to reduce the computational complexity of decoding at low cost.
A signal processing device includes: a priority information generation unit configured to generate priority information about an audio object on the basis of a plurality of elements expressing a feature of the audio object. The present technology may be applied to an encoding device and a decoding device.-
公开(公告)号:US11184579B2
公开(公告)日:2021-11-23
申请号:US16303331
申请日:2017-05-17
Applicant: Sony Corporation
Inventor: Hiroyuki Honma , Yuki Yamamoto
IPC: H04N9/802 , H04N5/92 , G10L19/008 , G10L21/0272 , H04N19/46 , G06K9/00 , H04R1/40 , G10L19/00 , H04R3/00 , G11B27/30
Abstract: The present technique relates to an apparatus and a method for video-audio processing, and a program each of which enables a desired object sound to be more simply and accurately separated. A video-audio processing apparatus includes a display control portion configured to cause a video object based on a video signal to be displayed; an object selecting portion configured to select the predetermined video object from the one video object or among a plurality of the video objects; and an extraction portion configured to extract an audio signal of the video object selected by the object selecting portion as an audio object signal.
-
公开(公告)号:US20210272576A1
公开(公告)日:2021-09-02
申请号:US17255191
申请日:2019-06-20
Applicant: Sony Corporation
Inventor: Mitsuyuki Hatanaka , Toru Chinen , Minoru Tsuji , Hiroyuki Honma , Yuki Yamamoto
IPC: G10L19/035 , G10L21/00
Abstract: The present technology relates to an information processing device and method, and a program capable of reducing a code amount.
The information processing device includes: an acquisition unit that acquires space information regarding a position and a size of a child space within a parent space and position information in the child space indicating a position of an object within the child space, the child space being included in the parent space, and the object being included in the child space; and a calculation unit that calculates position information in the parent space indicating a position of the object within the parent space on the basis of the space information and the position information in the child space. The present technology can be applied to a signal processing device.-
公开(公告)号:US20210204086A1
公开(公告)日:2021-07-01
申请号:US17200532
申请日:2021-03-12
Applicant: Sony Corporation
Inventor: Hiroyuki Honma , Yuki Yamamoto
Abstract: The present technology relates to a signal processing apparatus and method capable of reducing calculation loads, as well as a program.
A signal processing apparatus includes an ambisonic gain calculation unit configured to find, on the basis of spread information of an object, an ambisonic gain while the object is present at a predetermined position. The present technology is applicable to an encoder and a decoder.-
公开(公告)号:US20210118466A1
公开(公告)日:2021-04-22
申请号:US16606276
申请日:2018-04-12
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Minoru Tsuji
Abstract: The present technology relates to a signal processing device and method, and a program making it possible to reduce the computational complexity of decoding at low cost.
A signal processing device includes: a priority information generation unit configured to generate priority information about an audio object on the basis of a plurality of elements expressing a feature of the audio object. The present technology may be applied to an encoding device and a decoding device.-
公开(公告)号:US10692511B2
公开(公告)日:2020-06-23
申请号:US15106498
申请日:2014-12-12
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Hiroyuki Honma , Runyu Shi
Abstract: The present technology relates to a decoding apparatus, a decoding method and a program which make it possible to obtain sound with higher quality.A demultiplexing circuit demultiplexes an input code string into a gain code string and a signal code string. A signal decoding circuit decodes the signal code string to output a time series signal. A gain decoding circuit decodes the gain code string. That is, the gain decoding circuit reads out gain values and gain inclination values at predetermined gain sample positions of the time series signal and interpolation mode information. An interpolation processing unit obtains a gain value at each sample position between two gain sample positions through linear interpolation or non-linear interpolation according to the interpolation mode based on the gain values and the gain inclination values. A gain applying circuit adjusts a gain of the time series signal based on the gain values. The present technology can be applied to a decoding apparatus.
-
公开(公告)号:US10455345B2
公开(公告)日:2019-10-22
申请号:US15932368
申请日:2018-02-16
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen , Runyu Shi , Mitsuyuki Hatanaka
Abstract: The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image.A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.
-
公开(公告)号:US10236015B2
公开(公告)日:2019-03-19
申请号:US15684340
申请日:2017-08-23
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen
IPC: G10L21/0388 , G10L25/21 , G10L25/18 , G10L19/008 , G10L19/02
Abstract: The present invention relates to an encoding device and method, and a decoding device and method, and a program which enable music signals to be played with higher sound quality by expanding a frequency band.A band pass filter divides an input signal into multiple subband signals, a feature amount calculating circuit calculates feature amount using at least any one of the divided multiple subband signals and the input signal, a high-frequency subband power estimating circuit calculates an estimated value of high-frequency subband power based on the calculated feature amount, and a high-frequency signal generating circuit generates a high-frequency signal component based on the multiple subband signals divided by the band pass filter and the estimated value of the high-frequency subband power calculated by the high-frequency subband power estimating circuit. A frequency band expanding device expands the frequency band of the input signal using the high-frequency signal component generated by the high-frequency signal generating circuit. The present invention may be applied to a frequency band expanding device, encoding device, decoding device, and so forth, for example.
-
公开(公告)号:US10134418B2
公开(公告)日:2018-11-20
申请号:US14412037
申请日:2013-07-12
Applicant: Sony Corporation
Inventor: Yuki Yamamoto , Toru Chinen
IPC: H03G5/00 , G10L21/0388 , G10L25/18
Abstract: The present technique relates to a frequency band extension apparatus, a frequency band extension method, and a program which are configured to more easily obtain a high quality sound signal. An input signal may be divided into sub-band signals of a plurality of sub-bands, powers of high frequency sub-bands of the input signal may be estimated based on feature values extracted from the input signal to obtain high frequency sub-band power estimation values, the high frequency sub-band powers obtained from the sub-band signals of high-frequency sub-bands of the input signal may be compared with the high frequency sub-band power estimation values, and a high-frequency signal of the input signal may be generated based on a result of the comparison and the sub-band signals.
-
-
-
-
-
-
-
-
-