-
公开(公告)号:US20210264934A1
公开(公告)日:2021-08-26
申请号:US16766284
申请日:2018-09-18
Applicant: SONY CORPORATION
Inventor: KAZUYA TATEISHI , SHUSUKE TAKAHASHI , AKIRA TAKAHASHI , KAZUKI OCHIAI
IPC: G10L21/0232 , G10L25/51 , H04R29/00 , H04R1/08 , G06N20/00
Abstract: It is desirable to provide an echo cancellation technique that enables an environmental change to be flexibly addressed. Provided is a signal processing apparatus including: an echo cancellation unit that learns an estimated transfer characteristic in a space through which a signal reproduced by a speaker is input to a microphone, and performs echo cancellation on the basis of the estimated transfer characteristic learned; and an environmental change detection unit that detects an environmental change, in which the echo cancellation unit learns the estimated transfer characteristic by causing the speaker to reproduce a sound for learning on the basis of detection of the environmental change.
-
公开(公告)号:US20210195324A1
公开(公告)日:2021-06-24
申请号:US16640137
申请日:2018-07-06
Applicant: SONY CORPORATION
Inventor: KAZUYA TATEISHI
Abstract: Provided are an audio processing device, an audio processing method, an information processing device, and a computer program that perform echo cancellation corresponding to double talk. The audio processing device includes an estimation unit that estimates a filter representing a transmission characteristic from a speaker where a reference signal is output to a microphone in which the reference signal sneaks, an adjustment unit that adjusts a step size on the basis of a filter update coefficient estimated by the estimation unit, and an update unit that updates the filter according to the update coefficient and the step size. The adjustment unit adjusts the step size on the basis of a ratio of power of the filter update coefficient to maximum power of the filter.
-
公开(公告)号:US20200320994A1
公开(公告)日:2020-10-08
申请号:US16758034
申请日:2018-08-28
Applicant: SONY CORPORATION
Inventor: NORIKO TOTSUKA , KAZUYA TATEISHI , YUICHIRO KOYAMA
Abstract: Provided is an information processing apparatus that has an utterance function or controls the utterance function. The information processing apparatus includes: a sending unit that sends interactive information regarding a voice agent; a receiving unit that receives interactive information regarding another voice agent; and a control unit that controls an utterance timing of the voice agent on the basis of the interactive information regarding another voice agent received by the receiving unit. The control unit causes utterance by the voice agent to stand by on the basis of the interactive information received from another voice agent. Moreover, the control unit causes the interactive information to be continuously sent during the utterance by the voice agent and during interaction between the voice agent and a user.
-
公开(公告)号:US20200333423A1
公开(公告)日:2020-10-22
申请号:US16753252
申请日:2018-09-27
Applicant: SONY CORPORATION
Inventor: KAZUKI OCHIAI , SHUSUKE TAKAHASHI , AKIRA TAKAHASHI , KAZUYA TATEISHI
Abstract: The present technology relates to a sound source direction estimation device and method, and a program that can reduce an operation amount for estimating a direction of a target sound source. A first estimation unit estimates a first horizontal angle that is a horizontal angle of a sound source direction from an input acoustic signal. A second estimation unit estimates a second horizontal angle that is the horizontal angle of the sound source direction and an elevation angle, with respect to the first horizontal angle, in a predetermined range near the first horizontal angle. The present technology can be applied, in a case where a voice is uttered from a surrounding sound source (for example, a person), to a device having a function of estimating the direction in which the voice is uttered.
-
公开(公告)号:US20200329308A1
公开(公告)日:2020-10-15
申请号:US16753236
申请日:2018-09-27
Applicant: SONY CORPORATION
Inventor: KAZUYA TATEISHI , SHUSUKE TAKAHASHI , AKIRA TAKAHASHI , KAZUKI OCHIAI
Abstract: The present technology relates to a voice input device and method, and a program that facilitate estimation of an utterance direction The voice input device includes: a fixed part disposed at a predetermined position; a movable part movable with respect to the fixed part; a microphone array attached to the fixed part; an utterance direction estimation unit configured to estimate an utterance direction on the basis of a voice from an utterer that is input from the microphone array; and a driving unit configured to drive the movable part according to the estimated utterance direction. The voice input device can be used by installation in, for example, a smart speaker, a voice agent, a robot, and the like.
-
公开(公告)号:US20210050013A1
公开(公告)日:2021-02-18
申请号:US17085628
申请日:2020-10-30
Applicant: SONY CORPORATION
Inventor: YUICHIRO KOYAMA , KAZUYA TATEISHI
Abstract: To provide an information processing device, an information processing method, and a program, which are capable of causing a device which is desirable to the user among a plurality of devices to give a response. An information processing device including: an input unit configured to obtain information related to a voice of a user and device information of each of a plurality of devices; and a selecting unit configured to select a device from the plurality of devices on the basis of an aspect specified by at least one of the information related to the voice and the device information obtained by the input unit and the device information.
-
-
-
-
-