-
公开(公告)号:US11488573B2
公开(公告)日:2022-11-01
申请号:US17257413
申请日:2019-09-06
Inventor: Rohith Mars , Srikanth Nagisetty , Chong Soon Lim , Hiroyuki Ehara , Akihisa Kawamura
Abstract: In the acoustic object extraction device, beam forming processing units generate a first acoustic signal by beam forming in an arrival direction of a signal from an acoustic object with respect to a microphone array and generate a second acoustic signal by beam forming in an arrival direction of a signal from the acoustic object with respect to a microphone array, and a common component extraction unit extracts, on the basis of a similarity between the spectrum of the first acoustic signal and the spectrum of the second acoustic signal and from the first acoustic signal and the second acoustic signal, a signal containing a common component corresponding to the acoustic object. The common component extraction unit divides the spectrums of the first acoustic signal and the second acoustic signal into a plurality of frequency sections and calculates a similarity for each of the frequency sections.
-
公开(公告)号:US10735886B2
公开(公告)日:2020-08-04
申请号:US16724921
申请日:2019-12-23
Inventor: Hiroyuki Ehara , Kai Wu , Sua Hong Neo
IPC: G10L19/008 , H04S7/00 , H04S1/00
Abstract: A method of generating binaural headphone playback signals given multiple audio source signals with an associated metadata and binaural room impulse response (BRIR) database, wherein the multiple audio source signals can be channel-based, object-based, or a mixture of both signals. The method includes grouping the multiple audio source signals according to positions of the audio sources in a hierarchical manner, and parameterizing BRIR to be used for rendering. The method also includes dividing each audio source signal to be rendered into a number of blocks and frames, averaging the parameterized BRIR sequences identified with a hierarchically grouping result, and downmixing the divided audio source signals identified with the hierarchically grouping result.
-
公开(公告)号:US12062378B2
公开(公告)日:2024-08-13
申请号:US17791708
申请日:2020-12-02
Inventor: Akira Harada , Hiroyuki Ehara , Toshiaki Sakurai
IPC: G10L19/008
CPC classification number: G10L19/008
Abstract: This encoding device is provided with a control circuit that, on the basis of information relating to the capability to convert the signal form of a sound signal in a decoding device for decoding encoded data of the sound signal, controls the conversion of the signal form of the sound signal, and an encoding circuit that encodes the sound signal in accordance to the conversion control.
-
公开(公告)号:US09830919B2
公开(公告)日:2017-11-28
申请号:US15063529
申请日:2016-03-08
Inventor: Srikanth Nagisetty , Zongxian Liu , Hiroyuki Ehara
IPC: G10L19/038 , G10L19/02
CPC classification number: G10L19/038 , G10L19/0204
Abstract: An acoustic signal coding apparatus includes a subband classifier that classifies subbands obtained by dividing a frequency-domain spectrum into a plurality of perceptually important first-category subbands and the other subbands referred to as second-category subbands according to at least one of measures in terms of energy and peak property, a subband peak-algebraic vector quantization (SBP-AVQ) vector generator that generates an SBP-AVQ vector by collecting a maximum peak from each first-category subband, outputs the generated SBP-AVQ vector, and outputs peak position information indicating the positions of the maximum peaks, a bit distributor that distributes bits for AVQ coding to the SBP-AVQ vector and the second-category subband vector, and an AVQ coder that performs AVQ coding on the SBP-AVQ vector and the second-category subband vector.
-
公开(公告)号:US11653171B2
公开(公告)日:2023-05-16
申请号:US17725097
申请日:2022-04-20
Inventor: Hiroyuki Ehara , Kai Wu , Sua Hong Neo
IPC: H04S7/00 , H04S1/00 , G10L19/008
CPC classification number: H04S7/304 , G10L19/008 , H04S1/005 , H04S7/305 , H04S2400/01 , H04S2420/01
Abstract: A method that generates binaural headphone playback signals given multiple audio source signals with an associated metadata and binaural room impulse response (BRIR) database, wherein the audio source signals are channel-based, object-based, or a mixture of both channel-based and object-based signals. The method includes parameterizing BRIR to be used for rendering, dividing each audio source signal to be rendered into a number of blocks and frames, and averaging the parameterized BRIR sequences. The method also includes downmixing the divided audio source signals using the diffuse blocks of BRIRs, and performing late reverberation processing on the downmixed version of the previous blocks of the audio source signals.
-
公开(公告)号:US11647428B2
公开(公告)日:2023-05-09
申请号:US17069499
申请日:2020-10-13
Inventor: Takako Hori , Hiroyuki Ehara
CPC classification number: H04W36/0022 , H04W72/04 , H04W76/22 , H04M7/0072 , H04W88/181
Abstract: A communication method supports Enhanced Voice Services (EVS) codec performed by a communication terminal apparatus. The method includes performing negotiation to use an EVS codec for communication between a communication terminal apparatus and a counterpart terminal, using an IP multimedia subsystem (IMS) signaling including one of a session description protocol (SDP) offer and an SDP answer, and performing negotiation to specify one or more audio-bandwidths of input signals in Hertz (Hz) or kilohertz (kHz) for the EVS codec. In a communication session after the codec negotiation session, the processor causes the receiver to receive a signaling for changing an audio-bandwidth of an input signal to the EVS codec from a network node, changes the audio-bandwidth of the input signal to the EVS codec to another audio-bandwidth without changing the EVS codec based on the signaling, and causes the transmitter to transmit encoded data in the changed audio-bandwidth.
-
公开(公告)号:US11545165B2
公开(公告)日:2023-01-03
申请号:US17256899
申请日:2019-07-02
Inventor: Srikanth Nagisetty , Hiroyuki Ehara , Rohith Mars , Chong Soon Lim , Toshiaki Sakurai
IPC: G10L19/04 , G10L19/008 , G10L19/02
Abstract: This encoding device is able to encode an S signal efficiently in MS prediction encoding. An M signal encoding unit generates first encoding information by encoding a sum signal indicating a sum of a left channel signal and a right channel signal that constitute a stereo signal. An energy difference calculation unit calculates a prediction parameter for predicting a difference signal indicating a difference between the left channel signal and the right channel signal by using a parameter regarding an energy difference between the left channel signal and the right channel signal. An entropy encoding unit generates second encoding information by encoding the prediction parameter.
-
公开(公告)号:US11337026B2
公开(公告)日:2022-05-17
申请号:US17097829
申请日:2020-11-13
Inventor: Hiroyuki Ehara , Kai Wu , Sua Hong Neo
IPC: G10L19/008 , H04S7/00 , H04S1/00
Abstract: A method generates binaural headphone playback signals given multiple audio source signals with associated metadata and a binaural room impulse response (BRIR) database, where the audio source signals can be channel-based, object-based, or a mixture of both signals. The method groups the audio source signals according to positions of the audio sources, divides BRIR into blocks and frames, where the BRIR is divided into a direct block and diffuse blocks, and divides each audio source signal into blocks and frames, wherein the source signal is divided into a current block and previous blocks, and the current block is further divided into the frames. The method further averages, for each of previous frames of the source signals, the divided BRIR identified with the grouping result by downmixing the previous frames of the source signals according to the grouping result, and performs a convolution with the downmixed previous frame.
-
公开(公告)号:US11270710B2
公开(公告)日:2022-03-08
申请号:US16640708
申请日:2018-08-31
Inventor: Srikanth Nagisetty , Hiroyuki Ehara
IPC: G10L19/008 , G10L19/22
Abstract: In an encoder, a signal analysis unit performs signal analysis on an L channel signal and an R channel signal that constitute a stereo signal and generates a parameter used to determine a coding mode for each of an L channel and an R channel. A DMA stereo encoding unit encodes the L channel signal and the R channel signal by using a coding mode common to the L channel signal and the R channel signal. At this time, the DMA stereo encoding unit determines the common coding mode by selecting, out of the L channel and the R channel, the one that has a lower ratio of energy of an environmental sound component to the entire energy of the channel and using the parameter of the selected channel.
-
10.
公开(公告)号:US11145316B2
公开(公告)日:2021-10-12
申请号:US16612902
申请日:2018-05-09
Inventor: Srikanth Nagisetty , Sua Hong Neo , Hiroyuki Ehara
IPC: G10L19/008 , G10L19/005 , G10L19/24 , G10L19/00
Abstract: An inter-channel correlation calculation unit calculates an inter-channel correlation between a left channel and a right channel by using a left channel signal and a right channel signal that constitute a stereo signal. A DMA stereo encoding unit and a DM stereo encoding unit encode the left channel signal and the right channel signal by using a common coding mode when the inter-channel correlation is greater than a threshold value, and individually encode the left channel signal and the right channel signal by using a coding mode determined for each of the left channel signal and the right channel signal when the inter-channel correlation is less than or equal to the threshold value.
-
-
-
-
-
-
-
-
-