Acoustic object extraction device and acoustic object extraction method

    Publication No.: US11488573B2

    Publication date: 2022-11-01

    Application No.: US17257413

    Filing date: 2019-09-06

    Abstract: In the acoustic object extraction device, beam forming processing units generate a first acoustic signal by beam forming in an arrival direction of a signal from an acoustic object with respect to a first microphone array and generate a second acoustic signal by beam forming in an arrival direction of a signal from the acoustic object with respect to a second microphone array, and a common component extraction unit extracts, on the basis of a similarity between the spectrum of the first acoustic signal and the spectrum of the second acoustic signal and from the first acoustic signal and the second acoustic signal, a signal containing a common component corresponding to the acoustic object. The common component extraction unit divides the spectrums of the first acoustic signal and the second acoustic signal into a plurality of frequency sections and calculates a similarity for each of the frequency sections.
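The per-section similarity step in this abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say which similarity measure is used or how the common component is formed, so the cosine similarity, the similarity-weighted average, and the names `section_similarities`, `extract_common_component`, and `section_size` are all assumptions made for illustration.

```python
import math


def section_similarities(spec_a, spec_b, section_size):
    """Return one cosine similarity per frequency section.

    spec_a and spec_b are magnitude spectra (lists of non-negative floats)
    of the two beamformed signals. Cosine similarity is an assumed measure;
    the abstract only says "a similarity".
    """
    if len(spec_a) != len(spec_b):
        raise ValueError("spectra must have the same number of bins")
    sims = []
    for start in range(0, len(spec_a), section_size):
        a = spec_a[start:start + section_size]
        b = spec_b[start:start + section_size]
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        # A silent section carries no evidence of a common component.
        sims.append(dot / norm if norm > 0.0 else 0.0)
    return sims


def extract_common_component(spec_a, spec_b, section_size):
    """Weight the mean of both spectra by the similarity of each section.

    This weighting rule is a hypothetical stand-in for the extraction step.
    """
    sims = section_similarities(spec_a, spec_b, section_size)
    return [
        sims[i // section_size] * 0.5 * (x + y)
        for i, (x, y) in enumerate(zip(spec_a, spec_b))
    ]
```

In this sketch, sections where the two beams agree pass through almost unchanged, while sections where they disagree are attenuated toward zero.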

    Binaural rendering apparatus and method for playing back of multiple audio sources

    Publication No.: US10735886B2

    Publication date: 2020-08-04

    Application No.: US16724921

    Filing date: 2019-12-23

    Abstract: A method of generating binaural headphone playback signals given multiple audio source signals with an associated metadata and binaural room impulse response (BRIR) database, wherein the multiple audio source signals can be channel-based, object-based, or a mixture of both signals. The method includes grouping the multiple audio source signals according to positions of the audio sources in a hierarchical manner, and parameterizing BRIR to be used for rendering. The method also includes dividing each audio source signal to be rendered into a number of blocks and frames, averaging the parameterized BRIR sequences identified with a hierarchically grouping result, and downmixing the divided audio source signals identified with the hierarchically grouping result.
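The averaging and downmixing steps can be sketched for a single group and a single ear. This is only an illustration under stated assumptions: the hierarchical grouping is taken as already done, the BRIRs are plain equal-length tap lists rather than parameterized sequences, and the helper names `average_brirs`, `downmix`, `convolve`, and `render_group` are hypothetical.

```python
def average_brirs(brirs):
    """Average equal-length impulse responses tap by tap."""
    count = len(brirs)
    return [sum(taps) / count for taps in zip(*brirs)]


def downmix(frames):
    """Sum equal-length source frames sample by sample."""
    return [sum(samples) for samples in zip(*frames)]


def convolve(signal, kernel):
    """Direct-form linear convolution of two sequences."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def render_group(frames, brirs):
    """Render one group with a single convolution.

    Instead of convolving every source with its own BRIR, the group's
    frames are downmixed and filtered once with the averaged BRIR.
    """
    return convolve(downmix(frames), average_brirs(brirs))
```

The design trade-off is one convolution per group instead of one per source. The result matches the per-source sum exactly only when the grouped BRIRs are identical, and is an approximation otherwise.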

    Communication terminal apparatus and communication method

    Publication No.: US11647428B2

    Publication date: 2023-05-09

    Application No.: US17069499

    Filing date: 2020-10-13

    Abstract: A communication method supports Enhanced Voice Services (EVS) codec performed by a communication terminal apparatus. The method includes performing negotiation to use an EVS codec for communication between a communication terminal apparatus and a counterpart terminal, using an IP multimedia subsystem (IMS) signaling including one of a session description protocol (SDP) offer and an SDP answer, and performing negotiation to specify one or more audio-bandwidths of input signals in Hertz (Hz) or kilohertz (kHz) for the EVS codec. In a communication session after the codec negotiation session, the processor causes the receiver to receive a signaling for changing an audio-bandwidth of an input signal to the EVS codec from a network node, changes the audio-bandwidth of the input signal to the EVS codec to another audio-bandwidth without changing the EVS codec based on the signaling, and causes the transmitter to transmit encoded data in the changed audio-bandwidth.

    Binaural rendering apparatus and method for playing back of multiple audio sources

    Publication No.: US11337026B2

    Publication date: 2022-05-17

    Application No.: US17097829

    Filing date: 2020-11-13

    Abstract: A method generates binaural headphone playback signals given multiple audio source signals with associated metadata and a binaural room impulse response (BRIR) database, where the audio source signals can be channel-based, object-based, or a mixture of both signals. The method groups the audio source signals according to positions of the audio sources, divides BRIR into blocks and frames, where the BRIR is divided into a direct block and diffuse blocks, and divides each audio source signal into blocks and frames, wherein the source signal is divided into a current block and previous blocks, and the current block is further divided into the frames. The method further averages, for each of previous frames of the source signals, the divided BRIR identified with the grouping result by downmixing the previous frames of the source signals according to the grouping result, and performs a convolution with the downmixed previous frame.

    Encoder and encoding method
    Granted invention patent

    Publication No.: US11270710B2

    Publication date: 2022-03-08

    Application No.: US16640708

    Filing date: 2018-08-31

    Abstract: In an encoder, a signal analysis unit performs signal analysis on an L channel signal and an R channel signal that constitute a stereo signal and generates a parameter used to determine a coding mode for each of an L channel and an R channel. A DMA stereo encoding unit encodes the L channel signal and the R channel signal by using a coding mode common to the L channel signal and the R channel signal. At this time, the DMA stereo encoding unit determines the common coding mode by selecting, out of the L channel and the R channel, the one that has a lower ratio of energy of an environmental sound component to the entire energy of the channel and using the parameter of the selected channel.
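The channel selection rule in this abstract can be sketched as follows. The `ChannelAnalysis` record, its field names, and the tie-breaking and zero-energy handling are assumptions for illustration; the abstract states only that the channel with the lower ratio of environmental-sound energy to total energy supplies the parameter for the common coding mode.

```python
from dataclasses import dataclass


@dataclass
class ChannelAnalysis:
    """Hypothetical result of signal analysis for one channel."""
    total_energy: float
    ambient_energy: float   # energy of the environmental sound component
    mode_parameter: str     # parameter used to determine the coding mode


def ambient_ratio(channel):
    """Ratio of environmental-sound energy to the channel's total energy."""
    if channel.total_energy <= 0.0:
        # Assumption: a silent channel is treated as entirely ambient.
        return 1.0
    return channel.ambient_energy / channel.total_energy


def select_common_mode_parameter(left, right):
    """Pick the parameter of the channel with the lower ambient ratio.

    Ties go to the left channel, which is an arbitrary choice made here.
    """
    chosen = left if ambient_ratio(left) <= ambient_ratio(right) else right
    return chosen.mode_parameter
```

For example, with a left channel that is 50% ambient and a right channel that is 20% ambient, the right channel's parameter determines the mode shared by both channels.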
