ENCODING OF MULTIPLE AUDIO SIGNALS
    11.
    发明申请

    公开(公告)号:US20190035409A1

    公开(公告)日:2019-01-31

    申请号:US16152357

    申请日:2018-10-04

    Abstract: A device includes an encoder configured to determine, during a first period, that a first audio signal is a leading signal and that a second audio signal is a lagging signal. The encoder is also configured to generate a first frame of at least one encoded signal based on a first modified version of the second audio signal that is generated by adjusting the second audio signal based on a first mismatch value. The encoder is configured to determine, during a second period, that the first audio signal is the leading signal and that the second audio signal is the lagging signal. The encoder is configured to generate a second frame of the at least one encoded signal based on a second modified version of the second audio signal that is generated by adjusting the second audio signal based on the first mismatch value and a second mismatch value.

    VOICE PROFILE MANAGEMENT AND SPEECH SIGNAL GENERATION

    公开(公告)号:US20170256268A1

    公开(公告)日:2017-09-07

    申请号:US15603270

    申请日:2017-05-23

    CPC classification number: G10L21/003 G10L13/033 G10L17/00 G10L25/48

    Abstract: A device includes a receiver, a memory, and a processor. The receiver is configured to receive a remote voice profile. The memory is electrically coupled to the receiver. The memory is configured to store a local voice profile associated with a person. The processor is electrically coupled to the memory and the receiver. The processor is configured to determine that the remote voice profile is associated with the person based on speech content associated with the remote voice profile or an identifier associated with the remote voice profile. The processor is also configured to select the local voice profile for profile management based on the determination.

    Speech encoding using a pre-encoded database

    公开(公告)号:US11710492B2

    公开(公告)日:2023-07-25

    申请号:US16591478

    申请日:2019-10-02

    Abstract: Methods, systems, and devices for encoding are described. A device, which may be otherwise known as user equipment (UE), may support standards-compatible audio encoding (e.g., speech encoding) using a pre-encoded database. The device may receive a digital representation of an audio signal and identify, based on receiving the digital representation of the audio signal, a database that is pre-encoded according to a coding standard and that includes a quantity of digital representations of other audio signals. The device may encode the digital representation of the audio signal using a machine learning scheme and information from the database pre-encoded according to the coding standard. The device may generate a bitstream of the digital representation that is compatible with the coding standard based on encoding the digital representation of the audio signal, and output a representation of the bitstream.

    SPEECH ENCODING USING A PRE-ENCODED DATABASE

    公开(公告)号:US20210104250A1

    公开(公告)日:2021-04-08

    申请号:US16591478

    申请日:2019-10-02

    Abstract: Methods, systems, and devices for encoding are described. A device, which may be otherwise known as user equipment (UE), may support standards-compatible audio encoding (e.g., speech encoding) using a pre-encoded database. The device may receive a digital representation of an audio signal and identify, based on receiving the digital representation of the audio signal, a database that is pre-encoded according to a coding standard and that includes a quantity of digital representations of other audio signals. The device may encode the digital representation of the audio signal using a machine learning scheme and information from the database pre-encoded according to the coding standard. The device may generate a bitstream of the digital representation that is compatible with the coding standard based on encoding the digital representation of the audio signal, and output a representation of the bitstream.

    Split-domain speech signal enhancement

    公开(公告)号:US10741192B2

    公开(公告)日:2020-08-11

    申请号:US15973214

    申请日:2018-05-07

    Abstract: A method and an apparatus for estimating speech signal in split-domain is disclosed. The method includes performing LP analysis on a noisy speech signal to generate a first plurality of LPC and a first residual signal. The method also includes estimating speech LPC spectrum to generate cleaned LPC. The method further includes estimating speech residual spectrum to generate cleaned residual signal. The method also includes synthesizing output signals based on the cleaned LPC and the cleaned residual signal.

    Adjustable laser microphone
    18.
    发明授权

    公开(公告)号:US10362409B1

    公开(公告)日:2019-07-23

    申请号:US15913701

    申请日:2018-03-06

    Abstract: A method of capturing audio includes initiating capture, at a laser microphone, of first audio of an area of interest. The first audio is captured while the laser microphone is focused on a first target surface associated with the area of interest. The method also includes generating adjustment parameters based on a feedback signal to adjust targeting characteristics of the laser microphone. The method further includes adjusting the targeting characteristics of the laser microphone based on the adjustment parameters to focus the laser microphone on a second target surface associated with the area of interest. The method also includes initiating capture, at the laser microphone, of second audio of the area of interest in response to adjusting the targeting characteristics. The second audio has an audio quality that is greater than the first audio.

    Voice profile management and speech signal generation

    公开(公告)号:US09666204B2

    公开(公告)日:2017-05-30

    申请号:US14700009

    申请日:2015-04-29

    CPC classification number: G10L21/003 G10L13/033 G10L17/00 G10L25/48

    Abstract: A device includes a receiver, a memory, and a processor. The receiver is configured to receive a remote voice profile. The memory is electrically coupled to the receiver. The memory is configured to store a local voice profile associated with a person. The processor is electrically coupled to the memory and the receiver. The processor is configured to determine that the remote voice profile is associated with the person based on speech content associated with the remote voice profile or an identifier associated with the remote voice profile. The processor is also configured to select the local voice profile for profile management based on the determination.

    ENCODING OF MULTIPLE AUDIO SIGNALS
    20.
    发明申请

    公开(公告)号:US20170148447A1

    公开(公告)日:2017-05-25

    申请号:US15274041

    申请日:2016-09-23

    Abstract: A device includes an encoder. The encoder is configured to receive two audio channels. The encoder is also configured to determine a mismatch value indicative of an amount of a temporal mismatch between the two audio channels. The encoder is further configured to determine, based on the mismatch value, at least one of a target channel or a reference channel. The target channel corresponds to a lagging audio channel of the two audio channels and the reference channel corresponds to a leading audio channel of the two audio channels. The encoder is also configured to generate a modified target channel by adjusting the target channel based on the offset value. The encoder is further configured to generate at least one encoded channel based on the reference channel and the modified target channel.

Patent Agency Ranking