Voice profile management and speech signal generation

    公开(公告)号:US09875752B2

    公开(公告)日:2018-01-23

    申请号:US15603270

    申请日:2017-05-23

    CPC classification number: G10L21/003 G10L13/033 G10L17/00 G10L25/48

    Abstract: A device includes a receiver, a memory, and a processor. The receiver is configured to receive a remote voice profile. The memory is electrically coupled to the receiver. The memory is configured to store a local voice profile associated with a person. The processor is electrically coupled to the memory and the receiver. The processor is configured to determine that the remote voice profile is associated with the person based on speech content associated with the remote voice profile or an identifier associated with the remote voice profile. The processor is also configured to select the local voice profile for profile management based on the determination.

    Encoding of multiple audio signals

    公开(公告)号:US11094330B2

    公开(公告)日:2021-08-17

    申请号:US16805289

    申请日:2020-02-28

    Abstract: A device includes an encoder configured to determine, during a first period, that a first audio signal is a leading signal and that a second audio signal is a lagging signal. The encoder is also configured to generate a first frame of at least one encoded signal based on a first modified version of the second audio signal that is generated by adjusting the second audio signal based on a first mismatch value. The encoder is configured to determine, during a second period, that the first audio signal is the leading signal and that the second audio signal is the lagging signal. The encoder is configured to generate a second frame of the at least one encoded signal based on a second modified version of the second audio signal that is generated by adjusting the second audio signal based on the first mismatch value and a second mismatch value.

    Audio processing for temporally mismatched signals

    公开(公告)号:US10204629B2

    公开(公告)日:2019-02-12

    申请号:US16049688

    申请日:2018-07-30

    Abstract: A device includes a processor and a transmitter. The processor is configured to determine a first value and a second value indicative of a first amount and a second amount, respectively, of a temporal mismatch between a first audio signal and a second audio signal. The processor is also configured to determine an effective value based on the first value and the second value, to select, based on the effective value, a first coding mode and a second coding mode, and to generate at least one encoded signal having a bit allocation. The at least one encoded signal is based on a first encoded signal and a second encoded signal that are based on the first coding mode and the second coding mode, respectively. The bit allocation is at least partially based on the effective mismatch value. The transmitter is configured to transmit the at least one encoded signal.

    AUDIO PROCESSING FOR TEMPORALLY MISMATCHED SIGNALS

    公开(公告)号:US20180336907A1

    公开(公告)日:2018-11-22

    申请号:US16049688

    申请日:2018-07-30

    CPC classification number: G10L19/002 G10L19/008 G10L19/025 G10L19/22

    Abstract: A device includes a processor and a transmitter. The processor is configured to determine a first value and a second value indicative of a first amount and a second amount, respectively, of a temporal mismatch between a first audio signal and a second audio signal. The processor is also configured to determine an effective value based on the first value and the second value, to select, based on the effective value, a first coding mode and a second coding mode, and to generate at least one encoded signal having a bit allocation. The at least one encoded signal is based on a first encoded signal and a second encoded signal that are based on the first coding mode and the second coding mode, respectively. The bit allocation is at least partially based on the effective mismatch value. The transmitter is configured to transmit the at least one encoded signal.

    Media segment representation using fixed weights

    公开(公告)号:US12300233B2

    公开(公告)日:2025-05-13

    申请号:US18047562

    申请日:2022-10-18

    Abstract: A device includes a memory configured to store a collection of sets of weights, each of the sets of weights representing a respective media segment. The device also includes one or more processors configured to generate data representing the detected first input speech segment and to pass the data representing the detected first input speech segment into a collection of memory units. Each memory unit of the collection of memory units includes a set of weights from the collection of sets of weights. The one or more processors are also configured to generate a first estimate of an associated media segment that represents the detected first input speech segment. The associated media segment corresponds to a first memory unit in the collection of memory units.

Patent Agency Ranking