-
Publication No.: US11817107B2
Publication date: 2023-11-14
Application No.: US17875237
Filing date: 2022-07-27
CPC classification: G10L19/0018 , G10L19/265 , G10L25/12 , G10L25/69 , G10L25/72
Abstract: Innovations in phase quantization during speech encoding and phase reconstruction during speech decoding are described. For example, to encode a set of phase values, a speech encoder omits higher-frequency phase values and/or represents at least some of the phase values as a weighted sum of basis functions. Or, as another example, to decode a set of phase values, a speech decoder reconstructs at least some of the phase values using a weighted sum of basis functions and/or reconstructs lower-frequency phase values and then uses at least some of the lower-frequency phase values to synthesize higher-frequency phase values. In many cases, the innovations improve the performance of a speech codec in low bitrate scenarios, even when encoded data is delivered over a network that suffers from insufficient bandwidth or transmission quality problems.
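
As a rough illustration of representing phase values as a weighted sum of basis functions, here is a minimal Python sketch. The abstract does not say which basis functions are used; the cosine (DCT-II style) basis, the function names `cosine_basis`, `encode_phases`, and `decode_phases`, and the absence of phase unwrapping and quantization are all assumptions made for this example, not the patented method.

```python
import math

def cosine_basis(k, n, count):
    """Value of the k-th cosine basis function at index n (assumed basis)."""
    return math.cos(math.pi * k * (n + 0.5) / count)

def encode_phases(phases, num_weights):
    """Project the phase values onto the first num_weights basis functions.

    The cosine basis is orthogonal over n = 0..count-1, so each weight is
    a scaled inner product. Keeping num_weights < len(phases) is what
    would save bits in a real codec.
    """
    count = len(phases)
    weights = []
    for k in range(num_weights):
        scale = (1.0 if k == 0 else 2.0) / count
        weights.append(scale * sum(p * cosine_basis(k, n, count)
                                   for n, p in enumerate(phases)))
    return weights

def decode_phases(weights, count):
    """Rebuild count phase values as a weighted sum of the basis functions."""
    return [sum(w * cosine_basis(k, n, count) for k, w in enumerate(weights))
            for n in range(count)]

# Hypothetical phase values that happen to lie in the span of three basis functions.
phases = [0.3 + 0.5 * cosine_basis(1, n, 8) - 0.2 * cosine_basis(2, n, 8)
          for n in range(8)]
weights = encode_phases(phases, 3)
rebuilt = decode_phases(weights, 8)
```

In this sketch eight phase values are carried by three weights. Phase values outside the span of the retained basis functions would only be approximated.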
-
Publication No.: US20190122675A1
Publication date: 2019-04-25
Application No.: US16222721
Filing date: 2018-12-17
Inventors: Heiko Purnhagen , Pontus Carlsson , Lars Villemoes
IPC classification: G10L19/008 , G10L19/16 , H04S3/00 , G10L19/06 , G10L19/02
CPC classification: G10L19/008 , G10L19/0212 , G10L19/06 , G10L19/167 , G10L25/12 , H04S3/008 , H04S2400/01
Abstract: The invention provides methods and devices for outputting a stereo audio signal having a left channel and a right channel. The apparatus includes a demultiplexer, a decoder, and an upmixer. The upmixer is configured to operate in either a prediction mode or a non-prediction mode based on a parameter encoded in the audio bitstream.
-
Publication No.: US10013991B2
Publication date: 2018-07-03
Application No.: US15799777
Filing date: 2017-10-31
CPC classification: G10L19/0204 , G10L19/00 , G10L19/025 , G10L19/083 , G10L19/12 , G10L19/26 , G10L25/12 , G10L25/18 , G10L25/21 , G10L25/48 , H03H17/0266
Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank-based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels depending on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, grouping the channels during gain calculation gives an improved energy estimate of the real-valued subband signals in the filterbank.
-
Publication No.: US20180166094A1
Publication date: 2018-06-14
Application No.: US15889775
Filing date: 2018-02-06
Inventors: Yutaka KAMAMOTO , Takehiro MORIYA , Noboru HARADA
Abstract: An autocorrelation calculating part calculates autocorrelation Ro(i) from an input signal. A predictive coefficient calculating part performs linear predictive analysis using modified autocorrelation R′o(i) obtained by multiplying the autocorrelation Ro(i) by a coefficient wo(i). For at least some of the orders i, this covers both the case where the coefficient wo(i) corresponding to each order i monotonically increases as a value negatively correlated with the fundamental frequency of the input signal in the current frame or a past frame increases, and the case where the coefficient wo(i) monotonically decreases as a value positively correlated with the pitch gain in the current frame or a past frame increases.
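
The processing chain in this abstract (autocorrelation, multiplication by a lag-dependent coefficient, then linear predictive analysis) can be sketched in Python as follows. The Gaussian window shape, the `alpha` parameter, and the use of the pitch period as the value negatively correlated with the fundamental frequency are assumptions chosen for illustration; the abstract only requires that the coefficient change monotonically. The Levinson-Durbin recursion is the standard way to solve for the predictive coefficients and is not claimed here to be the applicants' implementation.

```python
import math

def autocorrelation(signal, max_lag):
    """Ro(i) for i = 0..max_lag."""
    return [sum(signal[n] * signal[n - i] for n in range(i, len(signal)))
            for i in range(max_lag + 1)]

def lag_window(max_lag, pitch_period, alpha=0.5):
    """Hypothetical coefficient wo(i): grows toward 1 as the pitch period
    (a value negatively correlated with the fundamental frequency) grows."""
    return [math.exp(-0.5 * (alpha * i / pitch_period) ** 2)
            for i in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve for the prediction-error filter [1, a1, ..., a_order].

    Assumes r[0] > 0 (a non-silent frame).
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        acc = r[m] + sum(a[j] * r[m - j] for j in range(1, m))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, m):
            new_a[j] = a[j] + k * a[m - j]
        new_a[m] = k
        a = new_a
        err *= 1.0 - k * k                  # remaining prediction error
    return a, err

def lpc_with_modified_autocorrelation(signal, order, pitch_period):
    r = autocorrelation(signal, order)
    w = lag_window(order, pitch_period)
    modified = [ri * wi for ri, wi in zip(r, w)]   # R'o(i) = Ro(i) * wo(i)
    return levinson_durbin(modified, order)
```

With this window, a frame with a long pitch period leaves the autocorrelation almost unchanged, while a short pitch period smooths it more strongly.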
-
Publication No.: US09990929B2
Publication date: 2018-06-05
Application No.: US15799641
Filing date: 2017-10-31
CPC classification: G10L19/0204 , G10L19/00 , G10L19/025 , G10L19/083 , G10L19/12 , G10L19/26 , G10L25/12 , G10L25/18 , G10L25/21 , G10L25/48 , H03H17/0266
Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank-based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels depending on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, grouping the channels during gain calculation gives an improved energy estimate of the real-valued subband signals in the filterbank.
-
Publication No.: US20180137868A1
Publication date: 2018-05-17
Application No.: US15849653
Filing date: 2017-12-20
Inventors: Heiko Purnhagen , Pontus Carlsson , Lars Villemoes
IPC classification: G10L19/008 , G10L19/16 , G10L19/02 , H04S3/00 , G10L19/06
CPC classification: G10L19/008 , G10L19/0212 , G10L19/06 , G10L19/167 , G10L25/12 , H04S3/008 , H04S2400/01
Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
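
Upmixing step (ii) can be illustrated with a small Python sketch. It assumes the first input channel is a downmix and the second is a prediction residual, that the second frequency-domain representation from step (i) is already available as `mid_im`, and that the side signal is formed with a plus sign on both prediction terms. The function name, the argument names, and the sign convention are hypothetical and are not taken from the application.

```python
def upmix_complex_prediction(mid_re, mid_im, residual, alpha, active=True):
    """Reconstruct left/right spectra from a downmix and a residual.

    mid_re   -- first frequency-domain representation of the first channel
    mid_im   -- second representation of the first channel (from step (i))
    residual -- first frequency-domain representation of the second channel
    alpha    -- complex prediction coefficient
    active   -- control flag; when False the upmix is suspended and the
                two input channels are passed through unchanged
    """
    if not active:
        return list(mid_re), list(residual)
    left, right = [], []
    for m_re, m_im, d in zip(mid_re, mid_im, residual):
        # Predict the side signal from both representations of the downmix,
        # then add the transmitted residual (assumed sign convention).
        side = d + alpha.real * m_re + alpha.imag * m_im
        left.append(m_re + side)
        right.append(m_re - side)
    return left, right

left, right = upmix_complex_prediction(
    [1.0, 2.0], [0.5, -1.0], [0.1, 0.2], complex(0.5, 0.25))
```

Setting `active=False` models the last sentence of the abstract: the decoder skips the upmix when the control data says so.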
-
Publication No.: US20180137866A1
Publication date: 2018-05-17
Application No.: US15849622
Filing date: 2017-12-20
Inventors: Heiko Purnhagen , Pontus Carlsson , Lars Villemoes
IPC classification: G10L19/008 , G10L19/16 , G10L19/02 , H04S3/00 , G10L19/06
CPC classification: G10L19/008 , G10L19/0212 , G10L19/06 , G10L19/167 , G10L25/12 , H04S3/008 , H04S2400/01
Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
-
Publication No.: US20170365261A1
Publication date: 2017-12-21
Application No.: US15670709
Filing date: 2017-08-07
Inventors: Heiko Purnhagen , Pontus Carlsson , Lars Villemoes
IPC classification: G10L19/008 , G10L19/06 , G10L19/02 , G10L19/16 , H04S3/00
CPC classification: G10L19/008 , G10L19/0212 , G10L19/06 , G10L19/167 , G10L25/12 , H04S3/008 , H04S2400/01
Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
-
Publication No.: US20170303054A1
Publication date: 2017-10-19
Application No.: US15487474
Filing date: 2017-04-14
Applicant: SIVANTOS PTE. LTD.
IPC classification: H04R25/00 , G10L19/04 , G10L19/038 , G10L21/038
CPC classification: H04R25/554 , G10L19/032 , G10L19/038 , G10L19/04 , G10L21/038 , G10L25/12 , H04R25/50 , H04R25/552 , H04R2225/43 , H04R2225/51 , H04R2225/53 , H04R2430/03 , H04R2460/03
Abstract: A method transmits an audio signal from a transmitter to a receiver. A hearing device, particularly a hearing aid, contains a communication facility that is provided and configured for transmitting and/or receiving an audio signal according to the method. A hearing device system has two hearing devices and is provided and configured to transmit audio signals between the two hearing devices via their communication facilities according to the method.
-
Publication No.: US20170287488A9
Publication date: 2017-10-05
Application No.: US15185298
Filing date: 2016-06-17
CPC classification: G10L17/04 , G10L15/08 , G10L15/10 , G10L15/22 , G10L17/005 , G10L17/24 , G10L25/12 , G10L25/24 , G10L2015/0638 , H04M3/38 , H04M3/382 , H04M3/385 , H04M3/42204 , H04M3/493 , H04M2201/40
Abstract: A method of accessing a dial-up service is disclosed. An example method of providing access to a service includes receiving a first speech signal from a user to form a first utterance; recognizing the first utterance using speaker independent speaker recognition; requesting the user to enter a personal identification number; and when the personal identification number is valid, receiving a second speech signal to form a second utterance and providing access to the service.