-
41.
公开(公告)号:US09978376B2
公开(公告)日:2018-05-22
申请号:US14973722
申请日:2015-12-18
IPC分类号: G10L19/00 , G10L19/005 , G10L19/06 , G10L19/002 , G10L19/012 , G10L19/083 , G10L19/12 , G10L19/07 , G10L19/22 , G10L19/09 , G10L19/02
CPC分类号: G10L19/005 , G10L19/002 , G10L19/012 , G10L19/0212 , G10L19/06 , G10L19/07 , G10L19/083 , G10L19/09 , G10L19/12 , G10L19/22 , G10L2019/0002 , G10L2019/0011 , G10L2019/0016
摘要: An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal includes a receiving interface for receiving one or more frames comprising information on a plurality of audio signal samples of an audio signal spectrum of the encoded audio signal, and a processor for generating the reconstructed audio signal. The processor is configured to generate the reconstructed audio signal by fading a modified spectrum to a target spectrum, if a current frame is not received by the receiving interface or if the current frame is received by the receiving interface but is corrupted, wherein the modified spectrum includes a plurality of modified signal samples, wherein, for each of the modified signal samples of the modified spectrum, an absolute value of the modified signal sample is equal to an absolute value of one of the audio signal samples of the audio signal spectrum.
-
42.
公开(公告)号:US20180114535A1
公开(公告)日:2018-04-26
申请号:US15848841
申请日:2017-12-20
IPC分类号: G10L19/02 , G10L19/032 , G10L19/002
CPC分类号: G10L19/0204 , G10L19/002 , G10L19/02 , G10L19/0212 , G10L19/032 , G10L19/24 , G10L21/038
摘要: A speech/audio decoding apparatus is provided that includes a receiver that receives encoded data including a limited-band mode flag, and a memory that stores information on a position of a maximum amplitude spectrum frequency of a previous frame in a divided band. The speech/audio decoding apparatus also includes a processor that identifies whether a decoding band is encoded using a limited-band mode based on the decoded limited-band mode flag. Additionally, the processor decodes the spectrum in a limited band within each of the divided bands in a current frame using the stored information. Furthermore, the limited-band mode is set at an encoder side, when a difference between a first frequency with a first maximum amplitude in a spectrum of the divided band in a preceding frame and a second frequency with a second maximum amplitude in a spectrum of the divided band in the current frame is below a threshold.
-
43.
公开(公告)号:US20180108367A1
公开(公告)日:2018-04-19
申请号:US15818102
申请日:2017-11-20
IPC分类号: G10L19/16 , G10L19/002 , G10L19/06
摘要: Apparatus and methods for generating an encoded audio bitstream, including by including program loudness metadata and audio data in the bitstream, and optionally also program boundary metadata in at least one segment (e.g., frame) of the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, e.g., including by performing adaptive loudness processing of the audio data of an audio program indicated by the bitstream, or authentication and/or validation of metadata and/or audio data of such an audio program. Another aspect is an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodiment of the method or which includes a buffer memory which stores at least one frame of an audio bitstream generated in accordance with any embodiment of the method.
-
公开(公告)号:US20180082694A1
公开(公告)日:2018-03-22
申请号:US15823284
申请日:2017-11-27
发明人: Moo Young Kim
IPC分类号: G10L19/008 , H04S3/00 , G10L19/002
CPC分类号: G10L19/008 , G10L19/002 , H04S3/008 , H04S2420/11
摘要: Systems and techniques for compression and decoding of audio data are generally disclosed. An example device for compressing higher order ambisonic (HOA) coefficients representative of a soundfield includes a memory configured to store audio data and one or more processors configured to: determine when to use ambient HOA coefficients of the HOA coefficients to augment one or more foreground audio objects obtained through decomposition of the HOA coefficients based on one or more singular values also obtained through the decomposition of the HOA coefficients, the ambient HOA coefficients representative of an ambient component of the soundfield.
-
45.
公开(公告)号:US09905237B2
公开(公告)日:2018-02-27
申请号:US15491661
申请日:2017-04-19
IPC分类号: G10L19/16 , H03G9/00 , G10L19/002 , H03G9/02
CPC分类号: G10L19/167 , G10L19/002 , G10L19/06 , H03G9/005 , H03G9/025
摘要: Apparatus and methods for generating an encoded audio bitstream, including by including program loudness metadata and audio data in the bitstream, and optionally also program boundary metadata in at least one segment (e.g., frame) of the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, e.g., including by performing adaptive loudness processing of the audio data of an audio program indicated by the bitstream, or authentication and/or validation of metadata and/or audio data of such an audio program. Another aspect is an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodiment of the method or which includes a buffer memory which stores at least one frame of an audio bitstream generated in accordance with any embodiment of the method.
-
公开(公告)号:US09905230B2
公开(公告)日:2018-02-27
申请号:US14734088
申请日:2015-06-09
IPC分类号: G10L19/002 , H04S5/02 , H04S5/00 , H04S3/02 , G10L19/008 , G10L19/18
CPC分类号: G10L19/002 , G10L19/008 , G10L19/18 , H04S3/02 , H04S5/00 , H04S5/005 , H04S5/02 , H04S2400/01 , H04S2400/03 , H04S2420/03
摘要: The application relates to audio encoder and decoder systems. An embodiment of the encoder system comprises a downmix stage for generating a downmix signal and a residual signal based on a stereo signal. In addition, the encoder system comprises a parameter determining stage for determining parametric stereo parameters such as an inter-channel intensity difference and an inter-channel cross-correlation. Preferably, the parametric stereo parameters are time- and frequency-variant. Moreover, the encoder system comprises a transform stage. The transform stage generates a pseudo left/right stereo signal by performing a transform based on the downmix signal and the residual signal. The pseudo stereo signal is processed by a perceptual stereo encoder. For stereo encoding, left/right encoding or mid/side encoding is selectable. Preferably, the selection between left/right stereo encoding and mid/side stereo encoding is time- and frequency-variant.
-
公开(公告)号:US09899033B2
公开(公告)日:2018-02-20
申请号:US15684079
申请日:2017-08-23
发明人: Zexin Liu , Lei Miao , Fengyan Qi
IPC分类号: H04L27/26 , G10L21/00 , G10L19/16 , G10L19/06 , H04L5/06 , G10L19/02 , G10L19/002 , G10L19/26
CPC分类号: G10L19/167 , G10L19/002 , G10L19/0204 , G10L19/06 , G10L19/26 , H04L5/06 , H04L27/2602
摘要: In a signal coding method, bits for coding allocated to different bands of a frequency domain signal obtained from an input signal are adjusted to improve the coding quality. The total available bits for coding are first allocated to the bands of the frequency domain signal according to a predetermined allocation rule. The numbers of bits allocated to the respective bands of the frequency domain signal are then adjusted when a highest frequency of the frequency domain signal to which bits are allocated is greater than a predetermined value. The frequency domain signal is coded according to the adjusted bit allocation for the bands of the frequency domain signal.
-
48.
公开(公告)号:US20180047401A1
公开(公告)日:2018-02-15
申请号:US15544465
申请日:2016-01-27
发明人: Takehiro MORIYA , Yutaka KAMAMOTO , Noboru HARADA , Takahito KAWANISHI , Hirokazu KAMEOKA , Ryosuke SUGIURA
IPC分类号: G10L19/06 , G10L19/12 , G10L19/032 , G10L19/002
CPC分类号: G10L19/06 , G10L19/002 , G10L19/032 , G10L19/12 , G10L19/22
摘要: An encoding apparatus is an encoding apparatus for encoding a time-series signal for each of predetermined time sections in a frequency domain, wherein a parameter η is a positive number, the parameter η corresponding to a time-series signal is a shape parameter of generalized Gaussian distribution that approximates a histogram of a whitened spectral sequence, which is a sequence obtained by dividing a frequency domain sample sequence corresponding to the time-series signal by a spectral envelope estimated by regarding the η-th power of absolute values of the frequency domain sample sequence as a power spectrum, and any of a plurality of parameters η is selective or the parameter η is variable for each of the predetermined time sections; and the encoding apparatus comprises an encoding portion encoding the time-series signal for each of the predetermined time sections by an encoding process with a configuration identified at least based on the parameter η for each of the predetermined time sections.
-
公开(公告)号:US20180040331A1
公开(公告)日:2018-02-08
申请号:US15784802
申请日:2017-10-16
发明人: Yang Gao
IPC分类号: G10L19/125 , G10L19/002 , G10L19/22 , G10L19/00
摘要: A method for processing speech signals prior to encoding a digital signal comprising audio data includes selecting frequency domain coding or time domain coding based on a coding bit rate to be used for coding the digital signal and a short pitch lag detection of the digital signal.
-
公开(公告)号:US09881624B2
公开(公告)日:2018-01-30
申请号:US14891515
申请日:2013-05-15
申请人: SAMSUNG ELECTRONICS CO., LTD. , KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
发明人: Ki-hyun Choo , Ho-chong Park , Eun-mi Oh
IPC分类号: H04S3/02 , G10L19/008 , H04R3/04 , G10L19/02 , G10L19/002 , G10L25/18 , G10L19/00
CPC分类号: G10L19/0204 , G10L19/002 , G10L25/18 , G10L2019/0002
摘要: A method is provided. The method includes obtaining a low-band spectrum of an audio signal in which a low-band signal is frequency transformed; obtaining phase information of a high-band spectrum of the audio signal based on the low-band spectrum; and outputting a bitstream that comprises the phase information of the high-band spectrum.
-
-
-
-
-
-
-
-
-