-
公开(公告)号:US11929089B2
公开(公告)日:2024-03-12
申请号:US16176280
申请日:2018-10-31
发明人: Christian Uhle , Michael Kratz , Paul Klose , Timothy Leonard , André Luvizotto , Sebastian Scharrer
IPC分类号: G10L21/043 , G10L19/008 , G10L19/02 , G10L21/04 , G11B27/00 , G11B27/28
CPC分类号: G10L21/043 , G10L19/008 , G10L19/02 , G10L21/04 , G11B27/005 , G11B27/28
摘要: An apparatus for processing a multichannel audio signal has a plurality of channel signals. The apparatus performs a time scale modulation of the multichannel audio signal and has a phase adaptor and a separator. The phase adaptor provides a processed signal by modifying a phase of a signal based on a combination of the channel signals. The separator provides separated signals based on the processed signal. A corresponding method is provided.
-
公开(公告)号:US20180218748A1
公开(公告)日:2018-08-02
申请号:US15913197
申请日:2018-03-06
申请人: TiVo Solutions Inc.
发明人: Robert Watts
IPC分类号: G10L21/043 , G10L19/16 , G06F3/16 , G10L15/02
CPC分类号: G10L21/043 , G06F3/165 , G10L15/02 , G10L19/167 , G10L2015/027 , H04L65/4084 , H04L65/605 , H04L65/607 , H04N21/23805 , H04N21/647 , H04N21/8106
摘要: Input media data with an input playing speed is received and divided into input media data subsets. A first rate of audio utterance is determined for a first input media data subset in the media data subsets. A second different rate of audio utterance is determined for a second input media data subset in the media data subsets. Audio output media data is generated with an output playing speed at which audio utterance in the audio output media data is played at a preferred rate of audio utterance. The audio output media data comprises (a) a first output audio media data subset generated based on the preferred rate, the first rate, and the first input media data subset and (b) a second output audio media data subset generated based on the preferred rate, the second rate, and the second input media data subset.
-
公开(公告)号:US10026417B2
公开(公告)日:2018-07-17
申请号:US15136278
申请日:2016-04-22
申请人: OpenTV, Inc.
发明人: Kai Sun
IPC分类号: G10L21/04 , G10L21/043 , G10L25/57 , G10L25/78 , G10L15/25 , H04N21/2387 , H04N21/472 , H04N21/233 , H04N21/439
CPC分类号: G10L21/043 , G10L15/25 , G10L25/57 , G10L25/78 , H04N21/233 , H04N21/23418 , H04N21/234381 , H04N21/2387 , H04N21/439 , H04N21/44008 , H04N21/440281 , H04N21/47217 , H04N21/8456
摘要: Example embodiments provide systems and methods for accelerating digital content playback based on speech. A content acceleration system electronically accesses digital content. The system analyzes the digital content to detect at least one audio portion within the digital content, each of the at least one audio portion comprising speech. The system creates at least one digital content segment from the digital content based on the at least one audio portion, whereby a beginning of each digital content segment of the at least one digital content segment coincides with a beginning of a corresponding audio portion of the at least one audio portion. The system then accelerates playback of the digital content by fast forwarding through parts of the at least one digital content segment where speech is absent.
-
4.
公开(公告)号:US20180061437A1
公开(公告)日:2018-03-01
申请号:US15720498
申请日:2017-09-29
申请人: Google Inc.
发明人: Erik Kay , Jonas Erik Lindberg , Serge Lachapelle , Henrik Lundin
IPC分类号: G10L21/043 , G10L21/0208 , H04L29/06 , G10L15/20 , G10L25/78
CPC分类号: G10L21/043 , G10L15/20 , G10L19/005 , G10L21/0208 , G10L21/0232 , G10L25/78 , H04L65/1069 , H04L65/80 , H04L67/141
摘要: A computer-implemented technique can include establishing an audio communication session between first and second computing devices and obtaining, by the first computing device, an audio input signal using audio data captured by a microphone. The first computing device can analyze the audio input signal to detect a speech input by its first user and can determine a duration of a detection period from when the audio input signal was obtained until the analyzing has completed. The first computing device can then transmit, to the second computing device, (i) a portion of the audio input signal beginning at a start of the speech input and (ii) the detection period duration, wherein receipt of the portion of the audio input signal and the detection period duration causes the second computing device to accelerate playback of the portion of the audio input signal to compensate for the detection period duration.
-
公开(公告)号:US09813689B2
公开(公告)日:2017-11-07
申请号:US14571458
申请日:2014-12-16
申请人: THOMSON LICENSING
发明人: Cyril Quinquis
IPC分类号: H04N5/92 , H04N5/89 , H04N9/80 , H04N5/93 , G11B27/00 , H04N9/806 , H04N9/804 , H04N5/85 , G11B27/10 , G10L21/043 , H04N5/78 , G06F17/00
CPC分类号: H04N9/806 , G10L21/043 , G11B27/005 , G11B27/105 , H04N5/85 , H04N9/8042
摘要: The present invention relates to an audio content restitution method in a receiver of audio and/or audiovisual content, the receiver being adapted to the restitution of the audio content, the audio content being received encoded and containing a succession of frames of audio samples and pointer type information on at least one portion of the audio samples of the frames.According to a particular embodiment, the audio content restitution method comprises: a selection of audio samples from the frames, the selected audio samples being identified from the pointer type information; a restitution of the only samples selected.
-
公开(公告)号:US09767825B2
公开(公告)日:2017-09-19
申请号:US15374455
申请日:2016-12-09
申请人: TiVo Solutions Inc.
发明人: Robert Watts
IPC分类号: G06F17/00 , G10L21/043 , G06F3/16
CPC分类号: G10L21/043 , G06F3/165
摘要: Input media data with an input playing speed is received. One or more user identities are identified based at least in part on biometric data collected from one or more users who correspond to the one or more user identities and to whom audio utterance derived from the input media data is to be played. A preferred rate of audio utterance is determined based at least in part on the one or more user identities. A rate of audio utterance is determined for a portion of the input media data. Based at least in part on the preferred rate of audio utterance and the rate of audio utterance, a portion of audio output media data is generated with an output playing speed at which audio utterance in the portion of audio output media data is rendered with the preferred rate of audio utterance.
-
公开(公告)号:US20170238026A1
公开(公告)日:2017-08-17
申请号:US15041740
申请日:2016-02-11
发明人: Amit Kumar Agrawal
IPC分类号: H04N21/2387 , G10L21/043 , G06F3/16 , G10L21/057 , G06F3/0484 , G06F3/0488 , H04N21/61 , H04N21/44 , H04N21/472 , H04N21/442 , H04N21/422 , H04N21/439 , H04N21/84 , H04N21/45 , H04N21/258 , G10L15/00
CPC分类号: H04N21/2387 , G06F3/04847 , G06F3/0488 , G06F3/165 , G10L15/005 , G10L21/043 , G10L21/057 , H04N21/25883 , H04N21/42203 , H04N21/4394 , H04N21/44008 , H04N21/442 , H04N21/4532 , H04N21/47217 , H04N21/6125 , H04N21/84
摘要: A method, a system, and a computer program product for providing media to a requester at a particular playback rate associated with the requester. The method includes receiving a request from a requester for a playback session of media that includes a time varying content. In response to receiving the request, a profile associated with the requester is accessed to determine a playback rate of the media for the requester. In response to determining the playback rate of the media for the requester, the media is provided to the requester at the determined playback rate. The method further includes monitoring the playback session of the media for playback changes by the requester and dynamically adapting the playback rate associated with the requester based on the type and frequency of playback changes.
-
公开(公告)号:US09715540B2
公开(公告)日:2017-07-25
申请号:US12822802
申请日:2010-06-24
申请人: Om D. Deshmukh , Nitendra Rajput
发明人: Om D. Deshmukh , Nitendra Rajput
CPC分类号: G06F17/30775 , G06F17/30743 , G10L15/00 , G10L21/043 , H04M3/4938
摘要: Systems and associated methods configured to provide user-driven audio content navigation for the spoken web are described. Embodiments allow users to skim audio for content that seems to be of relevance to the user, similar to visual skimming of standard web pages, and mark point of interest within the audio. Embodiments provide techniques for navigating audio content while interacting with information systems in a client-server environment, where the client device can be a simple, standard telephone.
-
9.
公开(公告)号:US09324336B2
公开(公告)日:2016-04-26
申请号:US14353044
申请日:2012-10-22
申请人: LG Electronics Inc.
发明人: Ingyu Kang , Younghan Lee , Gyuhyeok Jeong , Hyejeong Jeon , Lagyoung Kim
IPC分类号: G10L19/26 , G10L19/00 , H04J3/06 , G10L21/043 , G10L19/16
CPC分类号: G10L19/265 , G10L19/00 , G10L19/167 , G10L21/043 , H04J3/0632
摘要: The present invention relates to a method of managing a jitter buffer and a jitter buffer using same. The method of managing a jitter buffer includes the steps of: receiving audio information frames; and adjusting a jitter buffer on the basis of the received audio information frames, wherein the adjusting step of the jitter buffer includes compensation of an audio signal, and the compensation of the audio signal can be performed for each sub frame of the audio information frames.
摘要翻译: 本发明涉及一种管理抖动缓冲器和使用其的抖动缓冲器的方法。 管理抖动缓冲器的方法包括以下步骤:接收音频信息帧; 以及基于接收到的音频信息帧调整抖动缓冲器,其中抖动缓冲器的调整步骤包括对音频信号的补偿,并且可以对音频信息帧的每个子帧执行音频信号的补偿。
-
公开(公告)号:US20150066493A1
公开(公告)日:2015-03-05
申请号:US14538728
申请日:2014-11-11
发明人: Stefan BAYER , Sascha DISCH , Ralf GEIGER , Guillaume FUCHS , Max NEUENDORF , Gerald SCHULLER , Bernd EDLER
IPC分类号: G10L19/028 , G10L21/04
CPC分类号: G10L21/04 , G10L19/002 , G10L19/0212 , G10L19/022 , G10L19/028 , G10L19/032 , G10L19/265 , G10L21/043 , G10L25/90
摘要: An audio encoder has a window function controller, a windower, a time warper with a final quality check functionality, a time/frequency converter, a TNS stage or a quantizer encoder, the window function controller, the time warper, the TNS stage or an additional noise filling analyzer are controlled by signal analysis results obtained by a time warp analyzer or a signal classifier. Furthermore, a decoder applies a noise filling operation using a manipulated noise filling estimate depending on a harmonic or speech characteristic of the audio signal.
-
-
-
-
-
-
-
-
-