REAL-TIME JITTER CONTROL AND PACKET-LOSS CONCEALMENT IN AN AUDIO SIGNAL
    1.
    发明申请
    REAL-TIME JITTER CONTROL AND PACKET-LOSS CONCEALMENT IN AN AUDIO SIGNAL 审中-公开
    音频信号中的实时抖动控制和分组丢失隐藏

    公开(公告)号:US20090304032A1

    公开(公告)日:2009-12-10

    申请号:US12542558

    申请日:2009-08-17

    IPC分类号: H04J3/06

    摘要: An “adaptive audio playback controller” operates by decoding and reading received packets of an audio signal into a signal buffer. Samples of the decoded audio signal are then played out of the signal buffer according to the needs of a player device. Jitter control and packet loss concealment are accomplished by continuously analyzing buffer content in real-time, and determining whether to provide unmodified playback from the buffer contents, whether to compress buffer content, stretch buffer content, or whether to provide for packet loss concealment for overly delayed or lost packets as a function of buffer content. Further, the adaptive audio playback controller also determines where to stretch or compress particular frames or signal segments in the signal buffer, and how much to stretch or compress such segments in order to optimize perceived playback quality.

    摘要翻译: “自适应音频播放控制器”通过将音频信号的接收分组解码并读取到信号缓冲器来进行操作。 然后根据播放器设备的需要从信号缓冲器中播放经解码的音频信号的样本。 抖动控制和分组丢失隐藏是通过实时连续分析缓冲区内容来实现的,并且确定是否从缓冲器内容中提供未修改的重放,是否压缩缓冲区内容,扩展缓冲区内容,还是提供丢包隐藏 延迟或丢失的数据包作为缓冲区内容的函数。 此外,自适应音频重放控制器还确定在哪里拉伸或压缩信号缓冲器中的特定帧或信号段,以及拉伸或压缩这些段以便优化感知的播放质量。

    Real-time detection and preservation of speech onset in a signal
    4.
    发明授权
    Real-time detection and preservation of speech onset in a signal 有权
    在信号中实时检测和保存言语发生

    公开(公告)号:US07917357B2

    公开(公告)日:2011-03-29

    申请号:US12181159

    申请日:2008-07-28

    IPC分类号: G10L11/02 G10L21/04

    CPC分类号: G10L25/87 G10L2025/783

    摘要: A “speech onset detector” provides a variable length frame buffer in combination with either variable transmission rate or temporal speech compression for buffered signal frames. The variable length buffer buffers frames that are not clearly identified as either speech or non-speech frames during an initial analysis. Buffering of signal frames continues until a current frame is identified as either speech or non-speech. If the current frame is identified as non-speech, buffered frames are encoded as non-speech frames. However, if the current frame is identified as a speech frame, buffered frames are searched for the actual onset point of the speech. Once that onset point is identified, the signal is either transmitted in a burst, or a time-scale modification of the buffered signal is applied for compressing buffered frames beginning with the frame in which onset point is detected. The compressed frames are then encoded as one or more speech frames.

    摘要翻译: “语音起始检测器”提供了可变长度帧缓冲器,与缓冲信号帧的可变传输速率或时间语音压缩相结合。 可变长度缓冲器缓冲在初始分析期间未被清楚地识别为语音或非语音帧的帧。 信号帧的缓冲持续到当前帧被识别为语音或非语音。 如果当前帧被识别为非语音,则缓冲帧被编码为非语音帧。 然而,如果当前帧被识别为语音帧,则搜索缓冲的帧用于语音的实际起始点。 一旦该起始点被识别,则信号以突发方式发送,或者缓冲信号的时间尺度修改被应用于从检测到起始点的帧开始的缓冲帧。 然后将压缩的帧编码为一个或多个语音帧。

    ENERGY-BASED SOUND SOURCE LOCALIZATION AND GAIN NORMALIZATION
    5.
    发明申请
    ENERGY-BASED SOUND SOURCE LOCALIZATION AND GAIN NORMALIZATION 有权
    基于能量的声源定位和增益正规化

    公开(公告)号:US20080170717A1

    公开(公告)日:2008-07-17

    申请号:US11623643

    申请日:2007-01-16

    IPC分类号: H04R3/00

    摘要: An energy based technique to estimate the positions of people speaking from an ad hoc network of microphones. The present technique does not require accurate synchronization of the microphones. In addition, a technique to normalize the gains of the microphones based on people's speech is presented, which allows aggregation of various audio channels from the ad hoc microphone network into a single stream for audio conferencing. The technique is invariant of the speaker's volumes thus making the system easy to deploy in practice.

    摘要翻译: 一种基于能量的技术来估计从麦克风的自组织网络发言的人的位置。 本技术不需要麦克风的准确同步。 此外,提出了一种基于人们的语音来归一化麦克风的增益的技术,其允许将各种音频频道从专用麦克风网络聚合成用于音频会议的单个流。 该技术是扬声器音量不变的,从而使得系统在实践中容易部署。

    Collaborative Media Recommendation and Sharing Technique
    6.
    发明申请
    Collaborative Media Recommendation and Sharing Technique 有权
    协作媒体推荐与分享技术

    公开(公告)号:US20090055377A1

    公开(公告)日:2009-02-26

    申请号:US11843129

    申请日:2007-08-22

    IPC分类号: G06F7/06 G06F15/16 G06F17/30

    CPC分类号: G06F17/30029 G06F17/30053

    摘要: A media recommendation and sharing technique that employs agents on media players/devices to expand the scope of media sharing scenarios. The technique assists a user in discovering media items, such as, for example, music, recordings, play lists, pictures, video games, on nearby media players or devices (devices which are capable of receiving, storing and playing media) which are interesting to the user. The collaborative media recommendation and sharing technique contemporaneously determines a user's media preferences based on media stored on a pair of media devices and recommends media for potential sharing based on these determined user preferences.

    摘要翻译: 媒体推荐和分享技术,在媒体播放器/设备上使用代理扩展媒体共享场景的范围。 该技术帮助用户发现媒体项目,例如音乐,录音,播放列表,图片,视频游戏,附近的媒体播放器或设备(能够接收,存储和播放媒体的设备),这是有趣的 给用户 合作媒体推荐和共享技术同时基于存储在一对媒体设备上的媒体同时确定用户的媒体偏好,并且基于这些确定的用户偏好推荐媒体进行潜在共享。

    Energy-based sound source localization and gain normalization
    7.
    发明授权
    Energy-based sound source localization and gain normalization 有权
    基于能量的声源定位和增益归一化

    公开(公告)号:US07924655B2

    公开(公告)日:2011-04-12

    申请号:US11623643

    申请日:2007-01-16

    IPC分类号: H04R5/02

    摘要: An energy based technique to estimate the positions of people speaking from an ad hoc network of microphones. The present technique does not require accurate synchronization of the microphones. In addition, a technique to normalize the gains of the microphones based on people's speech is presented, which allows aggregation of various audio channels from the ad hoc microphone network into a single stream for audio conferencing. The technique is invariant of the speaker's volumes thus making the system easy to deploy in practice.

    摘要翻译: 一种基于能量的技术来估计从麦克风的自组织网络发言的人的位置。 本技术不需要麦克风的准确同步。 此外,提出了一种基于人们的语音来归一化麦克风的增益的技术,其允许将各种音频频道从专用麦克风网络聚合成用于音频会议的单个流。 该技术是扬声器音量不变的,从而使得系统在实践中容易部署。

    Management of split audio/video streams
    9.
    发明授权
    Management of split audio/video streams 有权
    分割音频/视频流的管理

    公开(公告)号:US08276195B2

    公开(公告)日:2012-09-25

    申请号:US11968194

    申请日:2008-01-02

    IPC分类号: H04L9/32 H04N7/167 G06F7/04

    CPC分类号: G06F21/6209

    摘要: Described herein is a method that includes receiving multiple requests for access to an exposed media object, wherein the exposed media object represents a live media stream that is being generated by a media source. The method also includes receiving data associated with each entity that provided a request, and determining, for each entity, whether the entities that provided the request are authorized to access the media stream based at least in part upon the received data and splitting the media stream into multiple media streams, wherein a number of media streams corresponds to a number of authorized entities. The method also includes automatically applying at least one policy to at least one of the split media streams based at least in part upon the received data.

    摘要翻译: 这里描述的方法包括接收对暴露的媒体对象的访问的多个请求,其中所述暴露的媒体对象表示正由媒体源生成的实况媒体流。 该方法还包括接收与提供请求的每个实体相关联的数据,以及为每个实体确定提供该请求的实体是否被授权至少部分地基于所接收的数据和分割媒体流来访问媒体流 转换成多个媒体流,其中多个媒体流对应于多个授权实体。 该方法还包括至少部分地基于所接收的数据自动地将至少一个策略应用于至少一个分离媒体流。