Reconstruction of missing coefficients of overcomplete linear transforms using projections onto convex sets
    81.
    发明授权
    Reconstruction of missing coefficients of overcomplete linear transforms using projections onto convex sets 失效
    使用投影到凸集上重建缺失的完全线性变换系数

    公开(公告)号:US06470469B1

    公开(公告)日:2002-10-22

    申请号:US09276842

    申请日:1999-03-26

    IPC分类号: H03M1300

    摘要: A projection onto convex sets (POCS)-based method for consistent reconstruction of a signal from a subset of quantized coefficients received from an N×K overcomplete transform. By choosing a frame operator F to be the concatenization of two or more K×K invertible transforms, the POCS projections are calculated in RK space using only the K×K transforms and their inverses, rather than the larger RN space using pseudo inverse transforms. Practical reconstructions are enabled based on, for example, wavelet, subband, or lapped transforms of an entire image. In one embodiment, unequal error protection for multiple description source coding is provided. In particular, given a bit-plane representation of the coefficients in an overcomplete representation of the source, one embodiment of the present invention provides coding the most significant bits with the highest redundancy and the least significant bits with the lowest redundancy. In one embodiment, this is accomplished by varying the quantization stepsize for the different coefficients. Then, the available received quantized coefficients are decoded using a method based on alternating projections onto convex sets.

    摘要翻译: 基于凸集(POCS)的方法的投影,用于从从NxK过完全变换接收的量化系数的子集的信号的一致重构。 通过选择一个帧运算符F作为两个或多个KxK可逆变换的并置,POCS投影在RK空间中仅使用KxK变换及其反转而不是使用伪逆变换的较大的RN空间来计算。 基于例如整个图像的小波,子带或重叠变换来实现实际重建。 在一个实施例中,提供了用于多描述源编码的不等差错保护。 特别地,给定源的过完整表示中的系数的位平面表示,本发明的一个实施例提供了具有最高冗余度的最高有效位和具有最低冗余度的最低有效位的编码。 在一个实施例中,这通过改变不同系数的量化步长来实现。 然后,使用基于在凸集上的交替投影的方法对可用的接收量化系数进行解码。

    Interleaved multiple multimedia stream for synchronized transmission over a computer network
    82.
    发明授权
    Interleaved multiple multimedia stream for synchronized transmission over a computer network 失效
    交织多个多媒体流,用于通过计算机网络进行同步传输

    公开(公告)号:US06449653B2

    公开(公告)日:2002-09-10

    申请号:US08826345

    申请日:1997-03-25

    IPC分类号: G06F1516

    摘要: The production of an interleaved multimedia stream for servers and client computers coupled to each other by a diverse computer network which includes local area networks (LANs) and/or wide area networks (WANs) such as the internet. Interleaved multimedia streams can include compressed video frames for display in a video window, accompanying compressed audio frames and annotation frames. In one embodiment, a producer captures separate video/audio frames and generates an interleaved multimedia file. In another embodiment, the interleaved file include annotation frames which provide either pointer(s) to the event(s) of interest or include displayable data embedded within the annotation stream. The interleaved file is then stored in the web server for subsequent retrieval by client computer(s) in a coordinated manner, so that the client computer(s) is able to synchronously display the video frames and displayable event(s) in a video window and event window(s), respectively. In some embodiments, the interleaved file includes packets with variable length fields, each of which are at least one numerical unit in length.

    摘要翻译: 通过包括诸如互联网的局域网(LAN)和/或广域网(WAN))的不同计算机网络来产生用于服务器和客户端计算机的交错多媒体流。 交错多媒体流可以包括用于在视频窗口中显示的压缩视频帧,伴随压缩音频帧和注释帧。 在一个实施例中,制作者捕获单独的视频/音频帧并产生交织的多媒体文件。 在另一个实施例中,交织的文件包括向感兴趣的事件提供指针或者包括嵌入注释流内的可显示数据的注释帧。 然后将交织的文件存储在网络服务器中,以便由客户端计算机以协调的方式随后检索,使得客户端计算机能够在视频窗口中同步显示视频帧和可显示的事件 和事件窗口。 在一些实施例中,交织文件包括具有可变长度字段的分组,每个长度字段的长度至少为一个数字单位。

    Document image decoding using modified branch-and-bound methods
    83.
    发明授权
    Document image decoding using modified branch-and-bound methods 失效
    使用修改的分支和绑定方法的文档图像解码

    公开(公告)号:US5526444A

    公开(公告)日:1996-06-11

    申请号:US60196

    申请日:1993-05-07

    摘要: An image decoding and recognition system and method comprising a fast heuristic algorithm using hidden Markov models (HMM). The new search algorithm, called an "iterative complete path" (ICP) algorithm, patterned after well-known branch-and-bound (B&B) methods, significantly reduces the complexity and improves the speed of HMM image decoding without sacrificing the optimality of the straightforward procedure. An advantageous form of the heuristic functions which is useful in applying the ICP algorithm to text-like images is described. The ICP algorithm is directly applicable to the separable type of finite-state source models. Also disclosed is a technique for transforming more general source models into such a separable form.

    摘要翻译: 一种使用隐马尔可夫模型(HMM)的快速启发式算法的图像解码识别系统和方法。 称为“迭代完整路径”(ICP)算法的新型搜索算法,在公知的分支绑定(B&B)方法之后进行图案化,显着降低了复杂度并提高了HMM图像解码的速度,而不牺牲 简单的程序。 描述了将ICP算法应用于文字图像中有用的启发式函数的有利形式。 ICP算法直接适用于可分离类型的有限状态源模型。 还公开了一种用于将更一般的源模型转换成这种可分离形式的技术。

    Non-linguistic signal detection and feedback
    84.
    发明授权
    Non-linguistic signal detection and feedback 有权
    非语言信号检测和反馈

    公开(公告)号:US08963987B2

    公开(公告)日:2015-02-24

    申请号:US12789142

    申请日:2010-05-27

    IPC分类号: H04N7/14 H04N7/15

    CPC分类号: H04N7/15 H04N7/147

    摘要: Non-linguistic signal information relating to one or more participants to an interaction may be determined using communication data received from the one or more participants. Feedback can be provided based on the determined non-linguistic signals. The participants may be given an opportunity to opt in to having their non-linguistic signal information collected, and may be provided complete control over how their information is shared or used.

    摘要翻译: 可以使用从一个或多个参与者接收的通信数据来确定与交互的一个或多个参与者有关的非语言信号信息。 可以基于确定的非语言信号提供反馈。 参与者可能有机会选择收集其非语言信号信息,并且可以完全控制他们的信息如何共享或使用。

    Techniques for managing visual compositions for a multimedia conference call
    85.
    发明授权
    Techniques for managing visual compositions for a multimedia conference call 有权
    用于管理多媒体电话会议的视觉作品的技术

    公开(公告)号:US08773494B2

    公开(公告)日:2014-07-08

    申请号:US11511749

    申请日:2006-08-29

    IPC分类号: H04N7/14

    摘要: Techniques for managing visual compositions for a multimedia conference call are described. An apparatus may comprise a processor to allocate a display object bit rate for multiple display objects where a total display object bit rate for all display objects is equal to or less than a total input bit rate, and decode video information from multiple video streams each having different video layers with different levels of spatial resolution, temporal resolution and quality for two or more display objects. Other embodiments are described and claimed.

    摘要翻译: 描述用于管理多媒体电话会议的视觉作品的技术。 设备可以包括处理器,用于为多个显示对象分配显示对象比特率,其中所有显示对象的总显示对象比特率等于或小于总输入比特率,并且从多个视频流解码视频信息,每个视频流具有 具有不同级别的空间分辨率,时间分辨率和两个或多个显示对象的质量的不同视频层。 描述和要求保护其他实施例。

    Immersive remote conferencing
    86.
    发明授权
    Immersive remote conferencing 有权
    沉浸式远程会议

    公开(公告)号:US08675067B2

    公开(公告)日:2014-03-18

    申请号:US13100504

    申请日:2011-05-04

    IPC分类号: H04N7/18

    摘要: The subject disclosure is directed towards an immersive conference, in which participants in separate locations are brought together into a common virtual environment (scene), such that they appear to each other to be in a common space, with geometry, appearance, and real-time natural interaction (e.g., gestures) preserved. In one aspect, depth data and video data are processed to place remote participants in the common scene from the first person point of view of a local participant. Sound data may be spatially controlled, and parallax computed to provide a realistic experience. The scene may be augmented with various data, videos and other effects/animations.

    摘要翻译: 本发明涉及一种身临其境的会议,其中分开的位置的参与者被聚集到一个共同的虚拟环境(场景)中,使得它们彼此看起来处于共同的空间中,具有几何,外观和实时性, 保留时间自然的相互作用(如手势)。 在一个方面,深度数据和视频数据被处理以将远程参与者从本地参与者的第一人的角度放置在公共场景中。 声音数据可以是空间控制的,并且计算视差以提供真实的体验。 场景可能会增加各种数据,视频和其他效果/动画。

    Forward error correction for media transmission
    87.
    发明授权
    Forward error correction for media transmission 有权
    媒体传输的前向纠错

    公开(公告)号:US08553757B2

    公开(公告)日:2013-10-08

    申请号:US11675047

    申请日:2007-02-14

    IPC分类号: H04L12/66 H04L12/28

    摘要: A “Media Transmission Optimizer” provides a media transmission optimization framework for lossy or bursty networks such as the Internet. This optimization framework provides a novel form of dynamic Forward Error Correction (FEC) that focuses on the perceived quality of a recovered media signal rather than on the absolute accuracy of the recovered media signal. In general, the Media Transmission Optimizer provides an encoder that optimizes the transmission of redundant frames of electronic media information encoded at different bit rates, and provides optimized playback quality by providing a decoder that automatically selects an optimal path through one or more available representations of each frame as a function of overall rate/distortion criteria.

    摘要翻译: “媒体传输优化器”为诸如因特网之类的有损或突发的网络提供媒体传输优化框架。 该优化框架提供了一种新型的动态前向纠错(FEC),其侧重于恢复的媒体信号的感知质量,而不是恢复的媒体信号的绝对精度。 通常,媒体传输优化器提供了一种编码器,其优化以不同比特率编码的电子媒体信息的冗余帧的传输,并且通过提供自动选择通过每个的一个或多个可用表示的最佳路径的解码器来提供优化的播放质量 帧作为总体速率/失真标准的函数。

    Multi-device capture and spatial browsing of conferences
    88.
    发明授权
    Multi-device capture and spatial browsing of conferences 有权
    会议的多设备捕获和空间浏览

    公开(公告)号:US08537196B2

    公开(公告)日:2013-09-17

    申请号:US12245774

    申请日:2008-10-06

    IPC分类号: H04N7/14 G06F15/16 G06F3/48

    CPC分类号: H04N7/157 H04N7/147

    摘要: Multi-device capture and spatial browsing of conferences is described. In one implementation, a system detects cameras and microphones, such as the webcams on participants' notebook computers, in a conference room, group meeting, or table game, and enlists an ad-hoc array of available devices to capture each participant and the spatial relationships between participants. A video stream composited from the array is browsable by a user to navigate a 3-dimensional representation of the meeting. Each participant may be represented by a video pane, a foreground object, or a 3-D geometric model of the participant's face or body displayed in spatial relation to the other participants in a 3-dimensional arrangement analogous to the spatial arrangement of the meeting. The system may automatically re-orient the 3-dimensional representation as needed to best show the currently interesting event such as current speaker or may extend navigation controls to a user for manually viewing selected participants or nuanced interactions between participants.

    摘要翻译: 描述会议的多设备捕获和空间浏览。 在一个实现中,系统检测相机和麦克风,例如参与者的笔记本计算机上的网络摄像机,会议室,组会议或桌面游戏,并且招募可用设备的特设阵列以捕获每个参与者和空间 参与者之间的关系。 从阵列合成的视频流可由用户浏览以浏览会议的三维表示。 每个参与者可以以类似于会议的空间安排的三维布置的视频窗格,前景对象或与其他参与者以空间关系显示的三维几何模型来表示。 该系统可以根据需要自动重新定向三维表示,以最佳地显示当前有趣的事件,例如当前的扬声器,或者可以将导航控件扩展到用户,以便手动地观看选定的参与者或参与者之间微妙的交互。

    REAL-TIME JITTER CONTROL AND PACKET-LOSS CONCEALMENT IN AN AUDIO SIGNAL
    89.
    发明申请
    REAL-TIME JITTER CONTROL AND PACKET-LOSS CONCEALMENT IN AN AUDIO SIGNAL 审中-公开
    音频信号中的实时抖动控制和分组丢失隐藏

    公开(公告)号:US20090304032A1

    公开(公告)日:2009-12-10

    申请号:US12542558

    申请日:2009-08-17

    IPC分类号: H04J3/06

    摘要: An “adaptive audio playback controller” operates by decoding and reading received packets of an audio signal into a signal buffer. Samples of the decoded audio signal are then played out of the signal buffer according to the needs of a player device. Jitter control and packet loss concealment are accomplished by continuously analyzing buffer content in real-time, and determining whether to provide unmodified playback from the buffer contents, whether to compress buffer content, stretch buffer content, or whether to provide for packet loss concealment for overly delayed or lost packets as a function of buffer content. Further, the adaptive audio playback controller also determines where to stretch or compress particular frames or signal segments in the signal buffer, and how much to stretch or compress such segments in order to optimize perceived playback quality.

    摘要翻译: “自适应音频播放控制器”通过将音频信号的接收分组解码并读取到信号缓冲器来进行操作。 然后根据播放器设备的需要从信号缓冲器中播放经解码的音频信号的样本。 抖动控制和分组丢失隐藏是通过实时连续分析缓冲区内容来实现的,并且确定是否从缓冲器内容中提供未修改的重放,是否压缩缓冲区内容,扩展缓冲区内容,还是提供丢包隐藏 延迟或丢失的数据包作为缓冲区内容的函数。 此外,自适应音频重放控制器还确定在哪里拉伸或压缩信号缓冲器中的特定帧或信号段,以及拉伸或压缩这些段以便优化感知的播放质量。

    EXTERNAL DATA ACCESS INFORMATION IN A VOIP CONVERSATION
    90.
    发明申请
    EXTERNAL DATA ACCESS INFORMATION IN A VOIP CONVERSATION 审中-公开
    VOIP对话中的外部数据访问信息

    公开(公告)号:US20080117897A1

    公开(公告)日:2008-05-22

    申请号:US11562935

    申请日:2006-11-22

    IPC分类号: H04L12/56

    CPC分类号: H04L65/4023

    摘要: A method and system provides the ability to share access information for external data over a digital voice communication channel. The access information of external data may be exchanged instead of the external data itself. More specifically, a recipient device may receive contextual information which relates to the access information of external data. The contextual information may be processed to identify the source of the external data and other information necessary to access the external data. For example, a hyperlink directed to the external data in a Web server may be exchanged while the recipient device and the sending device are involved in a digital conversation. The recipient device can access the external data by activating the hyperlink.

    摘要翻译: 方法和系统提供通过数字语音通信信道共享外部数据的访问信息的能力。 可以交换外部数据的访问信息而不是外部数据本身。 更具体地,接收者设备可以接收与外部数据的访问信息有关的上下文信息。 可以处理上下文信息以识别外部数据的源和访问外部数据所需的其他信息。 例如,可以在接收设备和发送设备参与数字会话期间交换针对Web服务器中的外部数据的超链接。 收件人设备可以通过激活超链接来访问外部数据。