Techniques for managing visual compositions for a multimedia conference call
    91.
    Patent application
    Techniques for managing visual compositions for a multimedia conference call (legal status: in force)

    Publication No.: US20080068446A1

    Publication date: 2008-03-20

    Application No.: US11511749

    Filing date: 2006-08-29

    IPC classification: H04N7/14

    Abstract: Techniques for managing visual compositions for a multimedia conference call are described. An apparatus may comprise a processor to allocate a display object bit rate for multiple display objects where a total display object bit rate for all display objects is equal to or less than a total input bit rate, and decode video information from multiple video streams each having different video layers with different levels of spatial resolution, temporal resolution and quality for two or more display objects. Other embodiments are described and claimed.
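
The allocation constraint in the abstract above (per-object bit rates that sum to no more than the total input bit rate) can be illustrated with a minimal sketch. The proportional weighting, the layer table, and the names `allocate_bit_rates`, `pick_layer` and `LAYERS` are assumptions made for illustration; the abstract does not say how the rates are chosen.

```python
from typing import Dict, List, Optional, Tuple


def allocate_bit_rates(total_input_kbps: int, weights: Dict[str, int]) -> Dict[str, int]:
    """Split the input budget across display objects in proportion to their weights.

    Integer floor division guarantees the shares never sum to more than the budget.
    """
    total_weight = sum(weights.values())
    return {name: total_input_kbps * w // total_weight for name, w in weights.items()}


# Hypothetical scalable stream: cumulative bit rate (kbps) needed to decode up to each layer.
LAYERS: List[Tuple[str, int]] = [
    ("base layer", 100),
    ("base + spatial enhancement", 300),
    ("base + spatial + temporal enhancement", 600),
]


def pick_layer(budget_kbps: int) -> Optional[str]:
    """Return the highest layer whose cumulative rate fits the budget, or None."""
    chosen = None
    for name, rate in LAYERS:
        if rate <= budget_kbps:
            chosen = name
    return chosen


if __name__ == "__main__":
    shares = allocate_bit_rates(1000, {"active speaker": 3, "participant A": 1, "participant B": 1})
    for obj, kbps in shares.items():
        print(obj, kbps, pick_layer(kbps))
```

Giving the active speaker a larger weight lets that display object decode more enhancement layers while the others fall back to the base layer.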


    LOCAL NETWORK CODING FOR WIRELESS NETWORKS
    92.
    Patent application
    LOCAL NETWORK CODING FOR WIRELESS NETWORKS (legal status: in force)

    Publication No.: US20070274324A1

    Publication date: 2007-11-29

    Application No.: US11420616

    Filing date: 2006-05-26

    IPC classification: H04L12/56

    Abstract: A local network coding framework and method including techniques to improve efficiency in a wireless network by reducing overhead. The local network coding method includes exchanging data availability between nodes on the wireless network by sending Bloom filters of lists of packets to neighboring nodes. Based on data availability, optimized mixing of pure packets is performed to form mixture packets for output. A separate acknowledgement buffer keeps track of the pure packets transmitted but not acknowledged. If an acknowledgement does not arrive after a certain time period, the packet is assumed to be lost and is retransmitted. An optimized packet mixing process generates mixture packets and decides which nodes to send the mixture packets to. The local network coding framework and method also include methods for representing the composition of a mixture packet and using mixing at a wireless access point to improve the performance of the wireless local area network.
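
A minimal sketch of forming and decoding a mixture packet, based on the abstract above. XOR is assumed as the mixing operation and plain Python sets stand in for the Bloom filters; the abstract specifies neither, and all function names are hypothetical.

```python
from functools import reduce
from typing import Dict, Set


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """XOR two equal-length payloads."""
    return bytes(x ^ y for x, y in zip(a, b))


def mix(packets: Dict[int, bytes], ids: Set[int]) -> bytes:
    """Form a mixture packet from the pure packets with the given IDs."""
    return reduce(xor_bytes, (packets[i] for i in sorted(ids)))


def decodable(ids: Set[int], neighbor_has: Set[int]) -> bool:
    """A neighbor can decode if it is missing exactly one pure packet of the mixture."""
    return len(ids - neighbor_has) == 1


def decode(mixture: bytes, ids: Set[int], known: Dict[int, bytes]) -> bytes:
    """Recover the one missing pure packet by XOR-ing out the known ones."""
    out = mixture
    for i in ids:
        if i in known:
            out = xor_bytes(out, known[i])
    return out


if __name__ == "__main__":
    pure = {1: b"AAAA", 2: b"BBBB"}
    mixture = mix(pure, {1, 2})
    # One neighbor already holds packet 1, the other holds packet 2;
    # a single broadcast of the mixture serves both.
    print(decode(mixture, {1, 2}, {1: pure[1]}))
    print(decode(mixture, {1, 2}, {2: pure[2]}))
```

The availability check in `decodable` is where the exchanged data-availability information would be consulted before choosing which pure packets to mix.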


    Receiver-driven layered error correction multicast over heterogeneous packet networks
    93.
    Granted patent
    Receiver-driven layered error correction multicast over heterogeneous packet networks (legal status: in force)

    Publication No.: US06996097B1

    Publication date: 2006-02-07

    Application No.: US09315810

    Filing date: 1999-05-21

    IPC classification: H04L12/56

    Abstract: A system and method for correcting errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet. This is accomplished by augmenting RLM with one or more layers of error correction information. This allows each receiver to separately optimize the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in an RLM would have one or more multicast error correction data streams (i.e., layers) associated therewith. Each of the error correction layers would contain information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission. A preferred process for generating the error correction streams involves the use of a unique adaptation of the Forward Error Correction (FEC) techniques. This process encodes the transmission data using a linear transform which adds redundant elements. The redundancy permits losses to be corrected because any of the original data elements can be derived from any of the encoded elements. Thus, as long as enough of the encoded data elements are received so as to equal the number of the original data elements, it is possible to derive all the original elements.
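
The recovery property stated at the end of the abstract above (any K received elements out of N suffice to rebuild the K originals) can be sketched with a generic Reed-Solomon-style erasure code. This is not the patent's own adaptation of FEC, which the abstract does not detail; the prime field, the systematic layout and all names are assumptions.

```python
from typing import Dict, List, Tuple

P = 257  # prime modulus; every symbol is an integer in [0, P)


def lagrange_eval(points: List[Tuple[int, int]], x: int) -> int:
    """Evaluate at x the unique polynomial of degree < len(points) through `points`, mod P."""
    total = 0
    for i, (xi, yi) in enumerate(points):
        num, den = 1, 1
        for j, (xj, _) in enumerate(points):
            if i != j:
                num = num * (x - xj) % P
                den = den * (xi - xj) % P
        # pow(den, -1, P) is the modular inverse (Python 3.8+).
        total = (total + yi * num * pow(den, -1, P)) % P
    return total


def encode(data: List[int], n: int) -> List[int]:
    """Systematic linear encoding: element x is the polynomial's value at x, for x in 0..n-1."""
    k = len(data)
    points = list(enumerate(data))
    return list(data) + [lagrange_eval(points, x) for x in range(k, n)]


def recover(received: Dict[int, int], k: int) -> List[int]:
    """Rebuild the k original elements from any k received {index: value} pairs."""
    points = list(received.items())[:k]
    if len(points) < k:
        raise ValueError("need at least k encoded elements")
    return [lagrange_eval(points, x) for x in range(k)]


if __name__ == "__main__":
    data = [10, 20, 30]
    code = encode(data, 5)  # three source elements plus two redundant ones
    survivors = {1: code[1], 3: code[3], 4: code[4]}  # elements 0 and 2 were lost
    print(recover(survivors, 3))
```

In a layered setting, the redundant elements would be carried on the error correction layers, so a receiver subscribing to more of those layers tolerates more loss.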


    System and method for media streaming
    94.
    Granted patent
    System and method for media streaming (legal status: in force)

    Publication No.: US06778553B1

    Publication date: 2004-08-17

    Application No.: US09710688

    Filing date: 2000-11-10

    Applicant: Philip A. Chou

    Inventor: Philip A. Chou

    IPC classification: H04J3/16

    Abstract: A system and process according to the present invention involves tagging prescribed portions of the data of each layer in a layered multicast or layered presentation with an indicator of the importance or utility that the data provides to the receiver. Additionally, the data is tagged with a cost factor involved with sending the data. The aforementioned portions of the data can be an entire data stream of a layer, or some part thereof all the way down to the individual packets making up the stream. The invention also involves determining the optimized scenario for sending the data from the sender to the receiver based on the data tags.
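
One way to picture the tagging in the abstract above is to attach a utility and a cost to each data unit and select what to send under a budget. The greedy utility-per-cost rule below is an assumed stand-in for the patent's optimization, which the abstract does not describe; it also ignores dependencies between layers. `TaggedUnit` and `choose_units` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TaggedUnit:
    name: str
    utility: float  # importance the data provides to the receiver
    cost: float     # cost of sending it (for example, bits); must be positive


def choose_units(units: List[TaggedUnit], budget: float) -> List[TaggedUnit]:
    """Greedy selection by utility per unit cost, staying within the budget."""
    chosen: List[TaggedUnit] = []
    spent = 0.0
    for u in sorted(units, key=lambda u: u.utility / u.cost, reverse=True):
        if spent + u.cost <= budget:
            chosen.append(u)
            spent += u.cost
    return chosen


if __name__ == "__main__":
    units = [
        TaggedUnit("base layer", utility=10.0, cost=100.0),
        TaggedUnit("enhancement 1", utility=4.0, cost=100.0),
        TaggedUnit("enhancement 2", utility=1.0, cost=200.0),
    ]
    for u in choose_units(units, budget=250.0):
        print(u.name)
```

The same structure applies whether a unit is a whole layer or a single packet, since only the tags are consulted.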


    Multimedia presentation latency minimization
    95.
    Granted patent
    Multimedia presentation latency minimization (legal status: in force)

    Publication No.: US06637031B1

    Publication date: 2003-10-21

    Application No.: US09205875

    Filing date: 1998-12-04

    Applicant: Philip A. Chou

    Inventor: Philip A. Chou

    IPC classification: H04N7/173

    Abstract: To obtain real-time responses with interactive multimedia servers, the server provides at least two different audio/visual data streams. A first data stream has fewer bits per frame and provides a video image much more quickly than a second data stream with a higher number of bits and hence a higher-quality video image. The first data stream becomes available to a client much faster and may be more quickly displayed on demand while the second data stream is sent to improve the quality as soon as the playback buffer can handle it. In one embodiment, an entire video signal is layered, with a base layer providing the first signal and further enhancement layers comprising the second. The base layer may be actual image frames or just the audio portion of a video stream. The first and second streams are gradually combined in a manner such that the playback buffer does not overflow or underflow.
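
A back-of-the-envelope sketch of why the low-rate first stream in the abstract above starts sooner. It assumes a simple model that is not taken from the patent: playback begins once a fixed duration of media has been buffered, and the channel rate is constant. The function name and all numbers are illustrative.

```python
def startup_delay(stream_kbps: float, channel_kbps: float, prebuffer_s: float) -> float:
    """Seconds needed to download `prebuffer_s` seconds of media before playback can start."""
    return prebuffer_s * stream_kbps / channel_kbps


if __name__ == "__main__":
    channel = 1000.0   # assumed channel capacity in kbps
    prebuffer = 2.0    # assumed seconds of media buffered before playback
    print("base layer:", startup_delay(100.0, channel, prebuffer), "s")
    print("full quality:", startup_delay(800.0, channel, prebuffer), "s")
```

Under this model the delay scales linearly with the stream's bit rate, so starting on the base layer and adding enhancement layers afterwards shortens the wait without giving up the eventual quality.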


    Apparatus and method for unequal error protection in multiple-description coding using overcomplete expansions
    96.
    Granted patent
    Apparatus and method for unequal error protection in multiple-description coding using overcomplete expansions (legal status: in force)

    Publication No.: US06460153B1

    Publication date: 2002-10-01

    Application No.: US09276955

    Filing date: 1999-03-26

    IPC classification: H03M13/00

    CPC classification: H04N19/89

    Abstract: A projection onto convex sets (POCS)-based method for consistent reconstruction of a signal from a subset of quantized coefficients received from an N×K overcomplete transform. By choosing a frame operator F to be the concatenation of two or more K×K invertible transforms, the POCS projections are calculated in R^K space using only the K×K transforms and their inverses, rather than the larger R^N space using pseudo-inverse transforms. Practical reconstructions are enabled based on, for example, wavelet, subband, or lapped transforms of an entire image. In one embodiment, unequal error protection for multiple description source coding is provided. In particular, given a bit-plane representation of the coefficients in an overcomplete representation of the source, one embodiment of the present invention provides coding the most significant bits with the highest redundancy and the least significant bits with the lowest redundancy. In one embodiment, this is accomplished by varying the quantization step size for the different coefficients. Then, the available received quantized coefficients are decoded using a method based on alternating projections onto convex sets.
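
A toy sketch of the alternating-projection idea in the abstract above, for K = 2 and N = 4. It assumes two orthonormal transforms (identity and a 45-degree rotation) so that clipping a coefficient to its quantization cell is an exact projection and the inverse is just the transpose; the patent covers general invertible transforms, and none of the names below come from it.

```python
import math
from typing import List, Optional

C = 1 / math.sqrt(2)
# Two orthonormal 2x2 transforms; stacking them gives a 4x2 frame operator.
T = [
    [[1.0, 0.0], [0.0, 1.0]],  # identity
    [[C, -C], [C, C]],         # rotation by 45 degrees
]


def apply(m: List[List[float]], v: List[float]) -> List[float]:
    """Multiply a 2x2 matrix by a 2-vector."""
    return [m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1]]


def transpose(m: List[List[float]]) -> List[List[float]]:
    return [[m[0][0], m[1][0]], [m[0][1], m[1][1]]]


def quantize(v: float, step: float) -> int:
    """Uniform quantizer: index of the cell of width `step` centred on a multiple of `step`."""
    return math.floor(v / step + 0.5)


def pocs(received: List[List[Optional[int]]], step: float, iterations: int = 200) -> List[float]:
    """received[i][j] is the quantization index of coefficient j of transform i, or None if lost."""
    x = [0.0, 0.0]
    for _ in range(iterations):
        for m, indices in zip(T, received):
            y = apply(m, x)
            for j, q in enumerate(indices):
                if q is not None:
                    lo, hi = (q - 0.5) * step, (q + 0.5) * step
                    y[j] = min(max(y[j], lo), hi)  # project onto the quantization cell
            x = apply(transpose(m), y)  # inverse of an orthonormal transform is its transpose
    return x


if __name__ == "__main__":
    x_true, step = [0.3, 0.7], 0.5
    received = [[quantize(c, step) for c in apply(m, x_true)] for m in T]
    received[0][1] = None  # one of the four coefficients is lost in transit
    print(pocs(received, step))
```

All arithmetic stays in two dimensions: each pass applies one 2×2 transform, clips, and inverts, which is the saving over working with the 4×2 frame and its pseudo-inverse.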


    Automatic training of character templates using a text line image, a text line transcription and a line image source model
    97.
    Granted patent
    Automatic training of character templates using a text line image, a text line transcription and a line image source model (legal status: expired)

    Publication No.: US5594809A

    Publication date: 1997-01-14

    Application No.: US431253

    Filing date: 1995-04-28

    IPC classification: G06K9/68 G06K9/62

    CPC classification: G06K9/6297 G06K2209/01

    Abstract: A technique for automatically producing, or training, a set of bitmapped character templates defined according to the sidebearing model of character image positioning uses as input a text line image of unsegmented characters, called glyphs, as the source of training samples. The training process also uses a transcription associated with the text line image, and an explicit, grammar-based text line image source model that describes the structural and functional features of a set of possible text line images that may be used as the source of training samples. The transcription may be a literal transcription of the line image, or it may be nonliteral, for example containing logical structure tags for document formatting and layout, such as found in markup languages. Spatial positioning information modeled by the text line image source model and the labels in the transcription are used to determine labeled image positions identifying the location of glyph samples occurring in the input line image, and the character templates are produced using the labeled image positions. In another aspect of the technique, a set of character templates defined by any character template model, such as a segmentation-based model, is produced using the grammar-based text line image source model and specifically using a tag transcription containing logical structure tags for document formatting and layout. Both aspects of the training technique may represent the text line image source model and the transcription as finite state networks.


    Image recognition method using two-dimensional stochastic grammars
    98.
    Granted patent
    Image recognition method using two-dimensional stochastic grammars (legal status: expired)

    Publication No.: US5020112A

    Publication date: 1991-05-28

    Application No.: US429923

    Filing date: 1989-10-31

    Applicant: Philip A. Chou

    Inventor: Philip A. Chou

    IPC classification: G06K9/00 G06K9/68 G06K9/72

    Abstract: A method of automatically identifying bitmapped image objects. Each of a set of templates in an object template library is compared with all areas of like size of a bitmapped image. A set of signals is generated for each such comparison that satisfies a defined matching criterion between the template and the image area being compared. The set of signals identifies the object based on the matching template, the location of the object in the image and an indication of the goodness of the match between the object and the template. A series of possible parse trees are formed that describe the image with a probability of occurrence for each tree. Each parent node and its child nodes of each parse tree satisfy a grammatical production rule in which some of the production rules define spatial relationships between objects in the image. The one of the possible parse trees which has the largest probability of occurrence is selected for further utilization.
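
The first stage described in the abstract above (comparing each template with every same-sized image area and emitting a signal for each sufficiently good match) can be sketched as follows. The score used here, the fraction of agreeing pixels, and the threshold are assumptions; the abstract only requires some defined matching criterion. The grammar-based parse tree stage is not shown.

```python
from typing import Dict, List, Tuple

Bitmap = List[List[int]]
Signal = Tuple[str, int, int, float]  # (template label, row, column, goodness of match)


def match_templates(image: Bitmap, templates: Dict[str, Bitmap], threshold: float) -> List[Signal]:
    """Slide every template over every same-sized image area and keep matches above threshold."""
    signals: List[Signal] = []
    for label, tpl in templates.items():
        th, tw = len(tpl), len(tpl[0])
        for r in range(len(image) - th + 1):
            for c in range(len(image[0]) - tw + 1):
                agree = sum(
                    image[r + i][c + j] == tpl[i][j]
                    for i in range(th)
                    for j in range(tw)
                )
                score = agree / (th * tw)
                if score >= threshold:
                    signals.append((label, r, c, score))
    return signals


if __name__ == "__main__":
    image = [
        [0, 0, 0, 0],
        [0, 1, 1, 0],
        [0, 1, 1, 0],
    ]
    library = {"square": [[1, 1], [1, 1]]}
    print(match_templates(image, library, threshold=1.0))
```

Each emitted tuple carries the three items the abstract lists: which object matched, where it sits in the image, and how good the match was. These would be the terminal symbols a later parsing stage works from.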
