Transmitting packet-based data items
    61.
    发明申请
    Transmitting packet-based data items 有权
    发送基于数据包的数据项

    公开(公告)号:US20080002667A1

    公开(公告)日:2008-01-03

    申请号:US11480761

    申请日:2006-06-30

    Abstract: Aspects of the present invention are directed at sending a data item from a sending client to a receiving client. In accordance with one embodiment, a method provides controls for generating an audio-based command to send a data item from a sending client to a receiving client. More specifically, the method includes receiving an audio stream at the sending client from a sending party. As the audio stream is being received, a determination is made regarding whether a command to send a data item to the receiving client was received. If a command to send a data item is included in the audio stream, the method identifies the data item that is the object of the command and then transmits the data item to the receiving client over the network.

    Abstract translation: 本发明的方面涉及从发送客户端向接收客户端发送数据项。 根据一个实施例,一种方法提供用于生成基于音频的命令以从发送客户端向接收客户端发送数据项的控制。 更具体地,该方法包括从发送方在发送客户端处接收音频流。 当正在接收音频流时,确定是否接收到向接收客户端发送数据项的命令。 如果发送数据项的命令被包括在音频流中,则该方法识别作为命令对象的数据项,然后通过网络将数据项发送到接收客户端。

    Processing a received voicemail message
    62.
    发明申请
    Processing a received voicemail message 有权
    处理收到的语音邮件消息

    公开(公告)号:US20070280434A1

    公开(公告)日:2007-12-06

    申请号:US11445057

    申请日:2006-05-31

    CPC classification number: H04M3/533 H04M2201/42

    Abstract: Aspects of the present invention are directed at software for processing a received voicemail message to provide enhanced features in a voicemail system. In one embodiment, a user interface is provided where a callee may generate events to modify audio attributes of a received voicemail message. When an event to modify the audio attributes of the received voicemail message is received, aspects of the present invention obtain a set of event data that describes input generated by the callee. Then, a media player program is used to modify attributes of the data packets in the voicemail message in accordance with the input received from the callee.

    Abstract translation: 本发明的方面涉及用于处理接收的语音邮件消息以在语音邮件系统中提供增强特征的软件。 在一个实施例中,提供了用户界面,其中被叫方可以生成事件来修改所接收的语音邮件消息的音频属性。 当接收到修改接收到的语音邮件消息的音频属性的事件时,本发明的各方面获得描述被叫方产生的输入的一组事件数据。 然后,媒体播放器程序用于根据从被叫方接收到的输入来修改语音邮件消息中的数据分组的属性。

    Automatic training of layout parameters in a 2D image model
    63.
    发明授权
    Automatic training of layout parameters in a 2D image model 失效
    在2D图像模型中自动训练布局参数

    公开(公告)号:US06687404B1

    公开(公告)日:2004-02-03

    申请号:US08880137

    申请日:1997-06-20

    CPC classification number: G06K9/00442

    Abstract: A two-dimensional (2D) image model models the layout structure of a class of document images as an image grammar and includes production rules having explicit layout parameters as data items that indicate information about the spatial relationships among image constituents occurring in images included in the class. The parameters are explicitly represented in the grammar rules in a manner that permits them to be automatically trained by a training operation that makes use of sample document images from the class of modeled documents. After each sample image is aligned with the 2D grammar, document-specific measurements about the spatial relationships between image constituents are taken from the image. Optimal values for the layout parameters are then computed from the measurement data collected from all samples. An illustrated implementation of the 2D image model takes the form of a stochastic context-free attribute grammar in which synthesized and inherited attributes and synthesis and inheritance functions are associated with each production rule in the grammar. The attributes indicate physical spatial locations of image constituents in the image, and a set of parameterized functions, in which the coefficients are the layout parameters, compute the attributes as a function of a characteristic of an image constituent of the production rule. The measurement data is taken from an annotated parse tree produced for each training image by the grammar. A trained grammar can then be used, for example, for document recognition and layout analysis operations on any document in the class of documents modeled by the grammar.

    Abstract translation: 二维(2D)图像模型将一类文档图像的布局结构模型化为图像语法,并且包括具有显式布局参数的生产规则作为数据项,其指示关于在包含在图像中的图像中出现的图像成分之间的空间关系的信息 类。 这些参数以语法规则明确表示,允许通过使用来自建模文档类的样本文档图像的训练操作来自动训练这些参数。 在每个样本图像与2D语法对齐后,从图像中获取关于图像成分之间的空间关系的文献特异性测量。 然后根据从所有样本收集的测量数据计算布局参数的最优值。 2D图像模型的说明性实现采用随机上下文无关属性语法的形式,其中合成和继承的属性和合成和继承函数与语法中的每个生产规则相关联。 属性表示图像中图像成分的物理空间位置,以及一组参数化函数,其中系数是布局参数,根据生产规则的图像成分的特征计算属性。 测量数据取自用语法为每个训练图像生成的注释分析树。 然后,可以使用经过训练的语法,例如,用于由语法建模的文档类中的任何文档的文档识别和布局分析操作。

    Receiver-driven layered error correction multicast over heterogeneous packet networks
    64.
    发明授权
    Receiver-driven layered error correction multicast over heterogeneous packet networks 有权
    接收器驱动的分层纠错多播在异构分组网络上

    公开(公告)号:US06532562B1

    公开(公告)日:2003-03-11

    申请号:US09316869

    申请日:1999-05-21

    CPC classification number: H04L1/0059 H04L1/0002 H04L1/06 H04L1/08

    Abstract: “Correction of errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet is accomplished by augmenting RLM with one or more layers of error correction information. Each receiver separately optimizes the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM would have one or more associated multicasted error correction data streams (i.e., layers). Each error correction layer contains information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission.”

    Abstract translation: 通过异构分组网络(如Internet)的实时媒体接收机驱动分层多播(RLM)中发生的错误和损失的纠正是通过用一层或多层纠错信息增强RLM来实现的,每个接收机分别优化 通过订阅至少一个纠错层,接收的音频和视频信息的质量理想地,RLM中的每个源层将具有一个或多个关联的多播纠错数据流(即,层),每个纠错层包含 可以使用来自相关源层的丢失分组来替代不止一个纠错层,因为包含在数据流中的一些纠错分组被包含在替换相关源流中丢失的分组所需的数据流本身可能在传输中丢失 “。

    Reconstruction of missing coefficients of overcomplete linear transforms using projections onto convex sets
    65.
    发明授权
    Reconstruction of missing coefficients of overcomplete linear transforms using projections onto convex sets 失效
    使用投影到凸集上重建缺失的完全线性变换系数

    公开(公告)号:US06470469B1

    公开(公告)日:2002-10-22

    申请号:US09276842

    申请日:1999-03-26

    Abstract: A projection onto convex sets (POCS)-based method for consistent reconstruction of a signal from a subset of quantized coefficients received from an N×K overcomplete transform. By choosing a frame operator F to be the concatenization of two or more K×K invertible transforms, the POCS projections are calculated in RK space using only the K×K transforms and their inverses, rather than the larger RN space using pseudo inverse transforms. Practical reconstructions are enabled based on, for example, wavelet, subband, or lapped transforms of an entire image. In one embodiment, unequal error protection for multiple description source coding is provided. In particular, given a bit-plane representation of the coefficients in an overcomplete representation of the source, one embodiment of the present invention provides coding the most significant bits with the highest redundancy and the least significant bits with the lowest redundancy. In one embodiment, this is accomplished by varying the quantization stepsize for the different coefficients. Then, the available received quantized coefficients are decoded using a method based on alternating projections onto convex sets.

    Abstract translation: 基于凸集(POCS)的方法的投影,用于从从NxK过完全变换接收的量化系数的子集的信号的一致重构。 通过选择一个帧运算符F作为两个或多个KxK可逆变换的并置,POCS投影在RK空间中仅使用KxK变换及其反转而不是使用伪逆变换的较大的RN空间来计算。 基于例如整个图像的小波,子带或重叠变换来实现实际重建。 在一个实施例中,提供了用于多描述源编码的不等差错保护。 特别地,给定源的过完整表示中的系数的位平面表示,本发明的一个实施例提供了具有最高冗余度的最高有效位和具有最低冗余度的最低有效位的编码。 在一个实施例中,这通过改变不同系数的量化步长来实现。 然后,使用基于在凸集上的交替投影的方法对可用的接收量化系数进行解码。

    Interleaved multiple multimedia stream for synchronized transmission over a computer network
    66.
    发明授权
    Interleaved multiple multimedia stream for synchronized transmission over a computer network 失效
    交织多个多媒体流,用于通过计算机网络进行同步传输

    公开(公告)号:US06449653B2

    公开(公告)日:2002-09-10

    申请号:US08826345

    申请日:1997-03-25

    CPC classification number: H04L65/602 H04L29/06 H04L29/06027 H04L65/80

    Abstract: The production of an interleaved multimedia stream for servers and client computers coupled to each other by a diverse computer network which includes local area networks (LANs) and/or wide area networks (WANs) such as the internet. Interleaved multimedia streams can include compressed video frames for display in a video window, accompanying compressed audio frames and annotation frames. In one embodiment, a producer captures separate video/audio frames and generates an interleaved multimedia file. In another embodiment, the interleaved file include annotation frames which provide either pointer(s) to the event(s) of interest or include displayable data embedded within the annotation stream. The interleaved file is then stored in the web server for subsequent retrieval by client computer(s) in a coordinated manner, so that the client computer(s) is able to synchronously display the video frames and displayable event(s) in a video window and event window(s), respectively. In some embodiments, the interleaved file includes packets with variable length fields, each of which are at least one numerical unit in length.

    Abstract translation: 通过包括诸如互联网的局域网(LAN)和/或广域网(WAN))的不同计算机网络来产生用于服务器和客户端计算机的交错多媒体流。 交错多媒体流可以包括用于在视频窗口中显示的压缩视频帧,伴随压缩音频帧和注释帧。 在一个实施例中,制作者捕获单独的视频/音频帧并产生交织的多媒体文件。 在另一个实施例中,交织的文件包括向感兴趣的事件提供指针或者包括嵌入注释流内的可显示数据的注释帧。 然后将交织的文件存储在网络服务器中,以便由客户端计算机以协调的方式随后检索,使得客户端计算机能够在视频窗口中同步显示视频帧和可显示的事件 和事件窗口。 在一些实施例中,交织文件包括具有可变长度字段的分组,每个长度字段的长度至少为一个数字单位。

    Document image decoding using modified branch-and-bound methods
    67.
    发明授权
    Document image decoding using modified branch-and-bound methods 失效
    使用修改的分支和绑定方法的文档图像解码

    公开(公告)号:US5526444A

    公开(公告)日:1996-06-11

    申请号:US60196

    申请日:1993-05-07

    CPC classification number: G06K9/00 G06K9/00463 G06K9/34 G06K9/6297 G06K2209/01

    Abstract: An image decoding and recognition system and method comprising a fast heuristic algorithm using hidden Markov models (HMM). The new search algorithm, called an "iterative complete path" (ICP) algorithm, patterned after well-known branch-and-bound (B&B) methods, significantly reduces the complexity and improves the speed of HMM image decoding without sacrificing the optimality of the straightforward procedure. An advantageous form of the heuristic functions which is useful in applying the ICP algorithm to text-like images is described. The ICP algorithm is directly applicable to the separable type of finite-state source models. Also disclosed is a technique for transforming more general source models into such a separable form.

    Abstract translation: 一种使用隐马尔可夫模型(HMM)的快速启发式算法的图像解码识别系统和方法。 称为“迭代完整路径”(ICP)算法的新型搜索算法,在公知的分支绑定(B&B)方法之后进行图案化,显着降低了复杂度并提高了HMM图像解码的速度,而不牺牲 简单的程序。 描述了将ICP算法应用于文字图像中有用的启发式函数的有利形式。 ICP算法直接适用于可分离类型的有限状态源模型。 还公开了一种用于将更一般的源模型转换成这种可分离形式的技术。

    Minimizing network latency in interactive internet applications
    68.
    发明授权
    Minimizing network latency in interactive internet applications 有权
    最大限度地减少交互式互联网应用中的网络延迟

    公开(公告)号:US09059817B2

    公开(公告)日:2015-06-16

    申请号:US12951908

    申请日:2010-11-22

    CPC classification number: H04L1/0009 H04L1/1812 H04L1/188

    Abstract: A method and system that enhances a user's performance while interacting with an interactive internet application such as a Massively Multiplayer Online (MMO) game is provided. The network latency experienced by users participating in the MMO game is minimized by dynamically determining an optimal transmission action for a message generated by the MMO game. In one embodiment, determining the optimal transmission action for a message includes dynamically determining the optimal number of redundant Forward Error Correction (FEC) packets to add to a message prior to transmitting a message to a receiving device. The optimal number of FEC packets is determined based on a wide range of varying network conditions.

    Abstract translation: 提供了一种在与诸如大型多人在线(MMO)游戏之类的交互式互联网应用交互的同时增强用户表现的方法和系统。 通过动态地确定由MMO游戏产生的消息的最佳传输动作,使参与MMO游戏的用户经历的网络延迟最小化。 在一个实施例中,确定消息的最佳传输动作包括在将消息发送到接收设备之前动态地确定要添加到消息的冗余前向纠错(FEC)分组的最佳数量。 基于广泛的变化的网络条件来确定FEC分组的最佳数量。

    Non-linguistic signal detection and feedback
    69.
    发明授权
    Non-linguistic signal detection and feedback 有权
    非语言信号检测和反馈

    公开(公告)号:US08963987B2

    公开(公告)日:2015-02-24

    申请号:US12789142

    申请日:2010-05-27

    CPC classification number: H04N7/15 H04N7/147

    Abstract: Non-linguistic signal information relating to one or more participants to an interaction may be determined using communication data received from the one or more participants. Feedback can be provided based on the determined non-linguistic signals. The participants may be given an opportunity to opt in to having their non-linguistic signal information collected, and may be provided complete control over how their information is shared or used.

    Abstract translation: 可以使用从一个或多个参与者接收的通信数据来确定与交互的一个或多个参与者有关的非语言信号信息。 可以基于确定的非语言信号提供反馈。 参与者可能有机会选择收集其非语言信号信息,并且可以完全控制他们的信息如何共享或使用。

    Techniques for managing visual compositions for a multimedia conference call
    70.
    发明授权
    Techniques for managing visual compositions for a multimedia conference call 有权
    用于管理多媒体电话会议的视觉作品的技术

    公开(公告)号:US08773494B2

    公开(公告)日:2014-07-08

    申请号:US11511749

    申请日:2006-08-29

    Abstract: Techniques for managing visual compositions for a multimedia conference call are described. An apparatus may comprise a processor to allocate a display object bit rate for multiple display objects where a total display object bit rate for all display objects is equal to or less than a total input bit rate, and decode video information from multiple video streams each having different video layers with different levels of spatial resolution, temporal resolution and quality for two or more display objects. Other embodiments are described and claimed.

    Abstract translation: 描述用于管理多媒体电话会议的视觉作品的技术。 设备可以包括处理器,用于为多个显示对象分配显示对象比特率,其中所有显示对象的总显示对象比特率等于或小于总输入比特率,并且从多个视频流解码视频信息,每个视频流具有 具有不同级别的空间分辨率,时间分辨率和两个或多个显示对象的质量的不同视频层。 描述和要求保护其他实施例。

Patent Agency Ranking