Video data retrieval method for use in video server environments that
use striped disks
    71.
    发明授权
    Video data retrieval method for use in video server environments that use striped disks 失效
    视频数据检索方法,用于使用条带化磁盘的视频服务器环境

    公开(公告)号:US5724543A

    公开(公告)日:1998-03-03

    申请号:US491755

    申请日:1995-06-19

    摘要: A method for retrieving video data from a video server, the video data having been stored on a plurality of disks based on a disk striping technique. In accordance with one illustrative embodiment, the method comprises the steps of retrieving a predetermined number of bits from the plurality of disks in the video server, and storing that predetermined number of bits in a buffer memory, wherein the number of bits retrieved and stored is based on the number of disks and on the capacity of the buffer memory. These steps, which together may illustratively constitute one round of the video retrieval process, may be repeated until the entire video has been retrieved and, for example, transmitted to the intended recipient(s) at a required transmission rate.

    摘要翻译: 一种用于从视频服务器检索视频数据的方法,所述视频数据已经基于盘分条技术存储在多个盘上。 根据一个说明性实施例,该方法包括以下步骤:从视频服务器中的多个盘中检索预定数量的比特,并将该预定数量的比特存储在缓冲存储器中,其中检索和存储的比特数是 基于磁盘的数量和缓冲存储器的容量。 这些步骤一起可以说明性地构成视频检索过程的一轮,可以重复,直到整个视频被检索并且例如以所需的传输速率发送到预期的接收者。

    Method and apparatus for efficient aggregate computation over data streams
    74.
    发明授权
    Method and apparatus for efficient aggregate computation over data streams 有权
    用于数据流高效汇总计算的方法和装置

    公开(公告)号:US08832073B2

    公开(公告)日:2014-09-09

    申请号:US11770926

    申请日:2007-06-29

    IPC分类号: G06F17/30

    摘要: Improved techniques are disclosed for processing data stream queries wherein a data stream is obtained, a set of aggregate queries to be executed on the data stream is obtained, and a query plan for executing the set of aggregate queries on the data stream is generated. In a first method, the generated query plan includes generating at least one intermediate aggregate query, wherein the intermediate aggregate query combines a subset of aggregate queries from the set of aggregate queries so as to pre-aggregate data from the data stream prior to execution of the subset of aggregate queries such that the generated query plan is optimized for computational expense based on a given cost model. In a second method, the generated query plan includes identifying similar filters in two or more aggregate queries of the set of aggregate queries and combining the similar filters into a single filter such that the single filter is usable to pre-filter data input to the two or more aggregate queries.

    摘要翻译: 公开了用于处理数据流查询的改进技术,其中获得数据流,获得要在数据流上执行的一组聚合查询,并且生成用于在数据流上执行聚合查询集合的查询计划。 在第一种方法中,生成的查询计划包括生成至少一个中间聚合查询,其中中间聚合查询组合来自聚合查询集合的聚合查询的子集,以便在执行之前从数据流预聚合数据 聚合查询的子集,使得生成的查询计划基于给定的成本模型被优化用于计算费用。 在第二种方法中,所生成的查询计划包括在集合查询集合的两个或多个聚合查询中识别类似的过滤器,并将类似的过滤器组合成单个过滤器,使得单个过滤器可用于预先过滤输入到两个 或更多聚合查询。

    Techniques for categorizing web pages
    75.
    发明授权
    Techniques for categorizing web pages 有权
    技术分类网页

    公开(公告)号:US08768926B2

    公开(公告)日:2014-07-01

    申请号:US12652624

    申请日:2010-01-05

    IPC分类号: G06F7/00 G06F17/30

    摘要: Web pages are efficiently categorized in a data processor without analyzing the content of the web pages. According to at least one embodiment, data is maintained that represents sample URLs grouped into a plurality of clusters. The sample URLs of a cluster are used to produce a URL regular expression pattern (“URL-regex”) that differentiates the sample URLs of the cluster from the sample URLs of other clusters and that covers at least a specified percentage of the sample URLs in the cluster. The process of producing a URL-regex is repeated for each of the clusters producing a URL-regex for each cluster. Web pages are then categorized into one of the clusters by determining which of the URL-regex patterns produced for the clusters match URLs that refer to the web pages. Thus, a web page may be categorized based on a URL that refers to the web page without having to obtain and analyze the content of the web page.

    摘要翻译: 在不分析网页内容的情况下,网页被有效地分类到数据处理器中。 根据至少一个实施例,维护表示分组为多个集群的抽样URL的数据。 集群的示例URL用于生成URL正则表达式模式(“URL-regex”),可以将集群的示例URL与其他集群的示例URL进行区分,并且至少包含指定百分比的示例URL 集群。 为每个集群重复生成URL-regex的过程,为每个集群生成一个URL-regex。 然后,通过确定针对集群产生的哪个URL-regex模式与引用网页的URL匹配,将网页分类到其中一个集群。 因此,可以基于引用网页的URL来分类网页,而不必获取和分析网页的内容。

    Automated transformation of specifications for devices into executable modules
    76.
    发明授权
    Automated transformation of specifications for devices into executable modules 有权
    将设备规范自动转换为可执行模块

    公开(公告)号:US08356085B2

    公开(公告)日:2013-01-15

    申请号:US10600995

    申请日:2003-06-20

    IPC分类号: G06F15/177

    摘要: A number of configuration elements are associated with a number of devices. Information about input configuration elements is accessed. An input configuration element is associated with one or more input rules. It is determined which of the configuration elements could be accessed by the input rules and any call chains emanating from the rules. Output rules are determined by using the accessed configuration elements, the input rules, and the way the input rule manipulates its accessed configuration elements. Each output rule may be derived from an input rule and corresponds to the same input configuration element associated with that input rule. An executable module is generated that is adapted to access at least a given one of the input configuration elements and to trigger one or more of the output rules corresponding to the given input configuration element. Read and write sets for rules are determined, and the triggered output rules ensure that restrictions associated with a configuration element are not violated.

    摘要翻译: 许多配置元素与多个设备相关联。 访问有关输入配置元素的信息。 输入配置元素与一个或多个输入规则相关联。 确定哪些配置元素可以被输入规则和从规则发出的任何调用链访问。 输出规则通过使用访问的配置元素,输入规则以及输入规则操纵其访问的配置元素的方式来确定。 每个输出规则可以从输入规则导出,并且对应于与该输入规则相关联的相同输入配置元素。 生成可执行模块,其适于访问至少一个给定的一个输入配置元素,并且触发对应于给定输入配置元素的一个或多个输出规则。 确定规则的读取和写入集合,并且触发的输出规则确保与配置元素相关联的限制不被违反。

    METHOD AND SYSTEM FOR MAXIMIZING CONTENT SPREAD IN SOCIAL NETWORK
    77.
    发明申请
    METHOD AND SYSTEM FOR MAXIMIZING CONTENT SPREAD IN SOCIAL NETWORK 有权
    用于最大化社会网络中的内容传播的方法和系统

    公开(公告)号:US20120259915A1

    公开(公告)日:2012-10-11

    申请号:US13080661

    申请日:2011-04-06

    IPC分类号: G06F15/16

    CPC分类号: G06Q50/01

    摘要: A method, a system and a computer program product for maximizing content spread in a social network are provided. Samples of edges are generated from an initial candidate set of edges. Each edge of the samples of edges has a probability value for content flow. Further, a subset of edges is determined from the samples of edges based on gain corresponding to each edge. Also, each node of the subset of edges is having at least one of less than ‘K’ or equal to ‘K’ incoming edges. Further, the probability of each edge, of the subset of edges, may be incremented. Furthermore, a final set of edges may be determined by ensuring ‘K’ incoming edges. The ‘K’ incoming edges may be ensured by removing one or more incoming edges when a number of the incoming edges for a node of the final set is greater than ‘K’ incoming edge.

    摘要翻译: 提供了一种用于最大化社交网络中的内容传播的方法,系统和计算机程序产品。 边缘样本从初始候选边缘集合生成。 边缘样本的每个边缘都具有内容流的概率值。 此外,基于与每个边缘相对应的增益,从边缘样本确定边缘子集。 此外,边缘子集的每个节点具有小于K或等于K个入口边缘中的至少一个。 此外,可以增加边缘子集中的每个边缘的概率。 此外,可以通过确保K个进入边缘来确定最终的边缘集合。 当最终集合的节点的进入边缘的数量大于K个进入边缘时,可以通过去除一个或多个进入边缘来确保K个进入边缘。

    HIGH PRECISION WEB EXTRACTION USING SITE KNOWLEDGE
    78.
    发明申请
    HIGH PRECISION WEB EXTRACTION USING SITE KNOWLEDGE 审中-公开
    使用站点知识的高精度网络提取

    公开(公告)号:US20100257440A1

    公开(公告)日:2010-10-07

    申请号:US12416381

    申请日:2009-04-01

    IPC分类号: G06F17/21 G06F17/00

    CPC分类号: G06F16/986

    摘要: Techniques for high precision web extraction using site knowledge are provided. Portions of repeating text are identified in unlabeled web pages from a particular web site. Based on the portions of repeating text, the unlabeled web pages are partitioned into a set of segments. Multiple labels are assigned to respectively corresponding multiple attributes in the set of segments, where assigning the multiple labels comprises applying a classification model to each separate segment in the set of segments. First one or more labels are identified that were erroneously assigned to one or more attributes in the set of segments. Second one or more correct labels for the one or more attributes are determined. The first one or more labels in the set of segments are corrected by assigning the second one or more labels to the one or more attributes.

    摘要翻译: 提供使用现场知识的高精度网络提取技术。 在特定网站的未标记的网页中标识重复文本的部分。 基于重复文本的部分,未标记的网页被分割成一组段。 多个标签被分配给该组段中的相应的多个属性,其中分配多个标签包括将分类模型应用于该组段中的每个单独的段。 识别出错误地分配给该组段中的一个或多个属性的第一个或多个标签。 确定一个或多个属性的第二个一个或多个正确标签。 通过将第二个一个或多个标签分配给一个或多个属性来校正该组段中的第一个或多个标签。

    Method for distinct count estimation over joins of continuous update stream
    79.
    发明授权
    Method for distinct count estimation over joins of continuous update stream 有权
    连续更新流连接的不同计数估计方法

    公开(公告)号:US07668856B2

    公开(公告)日:2010-02-23

    申请号:US10957185

    申请日:2004-09-30

    IPC分类号: G06F7/00

    摘要: The invention provides methods and systems for summarizing multiple continuous update streams such that an approximate answer to a query over one or more of the continuous update streams (such as a Query requiring a join operation followed by a duplicate elimination step) may be rapidly provided. The systems and methods use multiple (parallel) Join Distinct (JD) Sketch data structures corresponding to hash buckets of at least one initial attribute.

    摘要翻译: 本发明提供了用于总结多个连续更新流的方法和系统,使得可以快速地提供对连续更新流中的一个或多个(诸如需要连接操作后跟重复消除步骤的查询)的查询的近似答案。 系统和方法使用与至少一个初始属性的哈希桶对应的多个(并行)联合特征(JD)草图数据结构。

    System and method for determining the physical topology of a network having multiple subnets
    80.
    发明授权
    System and method for determining the physical topology of a network having multiple subnets 有权
    用于确定具有多个子网的网络的物理拓扑的系统和方法

    公开(公告)号:US07535911B2

    公开(公告)日:2009-05-19

    申请号:US10445585

    申请日:2003-05-27

    IPC分类号: H04L12/28

    摘要: A system for, and method of, determining a physical topology of a network having multiple subnets. In one embodiment, the system includes: (1) a skeleton path initializer that uses addressing information from elements in the network to develop a collection of skeleton paths of direct physical connections between labeled ones of the elements, the skeleton paths traversing multiple of the subnets and (2) a skeleton path refiner, coupled to the skeleton path initializer, that refines the collection by inferring, from the direct physical connections and path constraints derived therefrom, other physical connections in the skeleton paths involving unlabeled ones of the elements.

    摘要翻译: 用于确定具有多个子网的网络的物理拓扑的系统和方法。 在一个实施例中,系统包括:(1)骨架路径初始化器,其使用来自网络中的元件的寻址信息来开发标记的元件之间的直接物理连接的骨架路径的集合,穿过多个子网的骨架路径 以及(2)骨架路径精炼器,其耦合到骨架路径初始化器,其通过从包括未标记的元件的骨架路径中的直接物理连接和路径约束推断其精细化收集。