Collaborative team crawling:Large scale information gathering over the internet
    1.
    发明授权
    Collaborative team crawling:Large scale information gathering over the internet 失效
    协作小组爬行:通过互联网收集大量信息

    公开(公告)号:US06182085B2

    公开(公告)日:2001-01-30

    申请号:US09086379

    申请日:1998-05-28

    IPC分类号: G06F1730

    摘要: A distributed collection of web-crawlers to gather information over a large portion of the cyberspace. These crawlers share the overall crawling through a cyberspace partition scheme. They also collaborate with each other through load balancing to maximally utilize the computing resources of each of the crawlers. The invention takes advantage of the hierarchical nature of the cyberspace namespace and uses the syntactic components of the URL structure as the main vehicle for dividing and assigning crawling workload to individual crawler. The partition scheme is completely distributed in which each crawler makes the partitioning decision based on its own crawling status and a globally replicated partition tree data structure.

    摘要翻译: 分布式的网络爬虫收集器,用于在大量网络空间中收集信息。 这些爬虫共享通过网络空间分区方案的整体爬网。 他们还通过负载平衡相互协作,最大限度地利用每个爬虫的计算资源。 本发明利用了网络空间命名空间的层次性,并将URL结构的句法组件作为将抓取工作量分配给各个爬虫的主要工具。 分区方案是完全分布的,其中每个爬行器根据其自身的爬行状态和全局复制的分区树数据结构进行分区决定。

    Outbound information analysis for generating user interest profiles and improving user productivity
    2.
    发明授权
    Outbound information analysis for generating user interest profiles and improving user productivity 失效
    出站信息分析,用于生成用户兴趣配置文件并提高用户生产力

    公开(公告)号:US06654735B1

    公开(公告)日:2003-11-25

    申请号:US09227225

    申请日:1999-01-08

    IPC分类号: G06F1730

    摘要: A system for automatically generating user interest profiles and delivering information to users learns a user's interests by monitoring the user's outbound communication streams, i.e., the information that the user produces either by typing (e.g., while a user is composing an e-mail message or editing a word processor document) or by speaking (e.g., while a user is engaged in a phone conversation or listening to a lecture). The system uses the monitored text to build (and possibly update) a user interest profile. The profile is constructed from current text generated by the user, so that the retrieved information reflects present user interests. In addition, the profile may also retain past user interests, so that the profile reflects a combination of past and present user interests. The system then automatically queries diverse databases for information relevant to the interest profile. The databases may include internet web pages, files stored on the user's local network, and other local or remote data repositories. The queries may use a combination of internet search engines, the specific selection of which may depend upon the nature and/or content of the queries. The information retrieved in response to the queries is then presented to the user. The retrieved information may contain, for example, answers to questions that the user might ask and/or data related to the user's current and continuing interests. Because a user's current speech or typed text is highly correlated with the user's current interests, the retrieved information will be relevant to the user's actual interests. The communication stream monitoring, interest profile building, data base querying, and presentation of retrieved information are all performed automatically, in real time, and in the background of current user activities.

    摘要翻译: 用于自动生成用户兴趣简档并向用户传递信息的系统通过监视用户的出站通信流来学习用户的兴趣,即用户通过键入产生的信息(例如,当用户正在撰写电子邮件消息或 编辑文字处理器文档)或通过说话(例如,当用户进行电话交谈或听讲座时)。 系统使用受监控的文本构建(并可能更新)用户兴趣简档。 该配置文件由用户生成的当前文本构建,以便所检索的信息反映了用户的兴趣。 此外,简档也可以保留过去的用户兴趣,使得简档反映了过去和现在的用户兴趣的组合。 然后,系统会自动查询不同的数据库以获取与兴趣资料相关的信息。 数据库可以包括互联网网页,存储在用户的本地网络上的文件以及其他本地或远程数据存储库。 查询可以使用互联网搜索引擎的组合,其特定选择可以取决于查询的性质和/或内容。 然后将响应于查询检索的信息呈现给用户。 检索到的信息可以包含例如用户可能询问的问题的答案和/或与用户当前和持续兴趣相关的数据。 由于用户当前的语音或类型的文本与用户当前的兴趣高度相关,所检索的信息将与用户的实际兴趣相关。 通信流监控,兴趣信息构建,数据库查询和检索信息的呈现都是在当前用户活动的背景下实时自动执行的。

    Automatic user interest profile generation from structured document access information
    3.
    发明授权
    Automatic user interest profile generation from structured document access information 有权
    从结构化文档访问信息生成自动用户兴趣简档

    公开(公告)号:US06385619B1

    公开(公告)日:2002-05-07

    申请号:US09227117

    申请日:1999-01-08

    IPC分类号: G06F1730

    摘要: A system generates user interest profiles by monitoring and analyzing a user's access to a variety of hierarchical levels within a set of structured documents, e.g., documents available at a web site. Each information document has parts associated with it and the documents are classified into categories using a known taxonomy. The user interest profiles are automatically generated based on the type of content viewed by the user. The type of content is determined by the text within the parts of the documents viewed and the classifications of the documents viewed. In addition, the profiles also are generated based on other factors including the frequency and currency of visits to documents having a given classification, and/or the hierarchical depth of the levels or parts of the documents viewed. User profiles include an interest category code and an interest score to indicate a level of interest in a particular category. The profiles are updated automatically to accurately reflect the current interests of an individual, as well as past interests. A time-dependent decay factor is applied to the past interests. The system presents to the user documents or references to documents that match the current profile.

    摘要翻译: 系统通过监视和分析用户对一组结构化文档(例如,在网站上可获得的文档)中的各种层级的访问来生成用户兴趣简档。 每个信息文档都具有与之相关的部分,并且使用已知分类法将文档分类为类别。 基于用户观看的内容类型,自动生成用户兴趣简档。 内容的类型由所查阅文档的部分内容和查看的文档的分类决定。 另外,还可以基于包括对具有给定分类的文档的访问的频率和货币的其他因素和/或所查看的文档的级别或部分的分级深度来生成简档。 用户资料包括兴趣类别代码和利益分数,以指示特定类别的兴趣水平。 配置文件将自动更新,以准确反映个人的当前利益以及过去的兴趣。 时间依赖衰变因子适用于过去的兴趣。 系统向用户提供与当前配置文件匹配的文档或文档。

    Method and apparatus for parallel profile matching in a large scale webcasting system
    4.
    发明授权
    Method and apparatus for parallel profile matching in a large scale webcasting system 失效
    用于大规模网络广播系统中并行配置匹配的方法和装置

    公开(公告)号:US06169989A

    公开(公告)日:2001-01-02

    申请号:US09082747

    申请日:1998-05-21

    IPC分类号: G06F1700

    摘要: A method and apparatus for efficiently matching a large collection of user profiles against a large volume of data in a webcasting system. The invention generally includes in one embodiment four steps to parallelize the profiles. First, an initial profile set is partitioned into several subsets also referred to as sub-partitions using various heuristic methods. Second, each sub-partition is mapped onto one or more independent processing units. Each processing unit is not required to have equal processing performance. However, for best performance results, subset data should be mapped in one embodiment where the subset with a highest cost is mapped to a fastest processor, and the next highest cost subset mapped to the next fastest processor. Where appropriate, the invention evaluates the relative subset processing speed of each processor and adjusts future subset mapping based upon these evaluations. For each information item I that needs to be matched with a profile predicate, a third and a fourth step are executed. The third step broadcasts I to all processing units, and a fourth step performs a sequential profile match on I.

    摘要翻译: 一种用于在网络广播系统中有效地匹配大量用户简档与大量数据的方法和装置。 本发明通常在一个实施例中包括四个步骤来并行化轮廓。 首先,使用各种启发式方法将初始配置文件集划分为几个也称为子分区的子集。 第二,每个子分区映射到一个或多个独立的处理单元。 每个处理单元不需要具有相同的处理性能。 然而,为获得最佳性能结果,应在一个实施例中映射子集数据,其中具有最高成本的子集被映射到最快的处理器,并且将下一个最高成本子集映射到下一个最快的处理器。 在适当的情况下,本发明评估每个处理器的相对子集处理速度,并根据这些评估调整未来的子集映射。 对于需要与配置文件谓词匹配的每个信息项I,执行第三和第四步骤。 第三步将I广播到所有处理单元,第四步对I执行顺序配置文件匹配。

    Active calendar system
    5.
    发明授权
    Active calendar system 失效
    活动日历系统

    公开(公告)号:US06480830B1

    公开(公告)日:2002-11-12

    申请号:US09049661

    申请日:1998-03-27

    IPC分类号: G06F1700

    CPC分类号: G06Q10/109 G06Q10/1095

    摘要: An “active” calendar automatically analyzes a user's calendar entries and sends machine-readable messages to destinations appropriate to content of the calendar entry. A group of event categories is established, each category specifying one class of anticipated calendar entry. An action rule database pre-associates each event category with one or more message formats each having a content and a destination. The action rule database also contains data identifying sources containing the content and destination for each message format. These sources include records of the action rule database itself, subparts of calendar entries of the pre-associated event category, one or more other databases, or a combination of the foregoing. After the calendar receives a user-submitted computer calendar entry describing a planned event, it identifies one of the event categories of the established group that classifies the planned event. For each message format pre-associated with the identified event category, the calendar determines the content and destination for the message as specified by the action rule database, and transmits the message to the destination.

    摘要翻译: “活动”日历自动分析用户的日历条目,并将机器可读消息发送到适合日历条目内容的目的地。 建立一组事件类别,每个类别指定一类预期的日历项。 动作规则数据库将每个事件类别与每个具有内容和目的地的一个或多个消息格式相关联。 动作规则数据库还包含标识每个消息格式的内容和目标的源的数据。 这些来源包括动作规则数据库本身的记录,预关联事件类别的日历条目的子部分,一个或多个其他数据库或前述的组合。 在日历接收到描述计划事件的用户提交的计算机日历条目之后,它识别为计划事件分类的已建立组的事件类别之一。 对于与识别的事件类别预先关联的每个消息格式,日历确定由动作规则数据库指定的消息的内容和目的地,并将该消息发送到目的地。

    Graphical user interface to query music by examples
    6.
    发明授权
    Graphical user interface to query music by examples 失效
    图形用户界面,通过示例查询音乐

    公开(公告)号:US06674452B1

    公开(公告)日:2004-01-06

    申请号:US09543218

    申请日:2000-04-05

    IPC分类号: G06F1730

    摘要: A According to the invention, a music search system includes a music player, music analyzer, a search engine and a sophisticated user interface that enables users to visually build complex query profiles from the structural information of one or more musical pieces. The complex query profiles are useful for performing searches for musical pieces matching the structural information in the query profile. The system allows the user to supply an existing piece of music, or some components thereof, as query arguments, and lets the music search engine find music that is similar to the given sample by certain similarity measurement.

    摘要翻译: 根据本发明,音乐搜索系统包括音乐播放器,音乐分析器,搜索引擎和复杂的用户界面,使得用户能够从一个或多个音乐作品的结构信息可视地构建复杂的查询配置文件。 复杂查询配置文件对于执行与查询配置文件中的结构信息匹配的音乐作品的搜索很有用。 该系统允许用户提供现有的音乐片段或其一些组件作为查询参数,并且使得音乐搜索引擎通过某种相似性度量来找到与给定样本类似的音乐。

    Method and apparatus for music summarization and creation of audio summaries
    7.
    发明授权
    Method and apparatus for music summarization and creation of audio summaries 失效
    用于音乐汇总和创建音频摘要的方法和设备

    公开(公告)号:US06225546B1

    公开(公告)日:2001-05-01

    申请号:US09543715

    申请日:2000-04-05

    IPC分类号: A63H500

    摘要: A method and system for generating audio summaries of musical pieces receives computer readable data representing the musical piece and generates therefrom an audio summary including the main melody of the musical piece. A component builder generates a plurality of composite and primitive components representing the structural elements of the musical piece and creates a hierarchical representation of the components. The most primitive components, representing notes within the composition, are examined to determine repetitive patterns within the composite components. A melody detector examines the hierarchical representation of the components and uses algorithms to detect which of the repetitive patterns is the main melody of the composition. Once the main melody is detected, the segment of the musical data containing the main melody is provided in one or more formats. Musical knowledge rules representing specific genres of musical styles may be used to assist the component builder and melody detector in determining which primitive component patterns are the most likely candidates for the main melody.

    摘要翻译: 用于产生音乐作品的音频摘要的方法和系统接收表示乐曲的计算机可读数据,并从中产生包括乐曲的主旋律的音频摘要。 组件构建器生成表示乐曲的结构元素的多个复合和原始组件,并创建组件的分层表示。 检查组成中代表笔记的最原始组件,以确定复合组件内的重复模式。 旋律检测器检查组件的分层表示,并使用算法来检测哪些重复模式是组合的主旋律。 一旦检测到主旋律,则包含主旋律的音乐数据的片段以一种或多种格式提供。 可以使用表示音乐风格的特定类型的音乐知识规则来帮助组件构建器和旋律检测器确定哪些原始组件模式是主旋律最可能的候选者。

    Efficient large-scale access control for internet/intranet information systems
    8.
    发明授权
    Efficient large-scale access control for internet/intranet information systems 失效
    有效的大规模访问控制互联网/内部网信息系统

    公开(公告)号:US06219667B1

    公开(公告)日:2001-04-17

    申请号:US09086272

    申请日:1998-05-28

    申请人: Qi Lu Shang-Hua Teng

    发明人: Qi Lu Shang-Hua Teng

    IPC分类号: G06F1700

    摘要: An efficient method and apparatus for regulating access to information objects stored in a database in which there are a large number of users and access groups. The invention uses a representation of a hierarchical access group structure in terms of intervals over a set of integers and a decomposition scheme that reduces any group structure to ones that have interval representation. This representation allows the problem for checking access rights to be reduced to an interval containment problem. An interval tree, a popular data structure in computational geometry, may be implemented to efficiently execute the access-right checking method.

    摘要翻译: 一种有效的方法和装置,用于调节对存储在数据库中的信息对象的访问,其中存在大量的用户和访问组。 本发明使用对一组整数的间隔的分级存取组结构的表示,以及将任何组结构减小为具有间隔表示的组的结构的分解方案。 该表示允许将访问权限检查的问题减少到间隔容纳问题。 可以实现间隔树,即计算几何中的流行数据结构,以有效地执行访问权限检查方法。

    Integrated touch-skin notification system for wearable computing devices
    9.
    发明授权
    Integrated touch-skin notification system for wearable computing devices 失效
    用于穿戴式计算设备的综合触皮通知系统

    公开(公告)号:US06218958B1

    公开(公告)日:2001-04-17

    申请号:US09168630

    申请日:1998-10-08

    IPC分类号: G08B522

    CPC分类号: G08B6/00 H04M19/047

    摘要: A tactile notification device that can be embodied in, e.g., a wristwatch, communicates via wireless link with plural personal computing devices, including cellular telephones, pagers, and palm top computers, of the person wearing the notification device. When one of the personal computing devices alerts, e.g., when the telephone receives an incoming call, the pager receives a page, or the palm top computer receives an email, the personal computing device sends a signal to the notification device, which generates a discrete tactile signal against the person's skin. The notification device can generate different tactile signals, and each tactile signal can be correlated as desired by the user to one of the personal computing devices. In one embodiment, opposed pinch bars are provided on the skin-facing tactile surface of a wristwatch to gently pinch the skin and thereby establish a first tactile signal that can be correlated to, for example, an alert for an incoming phone call. Also, a rotating bar can be provided on the tactile surface of the wristwatch, and the tactile signal that corresponds to, e.g., an incoming page can be established by rotating the bar against the skin.

    摘要翻译: 可以体现在例如手表中的触觉通知装置通过无线连接与携带通知装置的人的多个个人计算设备(包括蜂窝电话,寻呼机和掌上电脑)进行通信。 当个人计算设备中的一个提醒时,例如,当电话接收到来电时,寻呼机接收页面,或者掌上电脑接收到电子邮件,个人计算设备向通知设备发送信号,该信号产生离散 对人的皮肤的触觉信号。 通知装置可以产生不同的触觉信号,并且每个触觉信号可以根据用户的需要与个人计算设备之一相关联。 在一个实施例中,相对的夹杆设置在手表的面向皮肤的触感表面上,以轻轻地夹紧皮肤,从而建立可以与例如呼入电话的警报相关联的第一触觉信号。 此外,可以在手表的触感表面上设置旋转杆,并且可以通过使杆相对于皮肤旋转来建立与例如输入页对应的触觉信号。

    System and method for personalized information filtering and alert generation
    10.
    发明授权
    System and method for personalized information filtering and alert generation 有权
    用于个性化信息过滤和警报生成的系统和方法

    公开(公告)号:US06381594B1

    公开(公告)日:2002-04-30

    申请号:US09613436

    申请日:2000-07-11

    IPC分类号: G06F1730

    摘要: A search engine that forms a compact representation of a plurality of user queries to efficiently find desired information in an information network. The search engine comprises a profile processor having logic to receive the queries from the users and a search module. The search module is coupled to the profile processor and has logic to receive the information content, to combine the user queries into a master query, and to match the master query with the information content to determine matching content. The search engine also includes logic to analyze the matching content to determine if any of the queries has been satisfied.

    摘要翻译: 搜索引擎,其形成多个用户查询的紧凑表示,以有效地在信息网络中找到所需信息。 搜索引擎包括具有用于接收来自用户和搜索模块的查询的逻辑的简档处理器。 搜索模块耦合到简档处理器,并且具有接收信息内容的逻辑,将用户查询组合成主查询,并且将主查询与信息内容相匹配以确定匹配内容。 搜索引擎还包括分析匹配内容以确定是否满足任何查询的逻辑。