Systems and methods for performing electronic information retrieval
    2.
    Granted invention patent
    Systems and methods for performing electronic information retrieval [In force]

    Publication No.: EP1524610B1

    Publication date: 2018-03-28

    Application No.: EP04024558.1

    Filing date: 2004-10-14

    Applicant: Xerox Corporation

    IPC classification: G06F17/30

    Abstract: Systems and methods for identifying output documents similar to an input document. A query is formulated using a list of best keywords from the input document to search for a first set of output documents. The list of best keywords is defined with a maximum number of keywords less than the total number of keywords in the list of best keywords that are identified as belonging to a domain-specific dictionary of words and as having no measurable linguistic frequency. Lists of keywords are identified for each output document in the first set of documents. A second set of similar documents is determined using a measure of similarity that is computed between keywords identified in the input document and each output document in the first set of documents. In addition, a measure of similarity between two documents is computed. In computing the measure of similarity, a first list of rated keywords extracted from the first document and a second list of rated keywords extracted from the second document are received. The first and second lists of keywords are used to determine whether the first document forms part of the second document using a first computed percentage indicating what percentage of keyword ratings in the first list also exist in the second list. When the first percentage indicates that the first document is included in the second document, a second percentage is computed that indicates what percentage of keyword ratings in the first list, along with a set of their neighboring keyword ratings, also exist in the second list. The first percentage is used to specify the measure of similarity when the second percentage is greater than the first percentage.
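
The two percentages described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the representation of a rated keyword as a `(keyword, rating)` pair, the function names, and the reading of "neighboring keyword ratings" as the entries immediately before and after an item in the first list (`radius=1`) are all assumptions made for illustration. The rule that selects the final similarity measure is left out because the abstract does not give enough detail to reconstruct it.

```python
from typing import List, Tuple

# Hypothetical representation: one (keyword, rating) pair per extracted keyword.
RatedKeyword = Tuple[str, int]


def shared_percentage(first: List[RatedKeyword], second: List[RatedKeyword]) -> float:
    """Percentage of entries in `first` that also exist in `second`."""
    if not first:
        return 0.0
    second_set = set(second)
    hits = sum(1 for item in first if item in second_set)
    return 100.0 * hits / len(first)


def shared_with_neighbors_percentage(
    first: List[RatedKeyword], second: List[RatedKeyword], radius: int = 1
) -> float:
    """Percentage of entries in `first` that exist in `second` together with
    their neighbors (up to `radius` positions on each side in `first`)."""
    if not first:
        return 0.0
    second_set = set(second)
    hits = 0
    for i in range(len(first)):
        # Window of the entry itself plus its neighbors, clipped at the list ends.
        window = first[max(0, i - radius): i + radius + 1]
        if all(item in second_set for item in window):
            hits += 1
    return 100.0 * hits / len(first)


first_doc = [("retrieval", 3), ("keyword", 2), ("query", 1), ("index", 1)]
second_doc = [("retrieval", 3), ("keyword", 2), ("query", 1), ("document", 2)]

print(shared_percentage(first_doc, second_doc))                 # 75.0
print(shared_with_neighbors_percentage(first_doc, second_doc))  # 50.0
```

In this example three of the four rated keywords of the first document also occur in the second, and two of them occur there together with all of their immediate neighbors.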

    System and method for using a workspace data manager to access, manipulate and synchronize network data
    5.
    Published invention application
    System and method for using a workspace data manager to access, manipulate and synchronize network data [Under examination, published]

    Publication No.: EP2328100A1

    Publication date: 2011-06-01

    Application No.: EP10184699.6

    Filing date: 1999-01-15

    Applicant: Visto Corporation

    Inventor: Mendez, Daniel

    IPC classification: G06F17/30

    Abstract: A system includes a communications module for downloading workspace data (135) from a remote site, an application program interface coupled to the communications module for communicating with a workspace data manager (160) to enable manipulation of the downloaded workspace data and thereby create manipulated data, and a general synchronization module (130) coupled to the communications module for synchronizing the manipulated data with the workspace data (135) stored at the remote site. An instantiator requests the workspace data manager to provide an interface for enabling manipulation of the downloaded workspace data. The workspace data manager may create another instance of the interface or may provide access to its only interface to enable manipulation of the data. A data reader may translate the downloaded workspace data from the format used by the remote site to the format used by the workspace data manager. Upon logout, the de-instantiator synchronizes the data with the global server and deletes the workspace data. The system handles both the situation where the data stored at the remote site has not changed and therefore includes the downloaded data, and the situation where the data stored at the remote site has been modified and is therefore different from the downloaded data.

    SYSTEMS AND METHODS FOR GENERATING CONCEPT UNITS FROM SEARCH QUERIES
    9.
    Published invention application
    SYSTEMS AND METHODS FOR GENERATING CONCEPT UNITS FROM SEARCH QUERIES [Under examination, published]

    Publication No.: EP1611506A4

    Publication date: 2008-07-30

    Application No.: EP04758861

    Filing date: 2004-04-02

    Applicant: YAHOO INC

    IPC classification: G06F17/30 G06F7/00

    Abstract: Systems and methods for enhancing search functionality provided to a user. In certain aspects, a query processing engine automatically decomposes queries into constituent units that are related to concepts in which a user may be interested. The query processing engine decomposes queries into one or more constituent units per query using statistical methods. In certain aspects, no real-world knowledge is used in determining units. In other aspects, aspects of world and content knowledge are introduced to enhance and optimize performance, for example, manually using a team of one or more information engineers.
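
As a rough illustration of decomposing a query into units by purely statistical means, the Python sketch below groups two adjacent words into one unit when that pair occurs often enough in a log of past queries. The abstract does not say which statistics are used, so the adjacent-pair counting, the greedy left-to-right grouping, the `min_count` threshold, and all function names are assumptions chosen for illustration only.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def count_adjacent_pairs(queries: Iterable[str]) -> Counter:
    """Count how often each pair of adjacent words occurs across a query log."""
    pairs: Counter = Counter()
    for query in queries:
        words = query.lower().split()
        pairs.update(zip(words, words[1:]))
    return pairs


def decompose(query: str, pairs: Counter, min_count: int = 2) -> List[str]:
    """Greedily split a query into units: two adjacent words form one unit
    when the pair was seen at least `min_count` times in the log."""
    words = query.lower().split()
    units: List[str] = []
    i = 0
    while i < len(words):
        pair: Tuple[str, ...] = tuple(words[i:i + 2])
        if len(pair) == 2 and pairs[pair] >= min_count:
            units.append(" ".join(pair))
            i += 2
        else:
            units.append(words[i])
            i += 1
    return units


# Hypothetical query log; no world knowledge is used, only co-occurrence counts.
log = ["new york hotels", "new york weather", "cheap hotels", "weather forecast"]
pair_counts = count_adjacent_pairs(log)

print(decompose("cheap new york hotels", pair_counts))
# ['cheap', 'new york', 'hotels']
```

Only "new york" appears twice as an adjacent pair in the log, so it is the only multi-word unit; every other word stays a unit of its own.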

    COMMUNICATION CONTROL DEVICE AND COMMUNICATION CONTROL SYSTEM
    10.
    Published invention application
    COMMUNICATION CONTROL DEVICE AND COMMUNICATION CONTROL SYSTEM [Under examination, published]

    Publication No.: EP1868103A1

    Publication date: 2007-12-19

    Application No.: EP05727698.2

    Filing date: 2005-03-28

    Inventor: NAGOYA, Mitsugu

    IPC classification: G06F13/00

    Abstract: The present invention provides a technique for enabling a high-speed communication control apparatus.
    A packet processing circuit 20 of a communication control apparatus includes a user database 57, a virus list 161, a whitelist 162, a blacklist 163 and a common category list 164. Upon acquisition of a request for access to a content, matching between information on a user who has sent the access request and the user database 57 is performed by a search circuit 30, so as to authenticate the user. When the user is authenticated, the search circuit 30 performs matching between the URL of the content to be accessed and the virus list 161, whitelist 162, blacklist 163 and common category list 164. A process execution circuit 40 controls the permission for the access based on the search result of the search circuit 30 and determination conditions stored in a second database 60. The packet processing circuit 20 is configured with a wired logic circuit.
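
The flow in this abstract (authenticate the user, match the URL against the lists, then decide on access) can be sketched in software as follows. The abstract describes a wired logic circuit, so this Python version only illustrates the control flow. The class and function names, the order in which the lists take precedence, and the `denied_categories` parameter standing in for the determination conditions of the second database are assumptions, not details taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Set


@dataclass
class UrlLists:
    """Hypothetical stand-in for the virus list, whitelist, blacklist and
    common category list named in the abstract."""
    virus: Set[str] = field(default_factory=set)
    whitelist: Set[str] = field(default_factory=set)
    blacklist: Set[str] = field(default_factory=set)
    category: Dict[str, str] = field(default_factory=dict)  # URL -> category


def search(url: str, lists: UrlLists) -> Dict[str, Optional[object]]:
    """Match the URL against every list (the role of the search circuit)."""
    return {
        "virus": url in lists.virus,
        "whitelist": url in lists.whitelist,
        "blacklist": url in lists.blacklist,
        "category": lists.category.get(url),
    }


def control_access(user: str, url: str, users: Set[str],
                   lists: UrlLists, denied_categories: Set[str]) -> bool:
    """Return True when access is permitted. The precedence used here
    (virus > whitelist > blacklist > category) is an assumption."""
    if user not in users:  # user authentication against the user database
        return False
    hit = search(url, lists)
    if hit["virus"]:
        return False
    if hit["whitelist"]:
        return True
    if hit["blacklist"]:
        return False
    return hit["category"] not in denied_categories


lists = UrlLists(
    virus={"http://bad.example/malware"},
    whitelist={"http://intranet.example/"},
    blacklist={"http://spam.example/"},
    category={"http://games.example/": "games"},
)
users = {"alice"}

print(control_access("alice", "http://intranet.example/", users, lists, {"games"}))  # True
print(control_access("alice", "http://games.example/", users, lists, {"games"}))     # False
print(control_access("bob", "http://intranet.example/", users, lists, {"games"}))    # False
```

Set and dictionary lookups keep each match at constant time on average, which loosely mirrors the goal of fast matching, although a hardware search circuit works very differently.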
