LARGE SCALE SEARCH BOT DETECTION
    1.
    发明申请
    LARGE SCALE SEARCH BOT DETECTION 审中-公开
    大规模搜索检测

    公开(公告)号:US20110208714A1

    公开(公告)日:2011-08-25

    申请号:US12708541

    申请日:2010-02-19

    IPC分类号: G06F17/30 G06F21/00

    摘要: A framework may be used for identifying low-rate search bot traffic within query logs by capturing groups of distributed, coordinated search bots. Search log data may be input to a history-based anomaly detection engine to determine if query-click pairs associated with a query are suspicious in view of historical query-click pairs for the query. Users associated with suspicious query-click pairs may be input to a matrix-based bot detection engine to determine correlations between queries submitted by the users. Those users indicating strong correlations may be categorized as bots, whereas those who do not may be categorized as part of flash crowd traffic.

    摘要翻译: 可以通过捕获分布式,协调的搜索机器人组来识别查询日志中的低速搜索bot流量的框架。 搜索日志数据可以被输入到基于历史的异常检测引擎,以鉴于查询的历史查询 - 点击对来确定与查询相关联的查询 - 点击对是否是可疑的。 与可疑查询点击对相关联的用户可以输入到基于矩阵的机器人检测引擎,以确定用户提交的查询之间的相关性。 指示强相关性的用户可能被归类为机器人,而不能被分类为闪存人群流量的一部分的那些用户。

    GENERATING ANONYMOUS LOG ENTRIES
    2.
    发明申请
    GENERATING ANONYMOUS LOG ENTRIES 有权
    产生匿名登录

    公开(公告)号:US20090198746A1

    公开(公告)日:2009-08-06

    申请号:US12024989

    申请日:2008-02-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30

    摘要: Assigning session identifications to log entries and generating anonymous log entries are provided. In order to balance users' privacy concerns with the need for analysis of the log entries to provide high quality search results, non-user-specific data fields, such as a user's location (e.g., city, state, and latitude/longitude) and connection speed, are inserted into the log entries, and user-specific data fields, such as the IP address and cookie identifications, are deleted from the log entries. In addition or alternatively, prior to anonymization of the log entries, session identifications are assigned to identified groups of log entries. The groups are identified based on factors such as the user's identification, the IP address, the time of search, and differences between the search terms used in the search queries.

    摘要翻译: 为会话标识分配日志条目和生成匿名日志条目。 为了平衡用户的隐私问题,需要分析日志条目以提供高质量的搜索结果,非用户特定的数据字段(例如用户的位置(例如城市,州和纬度/经度))和 连接速度被插入到日志条目中,并且从日志条目中删除用户特定的数据字段,例如IP地址和cookie标识。 另外或替代地,在匿名日志条目之前,将会话标识分配给所识别的日志条目组。 基于用户的识别,IP地址,搜索时间以及搜索查询中使用的搜索词之间的差异来确定组。

    Evaluating the ranking quality of a ranked list
    3.
    发明授权
    Evaluating the ranking quality of a ranked list 有权
    评估排名列表的排名质量

    公开(公告)号:US09449078B2

    公开(公告)日:2016-09-20

    申请号:US12243937

    申请日:2008-10-01

    IPC分类号: G06F17/30

    摘要: The ranking quality of a ranked list may be evaluated. In an example embodiment, a method is implemented by a system to access log data, ascertain which entries of a ranked list are skipped, and determine a ranking quality metric from the skipped entries. More specifically, log data that reflects user interactions with a ranked list having multiple entries is accessed. The user interactions include at least indications of which of the multiple entries are selected entries. It is ascertained which entries of the multiple entries of the ranked list are skipped entries based on the selected entries. The ranking quality metric for the ranked list is determined responsive to the skipped entries.

    摘要翻译: 可以评估排名列表的排名质量。 在一个示例性实施例中,系统通过系统实现访问日志数据,确定跳过排名列表的哪些条目并从跳过的条目确定排序质量度量的方法。 更具体地,访问反映与具有多个条目的排名列表的用户交互的日志数据。 用户交互包括至少指示多个条目中的哪一个是选择的条目。 基于所选择的条目,确定排序列表的多个条目的哪些条目被跳过条目。 响应于跳过的条目来确定排名列表的排名质量度量。

    GENERATING ANONYMOUS LOG ENTRIES
    4.
    发明申请
    GENERATING ANONYMOUS LOG ENTRIES 审中-公开
    产生匿名登录

    公开(公告)号:US20110167043A1

    公开(公告)日:2011-07-07

    申请号:US13050706

    申请日:2011-03-17

    IPC分类号: G06F17/30

    CPC分类号: G06F16/00

    摘要: Assigning session identifications to log entries and generating anonymous log entries are provided. In order to balance users' privacy concerns with the need for analysis of the log entries to provide high quality search results, non-user-specific data fields, such as a user's location (e.g., city, state, and latitude/longitude) and connection speed, are inserted into the log entries, and user-specific data fields, such as the IP address and cookie identifications, are deleted from the log entries. In addition or alternatively, prior to anonymization of the log entries, session identifications are assigned to identified groups of log entries. The groups are identified based on factors such as the user's identification, the IP address, the time of search, and differences between the search terms used in the search queries.

    摘要翻译: 为会话标识分配日志条目和生成匿名日志条目。 为了平衡用户的隐私问题,需要分析日志条目以提供高质量的搜索结果,非用户特定的数据字段(例如用户的位置(例如城市,州和纬度/经度))和 连接速度被插入到日志条目中,并且从日志条目中删除用户特定的数据字段,例如IP地址和cookie标识。 另外或替代地,在匿名日志条目之前,将会话标识分配给所识别的日志条目组。 基于用户的识别,IP地址,搜索时间以及搜索查询中使用的搜索词之间的差异来确定组。

    Generating anonymous log entries
    5.
    发明授权
    Generating anonymous log entries 有权
    生成匿名日志条目

    公开(公告)号:US07937383B2

    公开(公告)日:2011-05-03

    申请号:US12024989

    申请日:2008-02-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30

    摘要: Assigning session identifications to log entries and generating anonymous log entries are provided. In order to balance users' privacy concerns with the need for analysis of the log entries to provide high quality search results, non-user-specific data fields, such as a user's location (e.g., city, state, and latitude/longitude) and connection speed, are inserted into the log entries, and user-specific data fields, such as the IP address and cookie identifications, are deleted from the log entries. In addition or alternatively, prior to anonymization of the log entries, session identifications are assigned to identified groups of log entries. The groups are identified based on factors such as the user's identification, the IP address, the time of search, and differences between the search terms used in the search queries.

    摘要翻译: 为会话标识分配日志条目和生成匿名日志条目。 为了平衡用户的隐私问题,需要分析日志条目以提供高质量的搜索结果,非用户特定的数据字段(例如用户的位置(例如城市,州和纬度/经度))和 连接速度被插入到日志条目中,并且从日志条目中删除用户特定的数据字段,例如IP地址和cookie标识。 另外或替代地,在匿名日志条目之前,将会话标识分配给所识别的日志条目组。 基于用户的识别,IP地址,搜索时间以及搜索查询中使用的搜索词之间的差异来确定组。

    EVALUATING THE RANKING QUALITY OF A RANKED LIST
    6.
    发明申请
    EVALUATING THE RANKING QUALITY OF A RANKED LIST 有权
    评估排名列表的排名质量

    公开(公告)号:US20100082566A1

    公开(公告)日:2010-04-01

    申请号:US12243937

    申请日:2008-10-01

    IPC分类号: G06F17/30

    摘要: The ranking quality of a ranked list may be evaluated. In an example embodiment, a method is implemented by a system to access log data, ascertain which entries of a ranked list are skipped, and determine a ranking quality metric from the skipped entries. More specifically, log data that reflects user interactions with a ranked list having multiple entries is accessed. The user interactions include at least indications of which of the multiple entries are selected entries. It is ascertained which entries of the multiple entries of the ranked list are skipped entries based on the selected entries. The ranking quality metric for the ranked list is determined responsive to the skipped entries.

    摘要翻译: 可以评估排名列表的排名质量。 在一个示例实施例中,系统通过系统实现访问日志数据的方法,确定排列列表的哪些条目被跳过,并且从跳过的条目确定排序质量度量。 更具体地,访问反映与具有多个条目的排名列表的用户交互的日志数据。 用户交互包括至少指示多个条目中的哪一个是选择的条目。 基于所选择的条目,确定排序列表的多个条目的哪些条目被跳过条目。 响应于跳过的条目来确定排名列表的排名质量度量。

    USER ANALYSIS THROUGH USER LOG FEATURE EXTRACTION
    7.
    发明申请
    USER ANALYSIS THROUGH USER LOG FEATURE EXTRACTION 审中-公开
    用户分析通过用户日志功能提取

    公开(公告)号:US20120278354A1

    公开(公告)日:2012-11-01

    申请号:US13097277

    申请日:2011-04-29

    IPC分类号: G06F17/30

    CPC分类号: G06Q10/063

    摘要: Systems, methods, and computer media for efficiently processing user log data are provided. A received user log data analysis request specifies: target user log features that identify users in a target user group, analysis user log features that identify data associated with the users in the target user group, and an analysis to perform on the identified data associated with the users in the target user group. Occurrences of specified features are extracted from user logs and stored. Users associated with an occurrence of each of the extracted and stored target user log features are identified as users in the target user group. Occurrences of the analysis user log features that are associated with a user in the target user group are extracted and reformatted for the analysis specified in the analysis request.

    摘要翻译: 提供了用于有效处理用户日志数据的系统,方法和计算机媒体。 接收到的用户日志数据分析请求指定:标识目标用户组中的用户的目标用户日志功能,分析用于识别与目标用户组中的用户相关联的数据的用户日志功能,以及对与 目标用户组中的用户。 指定功能的发生从用户日志中提取并存储。 与提取和存储的每个目标用户日志特征中的每一个相关联的用户被标识为目标用户组中的用户。 与目标用户组中的用户相关联的分析用户日志功能的出现被提取并重新格式化以用于分析请求中指定的分析。

    Adaptive systems and methods for making software easy to use via software usage mining
    8.
    发明授权
    Adaptive systems and methods for making software easy to use via software usage mining 有权
    通过软件使用挖掘使软件易于使用的自适应系统和方法

    公开(公告)号:US07802197B2

    公开(公告)日:2010-09-21

    申请号:US11112683

    申请日:2005-04-22

    IPC分类号: G06F3/048

    CPC分类号: G06F9/451

    摘要: A system for dynamically updating user accessible features of a software application on a client computer has a user interface, a local usage data file, and a data mining engine. The user interface is adapted to receive operator inputs. The local usage data file is adapted to store usage information corresponding to the operator inputs. The data mining engine is adapted to process the stored usage information and to generate local adjustments to a user interface of the software application based on the operator inputs. In one embodiment, a server is adapted to receive usage data from a plurality of application instances on a plurality of client computers and to generate global adjustments based on the received usage data. In one embodiment, the system has a merge feature adapted to blend and resolve conflicts between local and global adjustments to generate an interface adjustment for the user interface.

    摘要翻译: 用于在客户端计算机上动态地更新软件应用的用户可访问特征的系统具有用户界面,本地使用数据文件和数据挖掘引擎。 用户界面适于接收操作员输入。 本地使用数据文件适于存储对应于操作者输入的使用信息。 数据挖掘引擎适于处理存储的使用信息,并且基于操作者输入产生对软件应用的用户界面的局部调整。 在一个实施例中,服务器适于从多个客户端计算机上的多个应用实例接收使用数据,并且基于接收到的使用数据生成全局调整。 在一个实施例中,系统具有适于混合和解决局部和全局调整之间的冲突的合并特征,以生成用户界面的接口调整。

    Data mining techniques for improving search engine relevance
    9.
    发明申请
    Data mining techniques for improving search engine relevance 审中-公开
    数据挖掘技术,提高搜索引擎的相关性

    公开(公告)号:US20060224579A1

    公开(公告)日:2006-10-05

    申请号:US11096153

    申请日:2005-03-31

    申请人: Zijian Zheng

    发明人: Zijian Zheng

    IPC分类号: G06F17/30

    CPC分类号: G06F16/951

    摘要: The subject invention relates to systems and methods that automatically learn data relevance from past search activities and apply such learning to facilitate future search activities. In one aspect, an automated information retrieval system is provided. The system includes a learning component that analyzes stored information retrieval data to determine relevance patterns from past user information search activities. A search component employs the learning component to determine a subset of current search results based at least in part on the relevance patterns, wherein numerous variables can be processed in accordance with the learning component to efficiently generate focused, prioritized, and relevant search results.

    摘要翻译: 本发明涉及自动地从过去的搜索活动中学习数据相关性并应用这种学习以促进未来搜索活动的系统和方法。 一方面,提供了一种自动信息检索系统。 该系统包括分析存储的信息检索数据以确定来自过去用户信息搜索活动的相关性模式的学习部件。 搜索组件使用学习组件至少部分地基于相关性模式来确定当前搜索结果的子集,其中可以根据学习组件处理许多变量以有效地生成聚焦,优先级和相关的搜索结果。

    Composite tip array for polymer pen lithography
    10.
    发明授权
    Composite tip array for polymer pen lithography 有权
    用于聚合物笔光刻的复合尖端阵列

    公开(公告)号:US09079338B2

    公开(公告)日:2015-07-14

    申请号:US13467552

    申请日:2012-05-09

    摘要: A method of preparing a tip for lithography, includes forming a mold having at least one recess; disposing a first polymer in the recess to form an apex of the tip,; curing the first polymer in the recess; and disposing a second polymer in the recess to form a base of the tip. The Young's Modulus of the second polymer is lower than the Young's Modulus of the first polymer. The tip structure for lithography includes a substrate, and a layered structure including a tip having an apex of a first polymer and a base of a second polymer. The first polymer is less resiliently deformable than the second polymer.

    摘要翻译: 一种制备光刻用尖端的方法,包括:形成具有至少一个凹部的模具; 在所述凹部中设置第一聚合物以形成所述尖端的顶点; 固化凹槽中的第一聚合物; 并在所述凹部中设置第二聚合物以形成所述尖端的基部。 第二聚合物的杨氏模量低于第一聚合物的杨氏模量。 用于光刻的尖端结构包括基底和包括具有第一聚合物的顶点和第二聚合物的基底的末端的层状结构。 第一聚合物比第二聚合物弹性变形少。