-
Publication No.: US07555480B2
Publication date: 2009-06-30
Application No.: US11456753
Filing date: 2006-07-11
Applicant: Benyu Zhang, Chenxi Lin, Hua-Jun Zeng, Jian Wang, Ke Tang, Zheng Chen
Inventor: Benyu Zhang, Chenxi Lin, Hua-Jun Zeng, Jian Wang, Ke Tang, Zheng Chen
CPC classification: G06F17/30864, Y10S707/99935
Abstract: The invention provides a method of interactively crawling data records on a web page. Users may select data records of interest on a web page to generate templates, which are then used to search for similar data items on the same web page or on different web pages. A tree matching algorithm may be used to compare and extract data matching the generated template.
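Note: the abstract does not say which tree matching algorithm is used. As a minimal sketch, assuming the well-known Simple Tree Matching approach (a top-down dynamic program over ordered child lists), a user-selected template could be compared with a candidate record as follows. The `Node` class and the `similarity` normalization are hypothetical names introduced only for this illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical stand-in for a DOM element: a tag plus ordered children."""
    tag: str
    children: list = field(default_factory=list)


def simple_tree_match(a: Node, b: Node) -> int:
    """Count the nodes in the largest top-down, order-preserving mapping."""
    if a.tag != b.tag:
        return 0
    m, n = len(a.children), len(b.children)
    # dp[i][j]: best match using the first i children of a and first j of b.
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            paired = dp[i - 1][j - 1] + simple_tree_match(
                a.children[i - 1], b.children[j - 1]
            )
            dp[i][j] = max(dp[i - 1][j], dp[i][j - 1], paired)
    return dp[m][n] + 1  # +1 for the matched roots


def size(t: Node) -> int:
    """Number of nodes in the tree rooted at t."""
    return 1 + sum(size(c) for c in t.children)


def similarity(template: Node, candidate: Node) -> float:
    """Normalized score in [0, 1]; 1.0 means the trees have the same shape."""
    return 2 * simple_tree_match(template, candidate) / (
        size(template) + size(candidate)
    )


template = Node("li", [Node("a"), Node("span")])
candidate = Node("li", [Node("a"), Node("img"), Node("span")])
print(similarity(template, candidate))
```

A candidate whose score exceeds some chosen threshold would be treated as another instance of the selected record; the threshold itself is a tuning choice not given in the abstract.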
-
Publication No.: US20080288491A1
Publication date: 2008-11-20
Application No.: US11803503
Filing date: 2007-05-15
Applicant: Min Wu, Chenxi Lin, Benyu Zhang, Zheng Chen, Jian Wang
Inventor: Min Wu, Chenxi Lin, Benyu Zhang, Zheng Chen, Jian Wang
IPC classification: G06F17/30
CPC classification: G06Q30/02, G06F17/30867
Abstract: Described is a behavioral targeting technology for online advertising, by which an original attribute is uniformly expanded. Users that meet an original attribute are aggregated into a mid-result used to determine similarity relative to candidate attribute types. The most similar candidate attributes are selected for the expanded attribute. A URL/URL pattern suggestion technology is provided, with similarity computed from users/URLs visited by the users. URLs are separated into URL tree nodes, for calculating the number of users who have visited each URL and the number of users who have visited any URL in the sub-tree rooted at that node. URL/URL patterns are output based on similarity. Domains are also suggested based on user visits. Similarities between pairs of domains may be computed (e.g., offline), with the output for a given domain based on its similarity with each other domain.
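Note: the URL tree described in the abstract can be pictured with a small sketch. This is an illustration under stated assumptions, not the patented implementation: URLs are split into host and path segments, and each node keeps two user sets, one for visits to exactly that URL and one for visits anywhere in its sub-tree. `UrlNode`, `add_visit`, and `find` are hypothetical names.

```python
from urllib.parse import urlsplit


class UrlNode:
    """One node of the URL tree (hypothetical structure for illustration)."""

    def __init__(self):
        self.children = {}
        self.direct_users = set()   # users who visited exactly this URL
        self.subtree_users = set()  # users who visited this URL or any below it


def segments(url):
    """Split a URL into host followed by its non-empty path segments."""
    parts = urlsplit(url)
    return [parts.netloc] + [s for s in parts.path.split("/") if s]


def add_visit(root, user, url):
    """Record that `user` visited `url`, updating counts along the path."""
    node = root
    node.subtree_users.add(user)
    for seg in segments(url):
        node = node.children.setdefault(seg, UrlNode())
        node.subtree_users.add(user)
    node.direct_users.add(user)


def find(root, url):
    """Return the node for `url`, or None if no visit created it."""
    node = root
    for seg in segments(url):
        node = node.children.get(seg)
        if node is None:
            return None
    return node


root = UrlNode()
add_visit(root, "u1", "http://example.com/a/b")
add_visit(root, "u2", "http://example.com/a/c")
add_visit(root, "u1", "http://example.com/a")
node = find(root, "http://example.com/a")
print(len(node.direct_users), len(node.subtree_users))
```

Storing user sets rather than bare counters keeps repeat visits by the same user from being counted twice; a production system would likely use a more compact representation.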
-
Publication No.: US20080288483A1
Publication date: 2008-11-20
Application No.: US11804627
Filing date: 2007-05-18
Applicant: Chenxi Lin, Lei Ji, Huajun Zeng, Benyu Zhang, Zheng Chen, Jian Wang
Inventor: Chenxi Lin, Lei Ji, Huajun Zeng, Benyu Zhang, Zheng Chen, Jian Wang
IPC classification: G06F17/30
CPC classification: G06F17/30675
Abstract: Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverse document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as scores computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each of the narrowed subset of rows (that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.
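Note: a simplified sketch of the retrieval idea in this abstract follows. It assumes an in-memory index mapping each term to a row of `(doc_id, score)` pairs sorted by `doc_id`, with the scores precomputed elsewhere (for example by BM25). It keeps only the highest-IDF terms and uses a plain binary search per row; the pointer-jumping step mentioned in the abstract is omitted. The names `idf` and `search` and the index layout are assumptions made for illustration.

```python
import math
from bisect import bisect_left


def idf(term, index, n_docs):
    """Inverse document frequency; rarer terms get larger values."""
    df = len(index.get(term, []))
    return math.log(n_docs / df) if df else 0.0


def search(query_terms, index, n_docs, top_terms=2):
    """Rank documents that contain all of the most discriminative terms."""
    known = [t for t in query_terms if t in index]
    ranked = sorted(known, key=lambda t: idf(t, index, n_docs), reverse=True)
    ranked = ranked[:top_terms]
    if not ranked:
        return []
    first, rest = ranked[0], ranked[1:]
    results = []
    for doc_id, score in index[first]:
        total = score
        for term in rest:
            row = index[term]
            # (doc_id,) sorts just before any (doc_id, score) pair.
            pos = bisect_left(row, (doc_id,))
            if pos < len(row) and row[pos][0] == doc_id:
                total += row[pos][1]
            else:
                break  # document is missing from this row; skip it
        else:
            results.append((doc_id, total))
    return sorted(results, key=lambda r: r[1], reverse=True)


index = {
    "cheap": [(1, 0.5), (2, 0.4), (3, 0.3), (4, 0.2)],
    "flights": [(2, 1.0), (4, 1.5)],
    "tokyo": [(4, 2.0)],
}
print(search(["cheap", "flights", "tokyo"], index, n_docs=4))
```

In this toy index the common term "cheap" is dropped from the three-term query, so only the short rows for "tokyo" and "flights" are scanned.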
-
Publication No.: US20080288481A1
Publication date: 2008-11-20
Application No.: US11803462
Filing date: 2007-05-15
Applicant: Huajun Zeng, Chenxi Lin, Dingyi Han, Benyu Zhang, Zheng Chen, Jian Wang
Inventor: Huajun Zeng, Chenxi Lin, Dingyi Han, Benyu Zhang, Zheng Chen, Jian Wang
IPC classification: G06F17/30
CPC classification: G06Q30/02
Abstract: Described is a technology by which online advertisements for returning with a query response are ranked according to reputation. The reputation may correspond to a product or service and/or seller reputation. In one example, a set of relevant advertisement items is located and ranked using reputation data as a factor. For example, for each item, a ranking value is based on a mathematical combination of a product reputation score, a seller reputation score and a relevance score, with the items ranked by their computed values. The scores may be weighted differently. The reputation data may be mined from a review source, such as customer reviews available on the web. In one example implementation, a 3-gram model that considers terms in the review along with the two terms preceding each term is used to analyze the reviews to determine whether each review is positive or negative with respect to the reputation.
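Note: the abstract speaks only of "a mathematical combination" of the three scores, possibly with different weights. One plausible reading, shown here purely as an assumption, is a weighted sum. The field names and the default weights below are hypothetical.

```python
def ranking_value(ad, w_product=0.3, w_seller=0.2, w_relevance=0.5):
    """Weighted sum of the three scores (an assumed form of the combination)."""
    return (
        w_product * ad["product_reputation"]
        + w_seller * ad["seller_reputation"]
        + w_relevance * ad["relevance"]
    )


def rank_ads(ads, **weights):
    """Return the ads ordered from highest to lowest ranking value."""
    return sorted(ads, key=lambda ad: ranking_value(ad, **weights), reverse=True)


ads = [
    {"id": "A", "product_reputation": 0.9, "seller_reputation": 0.2, "relevance": 0.5},
    {"id": "B", "product_reputation": 0.4, "seller_reputation": 0.9, "relevance": 0.8},
    {"id": "C", "product_reputation": 0.1, "seller_reputation": 0.1, "relevance": 0.9},
]
print([ad["id"] for ad in rank_ads(ads)])
```

Changing the weights changes the ordering: with all weight placed on relevance, the item with the highest relevance score ranks first regardless of reputation.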
-
Publication No.: US20070239792A1
Publication date: 2007-10-11
Application No.: US11392640
Filing date: 2006-03-30
Applicant: Zheng Chen, Lei Li, Chenxi Lin, Qiaoling Liu, Jian Wang, Benyu Zhang
Inventor: Zheng Chen, Lei Li, Chenxi Lin, Qiaoling Liu, Jian Wang, Benyu Zhang
IPC classification: G06F17/30
CPC classification: G06F17/30616, Y10S707/99936, Y10S707/99953, Y10S707/99954
Abstract: Extraction of semantic information and the generation of semantic attributes allow for improved organization and management of data. Semantic attributes are automatically generated, eliminating the need for manual entry of attribute information. A semantic file network may further be constructed based on similarities between files that are based on the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file systems as well as improve the accuracy and speed of queries.
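Note: the abstract does not specify how file similarity is measured. As a rough sketch, assuming each file carries a set of automatically generated attribute strings and that Jaccard similarity above a threshold creates a semantic link, a file network could be built like this. `jaccard`, `build_network`, and the threshold value are illustrative assumptions.

```python
from itertools import combinations


def jaccard(a, b):
    """Share of attributes the two sets have in common (0.0 to 1.0)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def build_network(files, threshold=0.5):
    """Link every pair of files whose attribute similarity meets the threshold."""
    links = {name: set() for name in files}
    for x, y in combinations(files, 2):
        if jaccard(files[x], files[y]) >= threshold:
            links[x].add(y)
            links[y].add(x)
    return links


files = {
    "a.doc": {"budget", "2006", "finance"},
    "b.xls": {"budget", "finance", "q3"},
    "c.jpg": {"holiday", "beach"},
}
print(build_network(files))
```

The abstract also mentions user operations as a signal for building the network; that signal is not modeled in this sketch.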
-
Publication No.: US20070239712A1
Publication date: 2007-10-11
Application No.: US11392760
Filing date: 2006-03-30
Applicant: Zheng Chen, Lei Li, Chenxi Lin, Qiaoling Liu, Jian Wang, Benyu Zhang
Inventor: Zheng Chen, Lei Li, Chenxi Lin, Qiaoling Liu, Jian Wang, Benyu Zhang
IPC classification: G06F17/30
CPC classification: G06F17/30112, G06F17/3012, Y10S707/99933, Y10S707/99934, Y10S707/99935, Y10S707/99936, Y10S707/99937, Y10S707/99938
Abstract: Extraction of semantic information and the generation of semantic attributes allow for improved organization and management of data. Semantic attributes are automatically generated, eliminating the need for manual entry of attribute information. A semantic file network may further be constructed based on similarities between files that are based on the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file systems as well as improve the accuracy and speed of queries.
-
Publication No.: US20070005649A1
Publication date: 2007-01-04
Application No.: US11173098
Filing date: 2005-07-01
Applicant: Jian Wang, Fengping Zeng, Hua-Jun Zeng, Benyu Zhang, Zheng Chen, Chenxi Lin, Bing Sun
Inventor: Jian Wang, Fengping Zeng, Hua-Jun Zeng, Benyu Zhang, Zheng Chen, Chenxi Lin, Bing Sun
IPC classification: G06F17/00
CPC classification: G06F16/957
Abstract: The invention provides a method of creating contextual titles for web pages or documents. The method includes extracting phrases from a web page or document. The phrases are evaluated for use as contextual titles for the web page or document. Users then use the contextual title to access the web page or document.
-
Publication No.: US20060271834A1
Publication date: 2006-11-30
Application No.: US11136029
Filing date: 2005-05-24
Applicant: Jian Wang, Hua-Jun Zeng, Chenxi Lin, Zheng Chen, Benyu Zhang, Bing Sun
Inventor: Jian Wang, Hua-Jun Zeng, Chenxi Lin, Zheng Chen, Benyu Zhang, Bing Sun
IPC classification: G06F17/00
CPC classification: G06F17/3089
Abstract: The invention provides a method of creating a personal home page containing information of interest assembled from various web sites. The method includes partitioning web pages into web blocks. Users may collect web blocks from different web pages and use those web blocks to define a dynamic personal home page. In addition, the web blocks may be tracked to update content in the personal home page based on corresponding changes in the original web page.
-
Publication No.: US09300134B2
Publication date: 2016-03-29
Application No.: US13532916
Filing date: 2012-06-26
Applicant: Chenxi Lin, Xiaosong Yang
Inventor: Chenxi Lin, Xiaosong Yang
CPC classification: H02J3/00, H02J2003/001, H02J2003/007, Y02E60/76, Y04S40/22
Abstract: In at least some embodiments, a computer system includes a processor and a storage device coupled to the processor. The storage device stores a program that, when executed, causes the processor to simulate restoration of a power grid system and to generate a restoration plan for the power grid system based on the simulation.