-
公开(公告)号:US09208231B1
公开(公告)日:2015-12-08
申请号:US13305626
申请日:2011-11-28
Applicant: Trystan G. Upstill , Matteo Slanina
Inventor: Trystan G. Upstill , Matteo Slanina
IPC: G06F17/30
CPC classification number: G06F17/30864
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for identifying languages that are relevant to resources. In one aspect, a method includes selecting in a data processing apparatus a first resource; accessing click data that identifies, for each of a plurality of requests for the first resource, a respective search engine user interface from which the request was received; identifying a search engine user interface language for each of the plurality of requests based on the click data; determining a respective language relevance score for the first resource for each identified search engine user interface language; and selecting one or more languages as being relevant to the first resource based on the language relevance scores.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于识别与资源相关的语言。 一方面,一种方法包括在数据处理装置中选择第一资源; 访问针对第一资源的多个请求中的每一个的点击数据,从其接收到请求的相应搜索引擎用户界面; 基于所述点击数据识别所述多个请求中的每一个的搜索引擎用户界面语言; 为每个识别的搜索引擎用户界面语言确定第一资源的相应语言相关性得分; 以及基于所述语言相关性得分来选择与所述第一资源相关的一种或多种语言。
-
2.
公开(公告)号:US09098582B1
公开(公告)日:2015-08-04
申请号:US12421931
申请日:2009-04-10
Applicant: Derrick E. Bass , Xin Liu , Matteo Slanina , Trystan Upstill
Inventor: Derrick E. Bass , Xin Liu , Matteo Slanina , Trystan Upstill
IPC: G06F17/30
CPC classification number: G06F17/30864 , G06F17/30882
Abstract: Methods, systems, and apparatus, including computer program products, for identifying languages that are relevant to resource. In an aspect, language features are identified for incoming resource links to a resource and outgoing resource links from the resource. The language features or use by a language classification model to generate language relevance scores. The language relevance scores for each of the incoming resource links and outgoing resource links are used to generate a corresponding relevance measure for each of a plurality of languages. Each relevance measure is a measure of the relevance of the language to the resource.
Abstract translation: 用于识别与资源相关的语言的方法,系统和装置,包括计算机程序产品。 在一个方面,针对来自资源的资源和外来资源链接的资源链接识别语言特征。 语言特征或语言分类模型用于生成语言相关性分数。 用于每个传入资源链接和输出资源链接的语言相关性得分用于为多种语言中的每一种生成相应的相关性度量。 每个相关性度量衡量语言与资源的相关性。
-