专利检索 ap:("Jun Wu" OR "Tang Xi Liu" OR "Feng Hong" OR "Yong-Gang Wang" OR "Bo Yang" OR "Lei Zhang") AND inv:"Jun Wu" 第 1 页

1.

发明授权
Domain dictionary creation by detection of new topic words using divergence value comparison 有权
标题翻译：通过使用发散值比较检测新主题词来创建域名词典

公开(公告)号：US08386240B2

公开(公告)日：2013-02-26

申请号：US13158125

申请日：2011-06-10

申请人： Jun Wu , Tang Xi Liu , Feng Hong , Yong-Gang Wang , Bo Yang , Lei Zhang

发明人： Jun Wu , Tang Xi Liu , Feng Hong , Yong-Gang Wang , Bo Yang , Lei Zhang

IPC分类号： G06F17/21 , G06F17/20 , G06F17/27

CPC分类号： G06F17/2745

摘要： Methods, systems, and apparatus, including computer program products, to identify topic words in a collection of documents that includes topic documents related to a topic are disclosed. A reference topic word divergence value based on a document collection and the topic document collection is determined. A candidate topic word divergence value for a candidate topic word is determined based on the document collection and the topic document collection. The candidate topic word is determined to be a topic word if the candidate topic word divergence value is greater than the reference topic word divergence value.

摘要翻译： 公开了包括计算机程序产品在包括与主题相关的主题文档的文档集合中的主题词的方法，系统和装置。确定基于文档收集和主题文档收集的参考主题词分歧值。基于文档收集和主题文档收集来确定候选主题词的候选主题词分歧值。如果候选主题词发散值大于参考主题词发散值，则将候选主题词确定为主题词。

2.

发明申请
Word Detection 有权
标题翻译：字检测

公开(公告)号：US20110137642A1

公开(公告)日：2011-06-09

申请号：US13016338

申请日：2011-01-28

申请人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

发明人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

IPC分类号： G06F17/21

CPC分类号： G06F17/2715 , G06F17/2223 , G06F17/2735 , G06F17/2863

摘要： Methods, systems, and apparatus, including computer program products, in which data from web documents are partitioned into a training corpus and a development corpus are provided. First word probabilities for words are determined for the training corpus, and second word probabilities for the words are determined for the development corpus. Uncertainty values based on the word probabilities for the training corpus and the development corpus are compared, and new words are identified based on the comparison.

摘要翻译： 提供了包括计算机程序产品在内的方法，系统和装置，其中将来自web文档的数据分成训练语料库和开发语料库。为训练语料库确定单词的第一个单词概率，并为开发语料库确定单词的第二个单词概率。比较了基于训练语料库和开发语料库的单词概率的不确定性值，并根据比较来确定新词。

3.

发明授权
Word detection 有权
标题翻译：词检测

公开(公告)号：US07917355B2

公开(公告)日：2011-03-29

申请号：US11844153

申请日：2007-08-23

申请人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

发明人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

IPC分类号： G06F17/21 , G06F17/27 , G06F17/20

CPC分类号： G06F17/2715 , G06F17/2223 , G06F17/2735 , G06F17/2863

摘要： Methods, systems, and apparatus, including computer program products, in which data from web documents are partitioned into a training corpus and a development corpus are provided. First word probabilities for words are determined for the training corpus, and second word probabilities for the words are determined for the development corpus. Uncertainty values based on the word probabilities for the training corpus and the development corpus are compared, and new words are identified based on the comparison.

摘要翻译： 提供了包括计算机程序产品在内的方法，系统和装置，其中将来自web文档的数据分成训练语料库和开发语料库。为训练语料库确定单词的第一个单词概率，并为开发语料库确定单词的第二个单词概率。比较了基于训练语料库和开发语料库的单词概率的不确定性值，并根据比较来确定新词。

4.

发明授权
Word detection 有权
标题翻译：词检测

公开(公告)号：US08463598B2

公开(公告)日：2013-06-11

申请号：US13016338

申请日：2011-01-28

申请人： Jun Wu , Tang Xi Liu , Feng Hong , YongGang Wang , Bo Yang , Lei Zhang

发明人： Jun Wu , Tang Xi Liu , Feng Hong , YongGang Wang , Bo Yang , Lei Zhang

IPC分类号： G06F17/20 , G06F17/21 , G06F17/27

CPC分类号： G06F17/2715 , G06F17/2223 , G06F17/2735 , G06F17/2863

摘要： Methods, systems, and apparatus, including computer program products, in which data from web documents are partitioned into a training corpus and a development corpus are provided. First word probabilities for words are determined for the training corpus, and second word probabilities for the words are determined for the development corpus. Uncertainty values based on the word probabilities for the training corpus and the development corpus are compared, and new words are identified based on the comparison.

摘要翻译： 提供了包括计算机程序产品在内的方法，系统和装置，其中将来自web文档的数据分成训练语料库和开发语料库。为训练语料库确定单词的第一个单词概率，并为开发语料库确定单词的第二个单词概率。比较了基于训练语料库和开发语料库的单词概率的不确定性值，并根据比较来确定新词。

5.

发明申请
DOMAIN DICTIONARY CREATION 有权
标题翻译：域名字典创建

公开(公告)号：US20110238413A1

公开(公告)日：2011-09-29

申请号：US13158125

申请日：2011-06-10

申请人： Jun Wu , Tang Xi Liu , Feng Hong , YongGang Wang , Bo Yang , Lei Zhang

发明人： Jun Wu , Tang Xi Liu , Feng Hong , YongGang Wang , Bo Yang , Lei Zhang

IPC分类号： G06F17/21

CPC分类号： G06F17/2745

摘要： Methods, systems, and apparatus, including computer program products, to identify topic words in a collection of documents that includes topic documents related to a topic are disclosed. A reference topic word divergence value based on a document collection and the topic document collection is determined. A candidate topic word divergence value for a candidate topic word is determined based on the document collection and the topic document collection. The candidate topic word is determined to be a topic word if the candidate topic word divergence value is greater than the reference topic word divergence value.

摘要翻译： 公开了包括计算机程序产品在包括与主题相关的主题文档的文档集合中的主题词的方法，系统和装置。确定基于文档收集和主题文档收集的参考主题词分歧值。基于文档收集和主题文档收集来确定候选主题词的候选主题词分歧值。如果候选主题词发散值大于参考主题词发散值，则将候选主题词确定为主题词。

6.

发明授权
Domain dictionary creation by detection of new topic words using divergence value comparison 有权
标题翻译：通过使用发散值比较检测新主题词来创建域名词典

公开(公告)号：US07983902B2

公开(公告)日：2011-07-19

申请号：US11844067

申请日：2007-08-23

申请人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

发明人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

IPC分类号： G06F17/21 , G06F17/20 , G06F17/27

CPC分类号： G06F17/2745

摘要： Methods, systems, and apparatus, including computer program products, to identify topic words in a document corpus that includes topic documents related to a topic are disclosed. A reference topic word divergence value based on the document corpus and the topic document corpus is determined. A candidate topic word divergence value for a candidate topic word is determined based on the document corpus and the topic document corpus. The candidate topic word is determined to be a topic word if the candidate topic word divergence value is greater than the reference topic word divergence value.

摘要翻译： 公开了包括计算机程序产品在包括与主题相关的主题文档的文档语料库中的主题词的方法，系统和装置。确定基于文档语料库和主题文档语料库的参考主题词分歧值。基于文档语料库和主题文档语料库确定候选主题词的候选主题词分歧值。如果候选主题词发散值大于参考主题词发散值，则将候选主题词确定为主题词。

7.

发明申请
Domain Dictionary Creation 有权
标题翻译：域名词典创作

公开(公告)号：US20090055381A1

公开(公告)日：2009-02-26

申请号：US11844067

申请日：2007-08-23

申请人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

发明人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

IPC分类号： G06F17/30

CPC分类号： G06F17/2745

摘要： Methods, systems, and apparatus, including computer program products, to identify topic words in a document corpus that includes topic documents related to a topic are disclosed. A reference topic word divergence value based on the document corpus and the topic document corpus is determined. A candidate topic word divergence value for a candidate topic word is determined based on the document corpus and the topic document corpus. The candidate topic word is determined to be a topic word if the candidate topic word divergence value is greater than the reference topic word divergence value.

摘要翻译： 公开了包括计算机程序产品在包括与主题相关的主题文档的文档语料库中的主题词的方法，系统和装置。确定基于文档语料库和主题文档语料库的参考主题词分歧值。基于文档语料库和主题文档语料库确定候选主题词的候选主题词分歧值。如果候选主题词发散值大于参考主题词发散值，则将候选主题词确定为主题词。

8.

发明申请
Word Detection 有权
标题翻译：字检测

公开(公告)号：US20090055168A1

公开(公告)日：2009-02-26

申请号：US11844153

申请日：2007-08-23

申请人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

发明人： Jun Wu , Tang Xi Liu , Feng Hong , Yonggang Wang , Bo Yang , Lei Zhang

IPC分类号： G06F17/21

CPC分类号： G06F17/2715 , G06F17/2223 , G06F17/2735 , G06F17/2863

摘要： Methods, systems, and apparatus, including computer program products, in which data from web documents are partitioned into a training corpus and a development corpus are provided. First word probabilities for words are determined for the training corpus, and second word probabilities for the words are determined for the development corpus. Uncertainty values based on the word probabilities for the training corpus and the development corpus are compared, and new words are identified based on the comparison.

摘要翻译： 提供了包括计算机程序产品在内的方法，系统和装置，其中将来自web文档的数据分成训练语料库和开发语料库。为训练语料库确定单词的第一个单词概率，并为开发语料库确定单词的第二个单词概率。比较了基于训练语料库和开发语料库的单词概率的不确定性值，并根据比较来确定新词。

9.

发明申请
RESOURCE LOCATOR SUGGESTIONS FROM INPUT CHARACTER SEQUENCE 有权
标题翻译：资源定位器从输入字符序列建议

公开(公告)号：US20100005086A1

公开(公告)日：2010-01-07

申请号：US12211712

申请日：2008-09-16

申请人： Yonggang Wang , Feng Hong , Wei Xu , Xiliu Tang , Henry Ou , Bo Yang , Lei Zhang , Runhua Yang , Jun Wu , Baogang Yao

发明人： Yonggang Wang , Feng Hong , Wei Xu , Xiliu Tang , Henry Ou , Bo Yang , Lei Zhang , Runhua Yang , Jun Wu , Baogang Yao

IPC分类号： G06F17/30 , G06F3/048

CPC分类号： G06F3/0484 , G06F3/018 , G06F3/0237 , G06F17/276

摘要： Methods, systems, and apparatus, including computer program products, in which an input method editor receives Roman character inputs, identifies keywords for candidate sets of a non-Roman character, and identifies an associated resource location. Upon identifying an associated resource location, associating the resource location with the candidate set of non-Roman characters.

摘要翻译： 方法，系统和装置，包括计算机程序产品，其中输入法编辑器接收罗马字符输入，识别非罗马字符的候选集合的关键字，并识别相关联的资源位置。在识别相关联的资源位置时，将资源位置与非罗马字符的候选集相关联。

10.

发明授权
Resource locator suggestions from input character sequence 有权
标题翻译：来自输入字符序列的资源定位器建议

公开(公告)号：US08745051B2

公开(公告)日：2014-06-03

申请号：US12211712

申请日：2008-09-16

申请人： Yonggang Wang , Feng Hong , Wei Xu , Xiliu Tang , Henry Ou , Bo Yang , Lei Zhang , Runhua Yang , Jun Wu , Baogang Yao

发明人： Yonggang Wang , Feng Hong , Wei Xu , Xiliu Tang , Henry Ou , Bo Yang , Lei Zhang , Runhua Yang , Jun Wu , Baogang Yao

IPC分类号： G06F7/00 , G06F17/30

CPC分类号： G06F3/0484 , G06F3/018 , G06F3/0237 , G06F17/276

摘要： Methods, systems, and apparatus, including computer program products, in which an input method editor receives Roman character inputs, identifies keywords for candidate sets of a non-Roman character, and identifies an associated resource location. Upon identifying an associated resource location, associating the resource location with the candidate set of non-Roman characters.

摘要翻译： 方法，系统和装置，包括计算机程序产品，其中输入法编辑器接收罗马字符输入，识别非罗马字符的候选集合的关键字，并识别相关联的资源位置。在识别相关联的资源位置时，将资源位置与非罗马字符的候选集相关联。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类