-
公开(公告)号:US08862459B2
公开(公告)日:2014-10-14
申请号:US13087407
申请日:2011-04-15
申请人: Long Jiang , Ming Zhou , Su Hao
发明人: Long Jiang , Ming Zhou , Su Hao
CPC分类号: G06F17/21 , G06F17/24 , G06F17/2863
摘要: Embodiments are disclosed for automatically generating a banner given a first scroll sentence and a second scroll sentence of a Chinese couplet. The first and/or second scroll sentence can be generated by an automatic computer system or by a human (e.g., manually generated and then provided as input to an automated banner generation system) or obtained from any source (e.g., a book) and provided as input. In one embodiment, an information retrieval process is utilized to identify banner candidates that best match the first and second scroll sentences. In one embodiment, candidate banners are automatically generated. In one embodiment, a ranking model is applied in order to rank banner candidates derived from the banner search and generation processes. One or more banners are then selected from the ranked banner candidates.
摘要翻译: 公开了用于自动生成给定中文对联的第一滚动句和第二滚动句的横幅的实施例。 第一和/或第二滚动句可以由自动计算机系统或人(例如,手动生成然后作为自动横幅生成系统的输入提供)或从任何来源(例如,书)获得并提供 作为输入。 在一个实施例中,使用信息检索处理来识别与第一和第二滚动句子最匹配的横幅候选。 在一个实施例中,自动生成候选横幅。 在一个实施例中,应用排序模型以排序从横幅搜索和生成处理导出的横幅候选。 然后从排名的横幅候选中选择一个或多个横幅。
-
公开(公告)号:US08000955B2
公开(公告)日:2011-08-16
申请号:US11788448
申请日:2007-04-20
申请人: Long Jiang , Ming Zhou , Su Hao
发明人: Long Jiang , Ming Zhou , Su Hao
IPC分类号: G06F17/27
CPC分类号: G06F17/21 , G06F17/24 , G06F17/2863
摘要: Embodiments are disclosed for automatically generating a banner given a first scroll sentence and a second scroll sentence of a Chinese couplet. The first and/or second scroll sentence can be generated by an automatic computer system or by a human (e.g., manually generated and then provided as input to an automated banner generation system) or obtained from any source (e.g., a book) and provided as input. In one embodiment, an information retrieval process is utilized to identify banner candidates that best match the first and second scroll sentences. In one embodiment, candidate banners are automatically generated. In one embodiment, a ranking model is applied in order to rank banner candidates derived from the banner search and generation processes. One or more banners are then selected from the ranked banner candidates.
摘要翻译: 公开了用于自动生成给定中文对联的第一滚动句和第二滚动句的横幅的实施例。 第一和/或第二滚动句可以由自动计算机系统或人(例如,手动生成然后作为自动横幅生成系统的输入提供)或从任何来源(例如,书)获得并提供 作为输入。 在一个实施例中,使用信息检索处理来识别与第一和第二滚动句子最匹配的横幅候选。 在一个实施例中,自动生成候选横幅。 在一个实施例中,应用排序模型以排序从横幅搜索和生成处理导出的横幅候选。 然后从排名的横幅候选中选择一个或多个横幅。
-
公开(公告)号:US20110257959A1
公开(公告)日:2011-10-20
申请号:US13087407
申请日:2011-04-15
申请人: Long Jiang , Ming Zhou , Su Hao
发明人: Long Jiang , Ming Zhou , Su Hao
CPC分类号: G06F17/21 , G06F17/24 , G06F17/2863
摘要: Embodiments are disclosed for automatically generating a banner given a first scroll sentence and a second scroll sentence of a Chinese couplet. The first and/or second scroll sentence can be generated by an automatic computer system or by a human (e.g., manually generated and then provided as input to an automated banner generation system) or obtained from any source (e.g., a book) and provided as input. In one embodiment, an information retrieval process is utilized to identify banner candidates that best match the first and second scroll sentences. In one embodiment, candidate banners are automatically generated. In one embodiment, a ranking model is applied in order to rank banner candidates derived from the banner search and generation processes. One or more banners are then selected from the ranked banner candidates.
摘要翻译: 公开了用于自动生成给定中文对联的第一滚动句和第二滚动句的横幅的实施例。 第一和/或第二滚动句可以由自动计算机系统或人(例如,手动生成然后作为自动横幅生成系统的输入提供)或从任何来源(例如,书)获得并提供 作为输入。 在一个实施例中,利用信息检索处理来识别与第一和第二滚动句子最匹配的横幅候选。 在一个实施例中,自动生成候选横幅。 在一个实施例中,应用排序模型以排序从横幅搜索和生成处理导出的横幅候选。 然后从排名的横幅候选中选择一个或多个横幅。
-
公开(公告)号:US20080154580A1
公开(公告)日:2008-06-26
申请号:US11788448
申请日:2007-04-20
申请人: Long Jiang , Ming Zhou , Su Hao
发明人: Long Jiang , Ming Zhou , Su Hao
CPC分类号: G06F17/21 , G06F17/24 , G06F17/2863
摘要: Embodiments are disclosed for automatically generating a banner given a first scroll sentence and a second scroll sentence of a Chinese couplet. The first and/or second scroll sentence can be generated by an automatic computer system or by a human (e.g., manually generated and then provided as input to an automated banner generation system) or obtained from any source (e.g., a book) and provided as input. In one embodiment, an information retrieval process is utilized to identify banner candidates that best match the first and second scroll sentences. In one embodiment, candidate banners are automatically generated. In one embodiment, a ranking model is applied in order to rank banner candidates derived from the banner search and generation processes. One or more banners are then selected from the ranked banner candidates.
摘要翻译: 公开了用于自动生成给定中文对联的第一滚动句和第二滚动句的横幅的实施例。 第一和/或第二滚动句可以由自动计算机系统或人(例如,手动生成然后作为自动横幅生成系统的输入提供)或从任何来源(例如,书)获得并提供 作为输入。 在一个实施例中,使用信息检索处理来识别与第一和第二滚动句子最匹配的横幅候选。 在一个实施例中,自动生成候选横幅。 在一个实施例中,应用排序模型以排序从横幅搜索和生成处理导出的横幅候选。 然后从排名的横幅候选中选择一个或多个横幅。
-
公开(公告)号:US20070011132A1
公开(公告)日:2007-01-11
申请号:US11155829
申请日:2005-06-17
申请人: Ming Zhou , Long Jiang
发明人: Ming Zhou , Long Jiang
IPC分类号: G06F17/30
CPC分类号: G06F16/951
摘要: Named entity translation of a named entity in a source language is translated to a target language by combining a transliteration of the named entity with data mining in the target language.
摘要翻译: 通过将命名实体的音译与目标语言中的数据挖掘相结合,将源语言中的命名实体命名实体翻译转换为目标语言。
-
公开(公告)号:US08484014B2
公开(公告)日:2013-07-09
申请号:US12362428
申请日:2009-01-29
申请人: Xiaohua Liu , Ming Zhou , Hao Wei , Jing Zhao , Matthew R. Scott , Long Jiang , Gang Chen
发明人: Xiaohua Liu , Ming Zhou , Hao Wei , Jing Zhao , Matthew R. Scott , Long Jiang , Gang Chen
CPC分类号: G06F17/30684 , G10L15/19
摘要: A method and system for identifying documents relevant to a query that specifies a part of speech is provided. A retrieval system receives from a user an input query that includes a word and a part of speech. Upon receiving an input query that includes a word and a part of speech, the retrieval system identifies documents with a sentence that includes that word collocated with a word that is used as that part of speech. The retrieval system displays to the user an indication of the identified documents.
摘要翻译: 提供了一种用于识别与指定一部分语音的查询相关的文档的方法和系统。 检索系统从用户接收包括单词和一部分语音的输入查询。 在接收到包括单词和一部分语音的输入查询时,检索系统用包含该单词的句子识别文档,该单词与用作该部分语音的单词并置。 检索系统向用户显示所识别的文档的指示。
-
公开(公告)号:US08275604B2
公开(公告)日:2012-09-25
申请号:US12406722
申请日:2009-03-18
申请人: Long Jiang , Shiquan Yang , Ming Zhou , Xiaohua Liu
发明人: Long Jiang , Shiquan Yang , Ming Zhou , Xiaohua Liu
CPC分类号: G06F17/28 , G06F17/2827
摘要: Embodiments for the adaptive learning of translation layout patterns to mine bilingual data are disclosed. In accordance with at least one embodiment, the adaptive learning of patterns to mine bilingual data includes processing a bilingual web page into a Document Object Model (DOM) tree. The embodiment further includes linking the bilingual snippet pairs of each node into a plurality bilingual snippet pairs. The embodiment also includes determining one or more best fit candidate patterns based on the plurality of translation snippets via a Support Vector Machine classifier. The embodiment additionally includes mining one or more translation pairs from the bilingual web page using the one or more best fit candidate patterns. The translation pairs are further stored in a data storage. The one or more translation pairs including at least one of a term pair, a phrase pair, or a sentence pair.
摘要翻译: 披露了双语数据翻译布局模式自适应学习的实施例。 根据至少一个实施例,对双语数据挖掘的模式的自适应学习包括将双语网页处理成文档对象模型(DOM)树。 该实施例还包括将每个节点的双语片段对链接成多个双语片段对。 该实施例还包括经由支持向量机分类器基于多个翻译片段来确定一个或多个最佳拟合候选模式。 该实施例另外包括使用一个或多个最佳拟合候选模式从双语网页挖掘一个或多个翻译对。 翻译对进一步存储在数据存储器中。 所述一个或多个翻译对包括术语对,短语对或句子对中的至少一个。
-
公开(公告)号:US20110213763A1
公开(公告)日:2011-09-01
申请号:US13101061
申请日:2011-05-04
申请人: Weizhu Chen , Long Jiang , Ming Zhou , Benyu Zhang , Zheng Chen , Jian Wang
发明人: Weizhu Chen , Long Jiang , Ming Zhou , Benyu Zhang , Zheng Chen , Jian Wang
IPC分类号: G06F17/30
CPC分类号: G06F16/951
摘要: Described herein is technology for, among other things, mining pair-based data on the web. The technology involves an online pair-based data mining system as well as an offline SVM training system. By subjecting a pair-based input data to the systems, one may grow a pool of pair-based data which share characteristics of the pair-based input data in more efficient manner.
摘要翻译: 这里描述的是用于在网络上挖掘基于对的数据的技术。 该技术涉及一个在线的基于对的数据挖掘系统以及离线SVM培训系统。 通过对基于对的输入数据进行系统的处理,可以以更有效的方式增加基于成对的输入数据的特征的基于对的数据池。
-
公开(公告)号:US07962507B2
公开(公告)日:2011-06-14
申请号:US11941968
申请日:2007-11-19
申请人: Weizhu Chen , Long Jiang , Ming Zhou , Benyu Zhang , Zheng Chen , Jian Wang
发明人: Weizhu Chen , Long Jiang , Ming Zhou , Benyu Zhang , Zheng Chen , Jian Wang
CPC分类号: G06F17/30864
摘要: Described herein is technology for, among other things, mining pair-based data on the web. The technology involves an online pair-based data mining system as well as an offline SVM training system. By subjecting a pair-based input data to the systems, one may grow a pool of pair-based data which share characteristics of the pair-based input data in more efficient manner.
摘要翻译: 这里描述的是用于在网络上挖掘基于对的数据的技术。 该技术涉及一个在线的基于对的数据挖掘系统以及离线SVM培训系统。 通过对基于对的输入数据进行系统的处理,可以以更有效的方式增加基于成对的输入数据的特征的基于对的数据池。
-
公开(公告)号:US20100241416A1
公开(公告)日:2010-09-23
申请号:US12406722
申请日:2009-03-18
申请人: Long Jiang , Shiquan Yang , Ming Zhou , Xiaohua Liu
发明人: Long Jiang , Shiquan Yang , Ming Zhou , Xiaohua Liu
IPC分类号: G06F17/28
CPC分类号: G06F17/28 , G06F17/2827
摘要: Embodiments for the adaptive learning of translation layout patterns to mine bilingual data are disclosed. In accordance with at least one embodiment, the adaptive learning of patterns to mine bilingual data includes processing a bilingual web page into a Document Object Model (DOM) tree. The embodiment further includes linking the bilingual snippet pairs of each node into a plurality bilingual snippet pairs. The embodiment also includes determining one or more best fit candidate patterns based on the plurality of translation snippets via a Support Vector Machine classifier. The embodiment additionally includes mining one or more translation pairs from the bilingual web page using the one or more best fit candidate patterns. The translation pairs are further stored in a data storage. The one or more translation pairs including at least one of a term pair, a phrase pair, or a sentence pair.
摘要翻译: 披露了双语数据翻译布局模式自适应学习的实施例。 根据至少一个实施例,对双语数据挖掘的模式的自适应学习包括将双语网页处理成文档对象模型(DOM)树。 该实施例还包括将每个节点的双语片段对链接成多个双语片段对。 该实施例还包括经由支持向量机分类器基于多个翻译片段来确定一个或多个最佳拟合候选模式。 该实施例另外包括使用一个或多个最佳拟合候选模式从双语网页挖掘一个或多个翻译对。 翻译对进一步存储在数据存储器中。 所述一个或多个翻译对包括术语对,短语对或句子对中的至少一个。
-
-
-
-
-
-
-
-
-