Patent search: ap:("Geoffrey D. Nunberg" OR "Hinrich Schuetze" OR "Jan O. Pedersen" OR "Brett L. Kessler" OR "Gregory Grefenstette") AND inv:"Brett L. Kessler" (page 1)

1.

Granted invention patent
Article and method of automatically determining text genre using surface features of untagged texts (status: lapsed)

Publication (announcement) number: US06973423B1

Publication (announcement) date: 2005-12-06

Application number: US09100189

Filing date: 1998-06-18

Applicants: Geoffrey D. Nunberg, Hinrich Schuetze, Jan O. Pedersen, Brett L. Kessler, Gregory Grefenstette

Inventors: Geoffrey D. Nunberg, Hinrich Schuetze, Jan O. Pedersen, Brett L. Kessler, Gregory Grefenstette

IPC classification: G06F17/20, G06F17/27, G06F17/30

CPC classification: G06F17/2745, G06F17/274

Abstract: A processor implemented method of identifying the text genre of a machine-readable, untagged text. The processor implemented method begins by generating a cue vector from the text, which represents occurrences in the text of a first set of nonstructural, surface cues, which are easily computable. Afterward, the processor determines whether the text is an instance of a first text genre using the cue vector and a weighting vector associated with the first text genre.
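
The two steps in the abstract (build a cue vector, then combine it with a genre-specific weighting vector) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the abstract does not list the cues, the weights, or the decision rule, so the function names `cue_vector` and `is_genre`, the five cues, the example weights, and the simple linear threshold are all hypothetical and are not taken from the patent.

```python
import re

def cue_vector(text: str) -> list[float]:
    """Compute a few cheap surface cues from raw, untagged text.

    The cues chosen here are hypothetical examples of easily computable,
    nonstructural features; the abstract does not enumerate them.
    """
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = max(len(words), 1)
    return [
        1.0,                                   # constant bias term
        text.count("?") / n_words,             # question marks per word
        text.count("!") / n_words,             # exclamation marks per word
        sum(len(w) for w in words) / n_words,  # mean word length
        len(words) / max(len(sentences), 1),   # mean sentence length in words
    ]

def is_genre(cues: list[float], weights: list[float], threshold: float = 0.0) -> bool:
    """Decide genre membership from a weighted sum of the cues.

    A plain linear score with a threshold is an assumed decision rule,
    used only to show how a cue vector and a weighting vector combine.
    """
    score = sum(c * w for c, w in zip(cues, weights))
    return score >= threshold

# Illustrative weights for a made-up "informal" genre (not from the patent):
# reward ? and ! density, penalize long words.
informal_weights = [0.0, 10.0, 10.0, -1.0, 0.0]

print(is_genre(cue_vector("Is it? Yes!"), informal_weights))
print(is_genre(cue_vector("The committee reviewed the proposal thoroughly."), informal_weights))
```

In this sketch each genre would have its own weighting vector, so the same cue vector can be scored against several genres without recomputing the cues.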

2.

Granted invention patent
Article and method of automatically filtering information retrieval results using test genre (status: lapsed)

Publication (announcement) number: US06505150B2

Publication (announcement) date: 2003-01-07

Application number: US09100201

Filing date: 1998-06-18

Applicants: Geoffrey D. Nunberg, Hinrich Schuetze, Jan O. Pedersen, Brett L. Kessler

Inventors: Geoffrey D. Nunberg, Hinrich Schuetze, Jan O. Pedersen, Brett L. Kessler

IPC classification: G10L1720

CPC classification: G06F17/277, G06F17/271, G06F17/2775, G06F17/2785, G06F17/30705, G06F17/30707

Abstract: A method of filtering according to text genre the results of a topic search of a heterogeneous corpus of untagged, machine-readable texts. Because each text of the corpus has a topic and a text genre, the corpus includes multiple text genres and covers multiple topics. According to the method, a processor first searches the corpus for a first multiplicity of texts that have a first topic. Next, the processor identifies a first set of texts of the first multiplicity that are instances of a first text genre and identifies a second set of texts of the first multiplicity that are instances of a second text genre. Finally, the processor identifies to a computer user the first multiplicity of texts in an order based upon the first text genre and second text genre.
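
The pipeline in the abstract (topic search, then genre identification over the hits, then presentation in a genre-based order) can be sketched as follows. This is a toy illustration under stated assumptions: `search_by_topic` and `order_by_genre` are hypothetical names, the substring match stands in for whatever topic search is actually used, and the genre tests are placeholder predicates rather than the patent's classifier.

```python
from typing import Callable, Optional

def search_by_topic(corpus: dict[str, str], topic: str) -> list[str]:
    """Return ids of texts that mention the topic term.

    A case-insensitive substring match is an assumed stand-in for a real
    topic search engine.
    """
    needle = topic.lower()
    return [doc_id for doc_id, text in corpus.items() if needle in text.lower()]

def order_by_genre(
    hits: list[str],
    corpus: dict[str, str],
    genre_tests: dict[str, Callable[[str], bool]],
) -> list[tuple[str, Optional[str]]]:
    """Order topic hits by genre.

    Genres are taken in the order given by genre_tests; each hit is assigned
    to the first genre whose test accepts it, and unmatched hits go last.
    """
    ranked: list[tuple[str, Optional[str]]] = []
    remaining = list(hits)
    for genre, test in genre_tests.items():
        matched = [d for d in remaining if test(corpus[d])]
        ranked.extend((d, genre) for d in matched)
        remaining = [d for d in remaining if d not in matched]
    ranked.extend((d, None) for d in remaining)
    return ranked

# Made-up corpus and placeholder genre predicates, for illustration only.
corpus = {
    "a": "Budget news: the council voted.",
    "b": "Why I love the budget! Amazing!",
    "c": "Cooking tips.",
}
genre_tests = {
    "editorial": lambda t: "!" in t,
    "reportage": lambda t: "!" not in t,
}

hits = search_by_topic(corpus, "budget")
print(order_by_genre(hits, corpus, genre_tests))
```

Passing the genre tests as an ordered mapping keeps the presentation order a caller-side choice, which matches the abstract's "order based upon the first text genre and second text genre" without fixing what that order is.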
