Methods and apparatus for communicating information in a supervised learning system
    1.
    发明授权
    Methods and apparatus for communicating information in a supervised learning system 失效
    在监督学习系统中传达信息的方法和装置

    公开(公告)号:US06668248B1

    公开(公告)日:2003-12-23

    申请号:US10325911

    申请日:2002-12-23

    IPC分类号: G06F1518

    CPC分类号: G06N99/005

    摘要: A method and apparatus for adding new learning tasks to an incremental supervised learner provide a flexible incremental representation of all training examples encountered, thereby permitting state representations for new learning tasks to take advantage of incremental training already completed by encoding all past training examples as negative examples for a hypothetical learning task. The state representation of the hypothetical learning task is copied as the initial state representation for a new learning task to be initiated, is initialized with negative training examples of all previously presented training examples, thereby permitting the learning task to incorporate the previous examples efficiently.

    摘要翻译: 用于将新的学习任务添加到增量监督学习者的方法和装置提供了所遇到的所有训练示例的灵活的增量表示,从而允许新学习任务的状态表示利用已经通过将所有过去的训练示例编码为负的示例已经完成的增量训练 一个假设的学习任务。 假设学习任务的状态表示被复制为要启动的新学习任务的初始状态表示,用所有先前提出的训练示例的负训练示例初始化,从而允许学习任务有效地并入前述示例。

    Method and apparatus for searching distributed networks using a plurality of search devices
    2.
    发明授权
    Method and apparatus for searching distributed networks using a plurality of search devices 有权
    使用多个搜索装置搜索分布式网络的方法和装置

    公开(公告)号:US06370527B1

    公开(公告)日:2002-04-09

    申请号:US09222129

    申请日:1998-12-29

    IPC分类号: G06F1730

    摘要: A meta-search engine apparatus and method for searching distributed networks using a plurality of search devices. The meta-search engine apparatus sends search queries to a plurality of search engines and compiles the results obtained from each of these search engines into a single ranked list. The results obtained from each of the search engines includes a listing of the titles of found sources of the search terms, or related search terms, and a summary of the source. The compilation and ranking is based primarily on the occurrence of search terms, or related search terms, in the titles and summaries but may also be based on, for example, relative weights given to each search engine, the number of search engines returning the same source as a result of a search, weighting of sections of the results obtained from the search engines, and the like.

    摘要翻译: 一种用于使用多个搜索装置搜索分布式网络的元搜索引擎装置和方法。 元搜索引擎装置向多个搜索引擎发送搜索查询,并将从每个这些搜索引擎获得的结果编译成单个排名列表。 从每个搜索引擎获得的结果包括搜索词的找到源的标题或相关搜索词的列表以及源的摘要。 编辑和排序主要基于标题和摘要中搜索词或相关搜索词的发生,但也可以基于例如给予每个搜索引擎的相对权重,返回相同的搜索引擎的数量 作为搜索结果的来源,从搜索引擎获得的结果的部分的加权等。

    Document expansion in speech retrieval
    3.
    发明授权
    Document expansion in speech retrieval 有权
    语音检索中的文档扩展

    公开(公告)号:US07761298B1

    公开(公告)日:2010-07-20

    申请号:US12013692

    申请日:2008-01-14

    IPC分类号: G10L15/00

    CPC分类号: G06F17/30746 G06F17/3074

    摘要: Methods of document expansion for a speech retrieval document by a recognizer. A database of vectors of automatic transcriptions of documents is accessed and the vectors are truncated by removing all terms that are not recognizable by the recognizer to create truncated vectors. Terms in the vectors are then weighted to associate the truncated vectors with the untruncated vectors. Terms not recognized by the recognizer are then added back to the weighted, truncated vectors. The retrieval effectiveness may then be measured.

    摘要翻译: 用于识别器的语音检索文档的文档扩展方法。 访问文档的自动转录的向量的数据库,并通过去除识别器不能识别的所有术语来创建截断的向量来截断向量。 然后对向量中的术语进行加权,将截断的载体与未经截断的载体相关联。 识别器未识别的术语然后被加回加权的截断向量。 然后可以测量检索效果。

    Document expansion in speech retrieval
    4.
    发明授权
    Document expansion in speech retrieval 有权
    语音检索中的文档扩展

    公开(公告)号:US07113910B1

    公开(公告)日:2006-09-26

    申请号:US09740284

    申请日:2000-12-19

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30746 G06F17/3074

    摘要: Methods of document expansion for a speech retrieval document by a recognizer. A database of vectors of automatic transcriptions of documents is accessed and the vectors are truncated by removing all terms that are not recognizable by the recognizer to create truncated vectors. Terms in the vectors are then weighted to associate the truncated vectors with the untruncated vectors. Terms not recognized by the recognizer are then added back to the weighted, truncated vectors. The retrieval effectiveness may then be measured.

    摘要翻译: 用于识别器的语音检索文档的文档扩展方法。 访问文档的自动转录的向量的数据库,并通过去除识别器不能识别的所有术语来创建截断的向量来截断向量。 然后对向量中的术语进行加权,将截断的载体与未经截断的载体相关联。 识别器未识别的术语然后被加回加权的截断向量。 然后可以测量检索效果。

    Efficient and effective distributed information management
    5.
    发明授权
    Efficient and effective distributed information management 有权
    高效有效的分布式信息管理

    公开(公告)号:US06347317B1

    公开(公告)日:2002-02-12

    申请号:US09678371

    申请日:2000-10-02

    IPC分类号: G06F1730

    摘要: A method stores, indexes, searches and retrieves data information in a large data storage and retrieval system. Large amounts of data information, subject to searching and retrieval, are broken down and stored in sub-collections. Each sub-collection separately performs indexing of only the data information contained within that sub-collection and forms an inverted index. Statistical information derived from the inverted index of each sub-collection is collected by a global collection custodian and compiled into a global index. The global index is then passed to each sub-collection and is used by each during searching and retrieving of data information. Search results from each sub-collection are passed to the global collection custodian and organized there before being passed to a system user.

    摘要翻译: 一种方法在大型数据存储和检索系统中存储,索引,搜索和检索数据信息。 检索和检索的大量数据信息被分解并存储在子集合中。 每个子集合分别执行仅包含在该子集合内的数据信息的索引并且形成反向索引。 从每个子集合的反向索引导出的统计信息由全局收集保管人收集并编译成全局索引。 然后将全局索引传递给每个子集合,并在数据信息的搜索和检索期间由每个子集合使用。 来自每个子集合的搜索结果将传递给全局收集保管人,并在传送给系统用户之前将其组织在一起。

    Methods and apparatus for communicating information in a supervised learning system
    6.
    发明授权
    Methods and apparatus for communicating information in a supervised learning system 有权
    在监督学习系统中传达信息的方法和装置

    公开(公告)号:US06931383B2

    公开(公告)日:2005-08-16

    申请号:US10689888

    申请日:2003-10-21

    IPC分类号: G06F15/18 G06N5/00

    CPC分类号: G06N99/005

    摘要: Apparatus for adding new learning tasks to an incremental supervised learner provides a flexible incremental representation of all encountered training examples, thereby permitting state representations for new learning tasks to take advantage of incremental training already completed by encoding all past training examples as negative examples for a hypothetical learning task. The state representation of the hypothetical learning task is copied as the initial state representation for a new learning task to be initiated, and is initialized with negative training examples of all previously presented training examples, thereby permitting the learning task to efficiently incorporate the previous examples.

    摘要翻译: 将新的学习任务添加到增量监督学习者的设备提供了所有遇到的训练示例的灵活的增量表示,从而允许新学习任务的状态表示利用已经通过编码所有过去的训练示例已经完成的增量训练作为假设的 学习任务 假设学习任务的状态表示被复制为要启动的新学习任务的初始状态表示,并且被初始化为所有先前呈现的训练示例的负训练示例,从而允许学习任务有效地并入先前的示例。

    Efficient and effective distributed information management

    公开(公告)号:US06567810B1

    公开(公告)日:2003-05-20

    申请号:US10023833

    申请日:2001-12-21

    IPC分类号: G06F1730

    摘要: A method stores, indexes, searches and retrieves data information in a large data storage and retrieval system. Large amounts of data information, subject to searching and retrieval, are broken down and stored in sub-collections. Each sub-collection separately performs indexing of only the data information contained within that sub-collection and forms an inverted index. Statistical information derived from the inverted index of each sub-collection is collected by a global collection custodian and compiled into a global index. The global index is then passed to each sub-collection and is used by each during searching and retrieving of data information. Search results from each sub-collection are passed to the global collection custodian and organized there before being passed to a system user.

    Efficient and effective distributed information management

    公开(公告)号:US6163782A

    公开(公告)日:2000-12-19

    申请号:US79073

    申请日:1998-05-14

    IPC分类号: G06F17/30

    摘要: A method stores, indexes, searches and retrieves data information in a large data storage and retrieval system. Large amounts of data information, subject to searching and retrieval, are broken down and stored in sub-collections. Each sub-collection separately performs indexing of only the data information contained within that sub-collection and forms an inverted index. Statistical information derived from the inverted index of each sub-collection is collected by a global collection custodian and compiled into a global index. The global index is then passed to each sub-collection and is used by each during searching and retrieving of data information. Search results from each sub-collection are passed to the global collection custodian and organized there before being passed to a system user.

    Document expansion in speech retrieval
    9.
    发明授权
    Document expansion in speech retrieval 有权
    语音检索中的文档扩展

    公开(公告)号:US07395207B1

    公开(公告)日:2008-07-01

    申请号:US11466815

    申请日:2006-08-24

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30746 G06F17/3074

    摘要: Methods of document expansion for a speech retrieval document by a recognizer. A database of vectors of automatic transcriptions of documents is accessed and the vectors are truncated by removing all terms that are not recognizable by the recognizer to create truncated vectors. Terms in the vectors are then weighted to associate the truncated vectors with the untruncated vectors. Terms not recognized by the recognizer are then added back to the weighted, truncated vectors. The retrieval effectiveness may then be measured.

    摘要翻译: 用于识别器的语音检索文档的文档扩展方法。 访问文档的自动转录的向量的数据库,并通过去除识别器不能识别的所有术语来创建截断的向量来截断向量。 然后对向量中的术语进行加权,将截断的载体与未经截断的载体相关联。 识别器未识别的术语然后被加回加权的截断向量。 然后可以测量检索效果。

    Methods and apparatus for communicating information in a supervised learning system
    10.
    发明授权
    Methods and apparatus for communicating information in a supervised learning system 有权
    在监督学习系统中传达信息的方法和装置

    公开(公告)号:US06523017B1

    公开(公告)日:2003-02-18

    申请号:US09563506

    申请日:2000-05-03

    IPC分类号: G06F1518

    CPC分类号: G06N99/005

    摘要: A method and apparatus for communicating accumulated state information between internal and external tasks in a supervised learning system. A supervised learning system encodes state information for a hypothetical learning task on initialization. This hypothetical learning task state information indicates that no training instances have been received. During the supervised learning, training instances are presented to the supervised learner. The training instances are encoded with feature vector and target value information. For each task name paired with a non-default target value, the learner initializes a new learning task by copying the hypothetical learning task state representation for use as the state representation for the new learning task. Predictors are then produced for all learning tasks, except the hypothetical learning task. The new training instance is used to update all learning tasks as specified in the target vector. The new training instance is then used.to update the hypothetical learning task state representation as a negative example. Further training instances are handled similarly, new learning tasks are started based on the examination of the sparse target vector for task name, target value pairs which match received training instance target values and for which tasks have not yet been started.

    摘要翻译: 一种用于在监督学习系统中在内部和外部任务之间传送累积状态信息的方法和装置。 监督学习系统在初始化时编码假设学习任务的状态信息。 这种假设的学习任务状态信息表示没有接收到训练实例。 在受监督的学习过程中,培训实例被提供给受监督的学习者。 训练实例使用特征向量和目标值信息进行编码。 对于与非默认目标值配对的每个任务名称,学习者通过复制假设的学习任务状态表达来初始化新的学习任务,以用作新学习任务的状态表示。 然后为所有学习任务生成预测因子,除了假设的学习任务。 新的训练实例用于更新目标向量中指定的所有学习任务。 然后使用新的训练实例来更新假设的学习任务状态表示作为一个负面例子。 进一步训练实例的处理方式类似,基于检查任务名称的稀疏目标向量,匹配接收到的训练实例目标值的目标值对以及哪些任务尚未开始,开始新的学习任务。