NAMED ENTITY EXTRACTING APPARATUS, METHOD, AND PROGRAM
    1.
    发明申请
    NAMED ENTITY EXTRACTING APPARATUS, METHOD, AND PROGRAM 有权
    NAMED ENTITY提取设备,方法和程序

    公开(公告)号:US20090119274A1

    公开(公告)日:2009-05-07

    申请号:US11916222

    申请日:2005-12-26

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/278

    摘要: A named entity extracting apparatus that extracts a named entity suitable for a user by enabling an order to be set in which the named entity is extracted from texts includes: an extraction order reading unit 103 that acquires a named entity pattern name stored in association with an extraction order in an extraction order storage unit 102; a named entity extracting unit 105 that extracts the named entity from input texts using a named entity pattern corresponding to the named entity pattern name acquired by the extraction order reading unit 103; and an extraction end judging unit 106 which outputs, in the case where extraction has not ended, a text on which the extraction is in progress to the extraction order reading unit 103, and continues the named entity extraction processing.

    摘要翻译: 一种命名实体提取装置,其通过启用从文本中提取命名实体的命令来提取适合于用户的命名实体,包括:提取顺序读取单元103,其获取与 提取顺序存储单元102中的提取顺序; 命名实体提取单元105,其使用与由提取顺序读取单元103获取的命名实体模式名称对应的命名实体模式从输入文本中提取命名实体; 以及提取结束判断单元106,其在提取未结束的情况下,将提取的文本输出到提取顺序读取单元103,并继续进行命名实体提取处理。

    CONTENT SEARCHING DEVICE AND CONTENT SEARCHING METHOD
    2.
    发明申请
    CONTENT SEARCHING DEVICE AND CONTENT SEARCHING METHOD 有权
    内容搜索设备和内容搜索方法

    公开(公告)号:US20100293169A1

    公开(公告)日:2010-11-18

    申请号:US12672085

    申请日:2009-03-02

    IPC分类号: G06F17/30

    摘要: To provide a content searching device which can efficiently present to the user a topical related keyword.A content searching device (100), which searches content from a content database with a use of a related keyword, includes: a related segment calculating unit (106) which calculates, for each content attribute, a related segment which is defined in order for first content and second content to be included in a same time segment, the related segment being calculated based on whether or not a degree of difference, for each content attribute, calculated out of a plurality of first keywords and a plurality of second keywords meets a predetermined reference value, the plurality of the first keywords each describing the first content to be stored in the content database (101), and the plurality of the second keywords each describing the second content having been stored in the content database (101); and a dictionary updating unit (107) which updates a degree of relevance stored in a dictionary database (102), the degree of relevance being updated with a use of the related segment, and the degree of relevance, among the plurality of keywords, being calculated for each content attribute.

    摘要翻译: 提供一种内容搜索装置,其能够向用户有效地呈现主题相关的关键字。 一种内容搜索装置(100),其使用相关的关键字搜索来自内容数据库的内容,包括:相关段计算单元(106),其针对每个内容属性计算相关段,所述相关段按照 第一内容和第二内容将被包括在相同的时间段中,相关段是基于由多个第一关键字和多个第二关键字计算出的每个内容属性的差异程度是否满足 预定参考值,每个描述要存储在内容数据库(101)中的第一内容的多个第一关键字以及每个描述已经存储在内容数据库中的第二内容的多个第二关键字; 以及字典更新单元(107),其更新存储在字典数据库(102)中的相关程度,使用相关段更新相关程度,以及所述多个关键字之间的相关程度是 为每个内容属性计算。

    Named entity extracting apparatus, method, and program
    3.
    发明授权
    Named entity extracting apparatus, method, and program 有权
    命名实体提取装置,方法和程序

    公开(公告)号:US07761437B2

    公开(公告)日:2010-07-20

    申请号:US11916222

    申请日:2005-12-26

    IPC分类号: G06F7/00

    CPC分类号: G06F17/278

    摘要: A named entity extracting apparatus that extracts a named entity suitable for a user by enabling an order to be set in which the named entity is extracted from texts includes: an extraction order reading unit 103 that acquires a named entity pattern name stored in association with an extraction order in an extraction order storage unit 102; a named entity extracting unit 105 that extracts the named entity from input texts using a named entity pattern corresponding to the named entity pattern name acquired by the extraction order reading unit 103; and an extraction end judging unit 106 which outputs, in the case where extraction has not ended, a text on which the extraction is in progress to the extraction order reading unit 103, and continues the named entity extraction processing.

    摘要翻译: 一种命名实体提取装置,其通过启用从文本中提取命名实体的命令来提取适合于用户的命名实体,包括:提取顺序读取单元103,其获取与 提取顺序存储单元102中的提取顺序; 命名实体提取单元105,其使用与由提取顺序读取单元103获取的命名实体模式名称对应的命名实体模式从输入文本中提取命名实体; 以及提取结束判断单元106,其在提取未结束的情况下,将提取的文本输出到提取顺序读取单元103,并继续进行命名实体提取处理。

    Related word presentation device
    4.
    发明授权
    Related word presentation device 有权
    相关词汇表示设备

    公开(公告)号:US08504357B2

    公开(公告)日:2013-08-06

    申请号:US12521927

    申请日:2008-07-30

    IPC分类号: G06F17/21

    CPC分类号: G06F17/3064

    摘要: A related word presentation device includes a program information storage unit that stores program information of each program; and an information dividing unit that generates, for each of the attributes of the words included in the program information, at least one group which includes a reference word belonging to the attribute and a set of words which co-occur with the reference word in a program. A degree-of-relevance calculating unit stores attribute-based association dictionaries each of which indicates, for the corresponding attribute of words, (i) the words and (ii) the degrees of relevance between the words calculated based on the frequency of co-occurrence in each of groups. A search condition obtaining unit obtains the search word and the attribute; a substitute word obtaining unit selects substitute words from the attribute-based association dictionary for the obtained attribute; and an output unit presents the selected substitute word.

    摘要翻译: 相关词表示装置包括存储每个节目的节目信息的节目信息存储单元; 以及信息划分单元,对于包括在节目信息中的单词的每个属性,生成至少一个组,其包括属于该属性的参考单词和与该参考单词共同出现的一组单词 程序。 相关度计算单元存储基于属性的关联字典,每个字典对于相应的单词属性指示(i)单词和(ii)基于共同的频率计算的单词之间的相关程度, 发生在每组。 搜索条件获取单元获取搜索词和属性; 替代词获取单元从所获得的属性的基于属性的关联词典中选择替代词; 并且输出单元呈现所选择的替代词。

    Informationn retrieval apparatus
    5.
    发明授权
    Informationn retrieval apparatus 有权
    信息检索装置

    公开(公告)号:US08108407B2

    公开(公告)日:2012-01-31

    申请号:US12447333

    申请日:2007-11-06

    IPC分类号: G06F7/00

    CPC分类号: G06F17/3064

    摘要: An information retrieval apparatus, which can present to a user only a related word matching a user search intent, includes: an associative dictionary storage unit for storing words included in plural pieces of text to be searched and relevance degrees between the words; an appearance frequency storage unit for storing an appearance frequency that is the number of pieces of text in which the words stored in the associative dictionary storage unit appear, among the plural pieces of text to be searched; and a related word obtaining unit that obtains a related word to be presented to the user, from the relevance degree between the search word entered by the user and another word among the words, the appearance frequency, and the user search intent.

    摘要翻译: 一种信息检索装置,其仅向用户呈现与用户搜索意图相匹配的相关字,包括:关联词典存储单元,用于存储多个被搜索文本中包含的单词和单词之间的相关度; 出现频率存储单元,用于存储作为要搜索的多条文本中出现的存储在关联词典存储单元中的单词的文本数目的出现频率; 以及相关词获取单元,从用户输入的搜索词与单词中的另一个单词,出现频率和用户搜索意图之间的相关程度,获得要呈现给用户的相关单词。

    Contents retrieval device for retrieving contents that user wishes to view from among a plurality of contents
    6.
    发明授权
    Contents retrieval device for retrieving contents that user wishes to view from among a plurality of contents 有权
    用于从多个内容中检索用户希望观看的内容的内容检索装置

    公开(公告)号:US07831610B2

    公开(公告)日:2010-11-09

    申请号:US12521926

    申请日:2008-08-05

    IPC分类号: G06F7/00

    CPC分类号: G06F17/3064

    摘要: A contents retrieval device (100) presenting an appropriate related keyword to a user even when an object user wishes to retrieve dynamically changes. The contents retrieval device (100) includes a contents estimation unit (107) retrieving contents according to a search keyword, a document space database (103) storing document spaces according to an occurrence frequency of the keyword, a document space selection unit (104) selecting a the narrowing-down document space and an expansion document space from the document space database (103) according to the search keyword and the occurrence frequency of the document space indicating a degree of relevance with the contents according to the search keyword, a related keyword estimation unit (108) selecting keywords corresponding to the narrowing-down document space and the expansion document space as a narrowing-down keyword and an expansion keyword, respectively, and an output unit displaying the selected narrowing-down and expansion keywords.

    摘要翻译: 一种内容检索装置(100),即使当对象用户想要动态地检索时,向用户呈现适当的相关关键字。 内容检索装置(100)包括根据搜索关键字检索内容的内容估计单元(107),根据关键字的发生频率存储文档空间的文档空间数据库(103),文档空间选择单元(104) 根据搜索关键词和文档空间的出现频率,根据搜索关键词,从文档空间数据库(103)中选择缩小文档空间和扩展文档空间,指示与内容相关程度的文档空间的出现频度,相关的 关键词估计单元(108)分别选择与缩小文档空间相对应的关键字和扩展文档空间作为缩小关键字和展开关键字,以及输出单元,显示所选择的缩小和展开关键字。

    INFORMATION RETRIEVAL APPARATUS
    7.
    发明申请
    INFORMATION RETRIEVAL APPARATUS 有权
    信息检索设备

    公开(公告)号:US20100100541A1

    公开(公告)日:2010-04-22

    申请号:US12447333

    申请日:2007-11-06

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3064

    摘要: An information retrieval apparatus, which can present to a user only a related word matching a user search intent, includes: an associative dictionary storage unit (102A) for storing words included in plural pieces of text to be searched and relevance degrees between the words; an appearance frequency storage unit (102B) for storing an appearance frequency that is the number of pieces of text in which the words stored in the associative dictionary storage unit (102A) appear, among the plural pieces of text to be searched; and a related word obtaining unit (104) that obtains a related word to be presented to the user, from the relevance degree between the search word entered by the user and another word among the words, the appearance frequency, and the user search intent.

    摘要翻译: 一种信息检索装置,其仅向用户呈现与用户搜索意图相匹配的相关字,包括:用于存储多个待搜索的文本中包含的单词和词之间的相关度的关联字典存储单元(102A) 出现频率存储单元,用于存储作为要搜索的多条文本中出现在存储在关联字典存储单元(102A)中的字的文本数目的出现频率; 以及从用户输入的搜索词与单词中的另一个单词,出现频率和用户搜索意图之间的相关程度,获得要呈现给用户的相关单词的相关单词获取单元(104)。

    INTERACTIVE PROGRAM SEARCH APPARATUS
    8.
    发明申请
    INTERACTIVE PROGRAM SEARCH APPARATUS 有权
    互动计划搜索设备

    公开(公告)号:US20100114884A1

    公开(公告)日:2010-05-06

    申请号:US12594287

    申请日:2009-02-18

    IPC分类号: G06F17/30

    摘要: In an interactive program search apparatus (100) which presents search condition candidates for expanding or narrowing down search results, reason words indicating the reason why the search condition candidates are presented are adaptively determined based on user's preference, search actions, and watching actions. An association-source word extracting unit (109) extracts an association-source word from the program search results, and an associated word extracting unit (110) extracts associated words associated with the association-source word, from an association dictionary storage unit (103). A reason word extracting unit (111) extracts reason words illustrating the relationships between the association-source word and the associated words, using the association-source word, the associated words, and the obtainment history information composed of words included in the program information of the programs selected by the user in the past and selected words among the words.

    摘要翻译: 在提供用于扩展或缩小搜索结果的搜索条件候选的交互式节目搜索装置(100)中,基于用户的偏好,搜索动作和观看动作来自适应地确定指示搜索条件候选的原因的原因词。 关联源词提取单元(109)从程序搜索结果中提取关联源字,并且相关联的字提取单元(110)从关联词典存储单元(103)中提取与关联源字相关联的关联词 )。 原因词提取单元(111)使用关联源字,关联词和由包含在程序信息中包含的单词组成的获取历史信息来提取示出关联源词和关联词之间的关系的原因词 过去用户选择的程序以及单词中选定的单词。

    RELATED WORD PRESENTATION DEVICE
    9.
    发明申请
    RELATED WORD PRESENTATION DEVICE 有权
    相关信息显示装置

    公开(公告)号:US20100042405A1

    公开(公告)日:2010-02-18

    申请号:US12521927

    申请日:2008-07-30

    IPC分类号: G06F17/21 G06N5/02 G06F17/30

    CPC分类号: G06F17/3064

    摘要: A related word presentation device (100) for appropriately performing omission prevention search includes: a program information storage unit (101) which stores program information (101a) of each program; an information dividing unit (103a) which generates, for each of the attributes of the words included in the program information (101a), at least one group which includes, as a unit, a reference word which is a word belonging to the attribute and a set of words which co-occur with the reference word in a program; a degree-of-relevance calculating unit (103b) which stores, in an association dictionary storage unit (102), attribute-based association dictionaries (102a, 102b, 102c) each of which indicates, for the corresponding attribute of words, (i) the words and (ii) the degrees of relevance between the words calculated based on the frequency of co-occurrence in each of groups; a search condition obtaining unit (104) which obtains the search word and the attribute; a substitute word obtaining unit (105) which selects substitute words from the attribute-based association dictionary for the obtained attribute; and an output unit (106) which presents the selected substitute word.

    摘要翻译: 用于适当地执行省略预防搜索的相关词表示装置(100)包括:存储每个节目的节目信息(101a)的节目信息存储单元(101) 信息分割单元,对于包括在节目信息(101a)中的单词的每个属性,生成包括作为属于该属性的单词的参考单元的至少一个组,以及 与程序中的参考单词共同出现的一组单词; 相关度计算单元(103b),其在关联字典存储单元(102)中存储基于属性的关联字典(102a,102b,102c),每个字典表示对应的单词属性(i )和(ii)基于每组中共同发生频率计算的词之间的相关程度; 获取搜索词和属性的搜索条件获取单元(104); 替代词获取单元,从所述基于属性的关联词典中为所获得的属性选择替代词; 以及呈现所选择的替代词的输出单元(106)。

    Interactive program search apparatus
    10.
    发明授权
    Interactive program search apparatus 有权
    互动节目搜索装置

    公开(公告)号:US08161043B2

    公开(公告)日:2012-04-17

    申请号:US12594287

    申请日:2009-02-18

    IPC分类号: G06F7/00 G06F17/30

    摘要: In an interactive program search apparatus (100) which presents search condition candidates for expanding or narrowing down search results, reason words indicating the reason why the search condition candidates are presented are adaptively determined based on user's preference, search actions, and watching actions. An association-source word extracting unit (109) extracts an association-source word from the program search results, and an associated word extracting unit (110) extracts associated words associated with the association-source word, from an association dictionary storage unit (103). A reason word extracting unit (111) extracts reason words illustrating the relationships between the association-source word and the associated words, using the association-source word, the associated words, and the obtainment history information composed of words included in the program information of the programs selected by the user in the past and selected words among the words.

    摘要翻译: 在提供用于扩展或缩小搜索结果的搜索条件候选的交互式节目搜索装置(100)中,基于用户的偏好,搜索动作和观看动作来自适应地确定指示搜索条件候选的原因的原因词。 关联源词提取单元(109)从程序搜索结果中提取关联源字,并且相关联的字提取单元(110)从关联词典存储单元(103)中提取与关联源字相关联的关联词 )。 原因词提取单元(111)使用关联源字,关联词和由包含在程序信息中包含的单词组成的获取历史信息来提取示出关联源词和关联词之间的关系的原因词 过去用户选择的程序以及单词中选定的单词。