METHODS AND APPARATUS FOR GENERATING A DATA DICTIONARY
    1.
    Type: Patent application
    Status: In force

    Publication No.: US20120191717A1

    Publication date: 2012-07-26

    Application No.: US13428544

    Filing date: 2012-03-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30598 G06F17/30731

    Abstract: There is provided an e-commerce method and system to generate a data dictionary for searching data items stored in a database. In one embodiment, the system comprises a candidate list generator module to generate a list of keywords from search query information and generate a set of token pairs, each including a keyword from the list of keywords and a token, the token being a synonym of the keyword. Demand information retrieved from query logs maintained for user-provided query entries is used to apply candidate selection rules to token pairs. The system also comprises a validation module and a data dictionary module to receive validated token pairs as entries in a vocabulary.
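The candidate-generation step described in this abstract can be illustrated with a minimal sketch. The abstract does not state the candidate selection rules, so the rule used here (both the keyword and its synonym must appear in the query log at least `min_demand` times) is purely an assumption, as are the function names and the `synonyms` lookup table.

```python
from collections import Counter


def demand_from_logs(query_log):
    """Count how often each word appears across logged user queries."""
    demand = Counter()
    for query in query_log:
        demand.update(query.lower().split())
    return demand


def candidate_pairs(demand, synonyms, min_demand=2):
    """Build (keyword, token) pairs, keeping those that pass the demand rule.

    `synonyms` is a hypothetical lookup from a keyword to its possible
    synonyms; the threshold rule is an assumption, not the patented rule.
    """
    pairs = []
    for keyword in sorted(demand):
        if demand[keyword] < min_demand:
            continue
        for token in synonyms.get(keyword, ()):
            if demand[token] >= min_demand:
                pairs.append((keyword, token))
    return pairs
```

Given a log in which "couch" and "sofa" are each searched twice and "settee" once, only the pair ("couch", "sofa") would survive with the default threshold. A validation step, as the abstract mentions, would then decide which surviving pairs enter the vocabulary.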

    Methods and apparatus for generating a data dictionary
    2.
    Type: Granted patent
    Status: In force

    Publication No.: US08145662B2

    Publication date: 2012-03-27

    Application No.: US12347938

    Filing date: 2008-12-31

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30598 G06F17/30731

    Abstract: There is provided a method and system to generate a data dictionary for searching data items stored in an information resource. In one embodiment, the system generates a list of synonyms for keywords entered in search queries to the system. A keyword and a synonym form a token pair. Token pairs are evaluated according to a bidirectional divergence value calculated for distributions of search results, wherein the searches are based on the token pairs. Token pairs are then selected based on the divergence value. The selected token pairs are compiled into a data dictionary. In one embodiment, the data dictionary is a synonym dictionary used for user search query expansion to find matching items.
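To make the evaluation step concrete, here is a minimal sketch of scoring a token pair by a bidirectional divergence. The abstract does not name the divergence measure or say what the search-result distributions range over, so this sketch assumes a symmetrized Kullback-Leibler divergence over the categories of returned items, with additive smoothing; the function names and the threshold are hypothetical.

```python
import math
from collections import Counter


def distribution(categories, vocabulary, alpha=1.0):
    """Smoothed probability of each category among one search's results."""
    counts = Counter(categories)
    total = len(categories) + alpha * len(vocabulary)
    return {c: (counts[c] + alpha) / total for c in vocabulary}


def kl(p, q):
    """Kullback-Leibler divergence D(p || q) over a shared vocabulary."""
    return sum(p[c] * math.log(p[c] / q[c]) for c in p)


def bidirectional_divergence(results_a, results_b, alpha=1.0):
    """Assumed measure: D(p || q) + D(q || p), so neither direction dominates."""
    vocabulary = sorted(set(results_a) | set(results_b))
    p = distribution(results_a, vocabulary, alpha)
    q = distribution(results_b, vocabulary, alpha)
    return kl(p, q) + kl(q, p)


def select_pairs(candidates, threshold):
    """Keep token pairs whose result distributions are close enough.

    `candidates` maps (keyword, synonym) to a tuple of two lists: the result
    categories for the keyword search and for the synonym search.
    """
    return [pair for pair, (ra, rb) in candidates.items()
            if bidirectional_divergence(ra, rb) <= threshold]
```

A low value suggests the two terms retrieve similar items and are plausible synonyms for query expansion; a high value suggests they lead to different parts of the inventory. Smoothing keeps the logarithm defined when a category appears for only one of the two searches.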

    METHODS AND APPARATUS FOR GENERATING A DATA DICTIONARY
    3.
    Type: Patent application
    Status: In force

    Publication No.: US20100169361A1

    Publication date: 2010-07-01

    Application No.: US12347938

    Filing date: 2008-12-31

    IPC classification: G06F17/30

    CPC classification: G06F17/30598 G06F17/30731

    Abstract: There is provided a method and system to generate a data dictionary for searching data items stored in an information resource. In one embodiment, the system generates a list of synonyms for keywords entered in search queries to the system. A keyword and a synonym form a token pair. Token pairs are evaluated according to a bidirectional divergence value calculated for distributions of search results, wherein the searches are based on the token pairs. Token pairs are then selected based on the divergence value. The selected token pairs are compiled into a data dictionary. In one embodiment, the data dictionary is a synonym dictionary used for user search query expansion to find matching items.

    Population of sets using advanced queries

    Publication No.: US10831837B2

    Publication date: 2020-11-10

    Application No.: US12610038

    Filing date: 2009-10-30

    Applicant: Karin Mauge

    Inventor: Karin Mauge

    Abstract: A method and a system are described for generation of sets of alternative terms based on queries received from users. For example, a query module may receive a query comprising syntax indicating alternative terms and may parse the alternative terms from the query. A frequency module forms groups of alternative terms from the parsed alternative terms and determines a first number of occurrences corresponding to each of the groups based on the received query and previous queries. For a first pair of the groups comprising a first alternative term and a second alternative term, a threshold module adds the first alternative term to an existing set of terms that already includes the second alternative term. The addition is based on a second number of occurrences of the first alternative term and at least one other member of the existing set of terms.
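The three modules named in this abstract (query, frequency, threshold) can be sketched roughly as follows. The abstract does not specify the query syntax or the thresholds, so this sketch assumes that alternatives are written as a parenthesized, comma-separated group such as `(sneakers, trainers)` and that a single `min_count` serves for both occurrence checks; all names are hypothetical.

```python
import re
from collections import Counter
from itertools import combinations

# Assumed syntax: a parenthesized, comma-separated group of alternatives.
GROUP = re.compile(r"\(([^()]*)\)")


def parse_alternatives(query):
    """Query module: extract each group of two or more alternative terms."""
    groups = []
    for body in GROUP.findall(query):
        terms = sorted({t.strip().lower() for t in body.split(",") if t.strip()})
        if len(terms) >= 2:
            groups.append(tuple(terms))
    return groups


def pair_counts(queries):
    """Frequency module: count how often two terms are given as alternatives."""
    counts = Counter()
    for query in queries:
        for group in parse_alternatives(query):
            counts.update(combinations(group, 2))
    return counts


def maybe_add(term, anchor, existing, counts, min_count=2):
    """Threshold module: add `term` to `existing` (which contains `anchor`)
    only if it co-occurs often enough with the anchor and with at least one
    other member of the set. Returns a new set; `existing` is not modified.
    """
    def seen(a, b):
        return counts[tuple(sorted((a, b)))]

    if anchor not in existing or seen(term, anchor) < min_count:
        return set(existing)
    others = [m for m in existing if m not in (anchor, term)]
    if any(seen(term, m) >= min_count for m in others):
        return set(existing) | {term}
    return set(existing)
```

Requiring support from a second member of the set, as the abstract describes, guards against a term being merged on the strength of a single ambiguous pairing.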

    MATCHING BRANDS AND SIZES
    5.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20140129390A1

    Publication date: 2014-05-08

    Application No.: US14070372

    Filing date: 2013-11-01

    IPC classification: G06Q30/06

    CPC classification: G06Q30/0629 G06Q30/0631

    Abstract: Consumers face a problem matching the brand and size of one product to a different brand and size of a related product because not all brands measure sizes in the same way. As such, internet commerce companies recommend shoe sizes to users based upon a user-specified size or a user-specified pair of shoes and its size in a brand that the user suggests. Data in a peer-to-peer marketplace is mined to determine corresponding shoe sizes across various brands.

    POPULATION OF SETS USING ADVANCED QUERIES
    6.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20110106828A1

    Publication date: 2011-05-05

    Application No.: US12610038

    Filing date: 2009-10-30

    Applicant: Karin Mauge

    Inventor: Karin Mauge

    IPC classification: G06F17/30

    Abstract: A method and a system are described for generation of sets of alternative terms based on queries received from users. For example, a query module may receive a query comprising syntax indicating alternative terms and may parse the alternative terms from the query. A frequency module forms groups of alternative terms from the parsed alternative terms and determines a first number of occurrences corresponding to each of the groups based on the received query and previous queries. For a first pair of the groups comprising a first alternative term and a second alternative term, a threshold module adds the first alternative term to an existing set of terms that already includes the second alternative term. The addition is based on a second number of occurrences of the first alternative term and at least one other member of the existing set of terms.