SYSTEM AND METHOD FOR COMMUNITY AIDED RESEARCH AND SHOPPING
    2.
    发明申请
    SYSTEM AND METHOD FOR COMMUNITY AIDED RESEARCH AND SHOPPING 有权
    社区辅助研究和购物的系统与方法

    公开(公告)号:US20090307100A1

    公开(公告)日:2009-12-10

    申请号:US12133100

    申请日:2008-06-04

    IPC分类号: G06Q30/00

    摘要: A system and method for providing product research help to a consumer doing internet shopping. A consumer helper application for internet commerce has been designed to integrate the techniques that are currently used by many internet consumers into a single application program. For example, the consumer helper application allows a user to take notes, store web site links, store web site annotations, obtain collaborative input, and perform other tasks commonly used by web site consumers. The consumer helper application stores all of the information gathered in a single place. Information provided by other consumers may be shared using the consumer helper application. The consumer helper application will suggest specific items for a consumer to purchase based up on the information gathered by the consumer into the consumer helper application.

    摘要翻译: 提供产品研究的系统和方法可帮助消费者进行互联网购物。 用于互联网商务的消费者帮助应用程序被设计为将许多互联网消费者当前使用的技术集成到单个应用程序中。 例如,消费者帮助应用程序允许用户记录,存储网站链接,存储网站注释,获得协作输入,以及执行网站消费者常用的其他任务。 消费者帮助应用程序将收集到的所有信息存储在一个地方。 其他消费者提供的信息可以使用消费者帮助应用程序共享。 消费者帮助应用程序将根据消费者收集到消费者帮助应用程序中的信息来建议消费者购买的具体项目。

    METHODS FOR AUTOMATIC GENERATION OF PARALLEL CORPORA
    3.
    发明申请
    METHODS FOR AUTOMATIC GENERATION OF PARALLEL CORPORA 有权
    自动生成并行公司的方法

    公开(公告)号:US20150248401A1

    公开(公告)日:2015-09-03

    申请号:US14587552

    申请日:2014-12-31

    IPC分类号: G06F17/28 G06Q30/06

    摘要: A method of forming parallel corpora comprises receiving sets of items in first language and second languages, each of the sets having one or more associated descriptions and metadata. The metadata is collected from the two sets of items and are aligned using the metadata. The aligned metadata are mapped from the first language to the second language for each of the sets. The descriptions of two items are fetched and the structural similarity of the descriptions is measured to assess whether two items are likely to be translations of each other. For mapped items with structurally similar descriptions, the mapped item descriptions are formed into respective sentences in first language and in the second language. The sentences are parallel corpora which may be used to translate an item from the first language to the second language, and also to train a machine translation system.

    摘要翻译: 形成并行语料库的方法包括以第一语言和第二语言接收项目集合,每个集合具有一个或多个相关联的描述和元数据。 元数据是从两组项目中收集的,并使用元数据对齐。 对于每个集合,对齐的元数据从第一语言映射到第二语言。 提取两个项目的描述,并测量描述的结构相似性,以评估两个项目是否可能是彼此的翻译。 对于具有结构相似描述的映射项目,映射项目描述被形成为第一语言和第二语言中的相应句子。 句子是可以用于将项目从第一语言翻译成第二语言的并行语料库,并且还训练机器翻译系统。

    MAPPING ITEM RECORDS TO PRODUCT RECORDS
    5.
    发明申请
    MAPPING ITEM RECORDS TO PRODUCT RECORDS 有权
    映射项目记录到产品记录

    公开(公告)号:US20100082592A1

    公开(公告)日:2010-04-01

    申请号:US12568600

    申请日:2009-09-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30604 G06Q30/08

    摘要: An item record in an item database contains an item description generated by a seller of an item. A server machine is configured to access the item database, analyze the item description, and extract descriptive information by inferring an attribute and a corresponding attribute value from the item description. The attribute and its attribute value constitute an attribute-value pair. The server machine uses the attribute-value pair to map the item record to a product record stored in a product database. The mapping of the item record to the product record is based on comparing the attribute-value pair of the item record to a reference attribute-value pair in the product record to identify the product record. The mapping is performed upon detection of a match between the attribute-value pairs.

    摘要翻译: 项目数据库中的项目记录包含由项目的卖方生成的项目描述。 服务器机器被配置为访问项目数据库,分析项目描述,并通过从项目描述中推断属性和对应的属性值来提取描述性信息。 属性及其属性值构成属性值对。 服务器机器使用属性值对将项目记录映射到存储在产品数据库中的产品记录。 项目记录与产品记录的映射是基于将产品记录中的属性值对与产品记录中的参考属性 - 值对进行比较,以识别产品记录。 在检测到属性值对之间的匹配时执行映射。

    IDENTIFICATION OF CONTENT IN AN ELECTRONIC DOCUMENT
    6.
    发明申请
    IDENTIFICATION OF CONTENT IN AN ELECTRONIC DOCUMENT 有权
    电子文件内容的识别

    公开(公告)号:US20090158138A1

    公开(公告)日:2009-06-18

    申请号:US11956687

    申请日:2007-12-14

    申请人: Jean-David Ruvini

    发明人: Jean-David Ruvini

    IPC分类号: G06F3/14

    摘要: In some embodiments, a method includes receiving an electronic document that comprises a plurality of sections. The method includes marking the plurality of sections as a content section or a non-content section using an attribute of the sections that includes at least one of a width of the section, a density of the plurality of hyperlinks in the section, a size of a font of text in the section and whether a title of the electronic document overlaps with text in the section. The method also includes storing the marking of the plurality of sections of the electronic document in a machine-readable medium.

    摘要翻译: 在一些实施例中,一种方法包括接收包括多个部分的电子文档。 该方法包括使用包括该部分的宽度,该部分中的多个超链接的密度,该部分中的多个超链接的密度中的至少一个的部分的属性来将多个部分标记为内容部分或非内容部分, 该部分中的文本字体以及电子文档的标题与该部分中的文本是否重叠。 该方法还包括将电子文档的多个部分的标记存储在机器可读介质中。

    DETERMINING AND USING BRAND INFORMATION IN ELECTRONIC COMMERCE
    9.
    发明申请
    DETERMINING AND USING BRAND INFORMATION IN ELECTRONIC COMMERCE 审中-公开
    在电子商务中确定和使用品牌信息

    公开(公告)号:US20140081701A1

    公开(公告)日:2014-03-20

    申请号:US13623697

    申请日:2012-09-20

    IPC分类号: G06Q30/02

    CPC分类号: G06Q30/02

    摘要: An apparatus and method for predicting a brand name of a product are disclosed herein. A product identification number for the product is converted into a normalized global trade item number (GTIN). For each of a plurality of GTIN prefixes corresponding to the normalized GTIN, brand names and counts of each of the brand names using product information stored in a product catalog are identified. A probability distribution of the brand names is determined in accordance with the brand names and the counts of the brand names for the plurality of the GTIN prefixes. A predicted brand name for the product is identified from among the brand names for the plurality of the GTIN prefixes, the predicted brand name having a highest probability score in the probability distribution of the brand names.

    摘要翻译: 本文公开了一种用于预测产品品牌名称的装置和方法。 产品的产品标识号被转换为标准化的全球贸易项目编号(GTIN)。 对于与归一化GTIN相对应的多个GTIN前缀中的每一个,识别使用存储在产品目录中的产品信息的每个品牌的品牌名称和计数。 根据品牌名称和多个GTIN前缀的品牌名称的计数确定品牌名称的概率分布。 从多个GTIN前缀的品牌名称中识别产品的预测品牌名称,在品牌名称的概率分布中具有最高概率得分的预测品牌名称。

    Catalog generation based on divergent listings
    10.
    发明授权
    Catalog generation based on divergent listings 有权
    基于不同列表的目录生成

    公开(公告)号:US08315917B2

    公开(公告)日:2012-11-20

    申请号:US13285893

    申请日:2011-10-31

    IPC分类号: G06Q30/00

    摘要: A method and a system are described for catalog generation based on divergent listings. For example, a listing module may receive a divergent listing and collect data corresponding to an item in the listing. Each divergent listing of an item for sale is divergent with respect to other listings of the same item for sale. A calculation module may calculate an entropy value based on the collected data, the entropy value indicating a degree of difference between the item in the listing and a product in a catalog. A catalog module may associate the listing to the product in the catalog based on the entropy value.

    摘要翻译: 描述了基于不同列表的目录生成的方法和系统。 例如,列表模块可以接收发散列表并收集对应于列表中的项目的数据。 出售物品的每个不同的列表对于同一物品的其他物品出售是不同的。 计算模块可以基于收集的数据计算熵值,熵值表示列表中的项目与目录中的产品之间的差异程度。 目录模块可以根据熵值将列表与目录中的产品相关联。