-
公开(公告)号:US10726060B1
公开(公告)日:2020-07-28
申请号:US14749554
申请日:2015-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Archiman Dutta , Shoubhik Bhattacharya , Subhadeep Chakraborty , Deepak Kumar Nayak , Nathan Rose , Avik Sinha
Abstract: A technology for determining accuracy estimates for classifications used in an electronic catalog. In one example, classifications for product groupings included in an electronic catalog may be updated as a result of the classifications inaccurately representing products included in the product groupings. The electronic catalog of products may be grouped into a plurality of product groupings using classifications. Classifications of product groupings that inaccurately represent products included in the product grouping may be updated with suggested classifications. Update metrics for updates made to the grouping classifications may be collected and the update metrics may be used to calculate an accuracy estimate for the classifications used in the electronic catalog.
-
公开(公告)号:US10339470B1
公开(公告)日:2019-07-02
申请号:US14967174
申请日:2015-12-11
Applicant: Amazon Technologies, Inc.
Inventor: Archiman Dutta , Rahul Gupta , Subhadeep Chakraborty , Dhinesh Kumar Dhanasekaran , Deepak Kumar Nayak , Avik Sinha
IPC: G06N20/00
Abstract: Techniques are provided herein for utilizing a classification engine to improve a classification model. For example, a classification engine may derive a statistical model based at least in part on a synthetic data set. A misclassification may be determined based at least in part on an output of the statistical model. An audit question may be provided to an individual, the audit question being determined based at least in part on the determined misclassification. Response data related to the audit question may be received. The statistical model may be validated based at least in part on the response data.
-
公开(公告)号:US10217080B1
公开(公告)日:2019-02-26
申请号:US14670259
申请日:2015-03-26
Applicant: Amazon Technologies, Inc.
Inventor: Archiman Dutta
Abstract: Methods, systems, and computer-readable media for item classification using customer-visible attributes are disclosed. A plurality of terms are determined that describe a plurality of items in a marketplace. Individual ones of the items are classified in a hierarchical taxonomy comprising a plurality of classifications, and individual ones of the terms correspond to individual ones of the classifications. A description of a new item is received. The description of the new item comprises a plurality of customer-visible terms. One or more of the plurality of classifications in the hierarchical taxonomy are selected for the new item. The one or more classifications are selected for the new item based at least in part on automated matching of individual ones of the customer-visible terms to individual ones of the terms that correspond to individual ones of the classifications.
-
公开(公告)号:US09830344B2
公开(公告)日:2017-11-28
申请号:US14620504
申请日:2015-02-12
Applicant: Amazon Technologies, Inc.
Inventor: Archiman Dutta
CPC classification number: G06F17/30327 , G06F17/30598
Abstract: Disclosed are various embodiments for assessing the quality of a node that comprises a collection of items containing textual data. The homogeneity of the node can be related to its quality. Highly ranked descriptive terms used in the node are identified and quality score is calculated that provides a measure of the quality of the node. Additionally, a node can be examined for outliers to improve node quality.
-
公开(公告)号:US20150161187A1
公开(公告)日:2015-06-11
申请号:US14620504
申请日:2015-02-12
Applicant: Amazon Technologies, Inc.
Inventor: Archiman Dutta
IPC: G06F17/30
CPC classification number: G06F17/30327 , G06F17/30598
Abstract: Disclosed are various embodiments for assessing the quality of a node that comprises a collection of items containing textual data. The homogeneity of the node can be related to its quality. Highly ranked descriptive terms used in the node are identified and quality score is calculated that provides a measure of the quality of the node. Additionally, a node can be examined for outliers to improve node quality.
Abstract translation: 公开了用于评估包括包含文本数据的项目的集合的节点的质量的各种实施例。 节点的同质性可以与其质量有关。 识别在节点中使用的高排名的描述性术语,并且计算质量得分,其提供节点质量的度量。 另外,可以检查节点的异常值以提高节点质量。
-
-
-
-