Abstract:
An apparatus and method for predicting a brand name of a product are disclosed herein. A product identification number for the product is converted into a normalized global trade item number (GTIN). For each of a plurality of GTIN prefixes corresponding to the normalized GTIN, brand names and a count for each brand name are identified using product information stored in a product catalog. A probability distribution of the brand names is determined in accordance with the brand names and their counts for the plurality of GTIN prefixes. A predicted brand name for the product is identified from among the brand names for the plurality of GTIN prefixes, the predicted brand name having the highest probability score in the probability distribution.
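The prefix-counting idea in this abstract can be sketched roughly as follows. This is a minimal illustration, not the disclosed method: the abstract does not say how counts from different prefixes are combined, so the sketch simply sums them and normalizes. The names `normalize_gtin`, `predict_brand`, the prefix lengths, and the catalog layout (a list of `(gtin, brand)` pairs) are all assumptions, and check-digit validation is omitted.

```python
from collections import Counter


def normalize_gtin(product_id: str) -> str:
    """Keep digits only and left-pad with zeros to 14 digits (assumed normal form)."""
    digits = "".join(ch for ch in product_id if ch.isdigit())
    if not digits or len(digits) > 14:
        raise ValueError(f"cannot normalize {product_id!r} to a 14-digit GTIN")
    return digits.zfill(14)


def predict_brand(product_id, catalog, prefix_lengths=(7, 8, 9, 10)):
    """Return (predicted_brand, distribution) for a product identifier.

    catalog is a hypothetical list of (normalized_gtin, brand) pairs.
    Counts from every prefix length are summed (an assumption), then
    normalized into a probability distribution over brand names.
    """
    gtin = normalize_gtin(product_id)
    totals = Counter()
    for length in prefix_lengths:
        prefix = gtin[:length]
        for item_gtin, brand in catalog:
            if item_gtin.startswith(prefix):
                totals[brand] += 1
    total = sum(totals.values())
    if total == 0:
        return None, {}
    distribution = {brand: count / total for brand, count in totals.items()}
    best = max(distribution, key=distribution.get)
    return best, distribution


catalog = [
    ("00012345678905", "Acme"),
    ("00012345600001", "Acme"),
    ("00012349999999", "Bolt"),
]
brand, distribution = predict_brand("012345678912", catalog)
```

Because longer prefixes match fewer catalog entries, summing counts across lengths implicitly gives more weight to brands that share a longer prefix with the query; a real system might weight prefix lengths explicitly instead.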
Abstract:
Techniques for providing improved product suggestions for related items are described. According to various embodiments, a product listing webpage associated with a retailer website that describes a specific product may be crawled, the specific product being included in a product inventory of the retailer website. Thereafter, product relations information associated with the specific product may be identified in the product listing webpage, the product relations information describing a group of one or more additional products in the product inventory having a particular relationship with the specific product. The product relations information may then be transposed to a second product inventory associated with a second retailer.
Abstract:
Techniques for competitive pricing analysis and inventory management are described. According to various exemplary embodiments, a competitive pricing system is configured to crawl competitor websites for comparative pricing information at various time intervals. Moreover, the competitive pricing system is configured to determine if a price for an item on a home retailer website represents a “deal”, based on information crawled from competitor websites. According to various exemplary embodiments, a managed inventory repository system may enable improved identification of deals and specials within an inventory of a retailer website. For example, the managed inventory repository system may perform data mining operations to identify deals or specials offered for inventory items on a home retailer website. In some embodiments, a “special” can be defined in a variety of ways to suit different business units, campaigns, metrics, etc.
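As a rough illustration of the "deal" determination, the sketch below flags an item when the home retailer's price undercuts the lowest crawled competitor price by some margin. The abstract leaves the definition of a deal open, so the rule, the function name `is_deal`, and the `min_discount` parameter are assumptions chosen for the example.

```python
def is_deal(home_price, competitor_prices, min_discount=0.05):
    """Return True if home_price beats the cheapest competitor by min_discount.

    competitor_prices is a hypothetical list of prices crawled from
    competitor websites for the same item. With no competitor data the
    item is not flagged.
    """
    if not competitor_prices:
        return False
    lowest = min(competitor_prices)
    return home_price <= lowest * (1 - min_discount)


print(is_deal(90.0, [100.0, 110.0]))  # home price is 10% below the lowest competitor
print(is_deal(98.0, [100.0, 110.0]))  # only 2% below, under the 5% margin
```

Keeping the threshold as a parameter reflects the abstract's remark that a "special" can be defined differently for different business units or campaigns.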
Abstract:
Methods and systems for text conversion are described. In one embodiment, free-form text associated with an item may be received. The item may be identified based on the free-form text. The item may be compatible with a parent item. The parent item may be identified based on the free-form text. An item descriptor may be identified in the free-form text. The item descriptor may be a particular term of the free-form text. Compatibility-based text may be constructed for the item based on identification of the parent item and the item descriptor. The compatibility-based text may be capable of being used to identify a plurality of matching items. Additional methods and systems are disclosed.
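One way to picture the conversion is a lookup against known parent items and descriptor terms. The sketch below is only an assumption about how such a step might look: the function name `build_compatibility_text`, the two vocabularies, and the output template "X compatible with Y" are hypothetical and not taken from the abstract.

```python
def build_compatibility_text(free_text, known_parents, known_descriptors):
    """Build compatibility-based text from a free-form listing title.

    known_parents and known_descriptors are hypothetical vocabularies.
    Matching is a case-insensitive substring test; the first hit wins.
    Returns None when either the parent item or the descriptor is missing.
    """
    lowered = free_text.lower()
    parent = next((p for p in known_parents if p.lower() in lowered), None)
    descriptor = next((d for d in known_descriptors if d.lower() in lowered), None)
    if parent is None or descriptor is None:
        return None
    return f"{descriptor} compatible with {parent}"


text = build_compatibility_text(
    "New replacement BATTERY for nokia 6300 - fast shipping",
    known_parents=["Nokia 6300", "Canon EOS 5D"],
    known_descriptors=["battery", "charger"],
)
```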
Abstract:
A data file describes a document to be generated and stores an instruction to provide constituent data of the document via a network. The data file is accessed at a local computer, and the document is generated based on the data file. The generated document is communicated via the network to a remote computer. At the local computer, the instruction to provide constituent data is processed using a processor of the local computer. The constituent data is provided via the network to the remote computer as an update of the generated document.
Abstract:
Techniques for mapping item listings from a first taxonomy to a second taxonomy are described. In an example embodiment, item listings from a first database storing a first taxonomy and item listings from a second database storing a second taxonomy are obtained. Then, for each of the obtained item listings, a plurality of features is extracted, including at least one feature related to an image associated with the item listing and at least one feature related to text associated with the item listing. Then a mapping between item listings in the first taxonomy and item listings in the second taxonomy is created based on the extracted plurality of features, wherein the mapping identifies which item listings in the first taxonomy correspond to the same product as which item listings in the second taxonomy.
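The combination of a text feature and an image feature can be sketched as a weighted similarity score. Everything specific here is an assumption for illustration: token-set Jaccard similarity stands in for the text feature, cosine similarity over a precomputed vector stands in for the image feature, and the names `map_listings`, `text_weight`, and `threshold` are hypothetical.

```python
import math


def jaccard(a, b):
    """Token-set Jaccard similarity between two titles."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    union = tokens_a | tokens_b
    return len(tokens_a & tokens_b) / len(union) if union else 0.0


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def map_listings(first, second, threshold=0.5, text_weight=0.5):
    """Map each listing id in `first` to its best match in `second`.

    Each listing is a dict with a "title" string and an "image_vec" list
    (assumed to be an image feature vector computed elsewhere). Listings
    whose best combined score falls below `threshold` are left unmapped.
    """
    mapping = {}
    for id_a, a in first.items():
        best_id, best_score = None, threshold
        for id_b, b in second.items():
            score = (text_weight * jaccard(a["title"], b["title"])
                     + (1 - text_weight) * cosine(a["image_vec"], b["image_vec"]))
            if score >= best_score:
                best_id, best_score = id_b, score
        if best_id is not None:
            mapping[id_a] = best_id
    return mapping


first = {"a1": {"title": "red running shoe", "image_vec": [1.0, 0.0]}}
second = {
    "b1": {"title": "running shoe red", "image_vec": [0.9, 0.1]},
    "b2": {"title": "blue kettle", "image_vec": [0.0, 1.0]},
}
mapping = map_listings(first, second)
```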
Abstract:
Methods and systems for item matching are described. In one embodiment, compatibility-based text for an item may be accessed. A compatibility identifier may be identified based on the compatibility-based text. The compatibility identifier may be associated with an item cluster. The compatibility identifier may be used to identify a plurality of matching items. A result may be provided based on identification of the plurality of matching items. Additional methods and systems are disclosed.
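A simple way to picture the lookup is to canonicalize the compatibility-based text into a key and use it to fetch an item cluster. The key format, the function names, and the cluster table below are assumptions made for the example; the abstract does not specify how the identifier is derived.

```python
import re


def compatibility_identifier(text):
    """Derive a canonical key: lowercase alphanumeric tokens joined by hyphens."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return "-".join(tokens)


def find_matches(text, clusters):
    """Return the item ids in the cluster keyed by the text's identifier.

    clusters is a hypothetical dict mapping identifier -> list of item ids.
    """
    return clusters.get(compatibility_identifier(text), [])


clusters = {"battery-compatible-with-nokia-6300": ["item-1", "item-2"]}
matches = find_matches("Battery compatible with Nokia 6300", clusters)
```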
Abstract:
Techniques for optimizing the performance of a webpage crawler are described. According to various embodiments, historical web crawler performance data is accessed, the data describing a performance of a web crawler during various time periods in one or more prior days. A capacity of the web crawler to fulfil uniform resource locator (URL) crawl requests for an upcoming given time period is then estimated, based on the historical web crawler performance data. Thereafter, a plurality of URL crawl requests are distributed to the web crawler during the upcoming given time period, based on the estimated capacity of the web crawler.
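The capacity estimate and request distribution might look roughly like the sketch below. It assumes the historical data is a list of per-day dicts mapping an hour of day to the number of URLs the crawler completed, and it estimates capacity as the mean for the same hour over prior days. Both the data layout and the averaging rule are assumptions; the abstract does not fix either.

```python
from statistics import mean


def estimate_capacity(history, hour):
    """Estimate how many URLs the crawler can handle in the given hour.

    history is a hypothetical list of dicts, one per prior day, mapping
    hour of day -> URLs completed. Days with no sample for the hour are
    skipped; with no samples at all the estimate is 0.
    """
    samples = [day[hour] for day in history if hour in day]
    return int(mean(samples)) if samples else 0


def dispatch(urls, history, hour):
    """Split pending URLs into (send_now, deferred) using the estimate."""
    capacity = estimate_capacity(history, hour)
    return urls[:capacity], urls[capacity:]


history = [{9: 100, 10: 80}, {9: 120, 10: 60}]
pending = [f"https://example.com/page/{i}" for i in range(150)]
send_now, deferred = dispatch(pending, history, hour=9)
```

Deferred requests would be reconsidered in the next time period, where a fresh estimate applies.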