Abstract:
A computer-implemented system and method for providing information tagging in a networked system are disclosed. In an example embodiment, the apparatus includes a tag engine configured to process a database of categorized product listings; to receive a user-provided tag associated with at least one of the product listings; to retain the user-provided tag; and to serve the user-provided tag to a user viewing at least one of the product listings.
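The four tag-engine duties named above (process listings, receive a tag, retain it, serve it) can be sketched as follows. This is only an illustrative in-memory sketch: the class name `TagEngine`, its methods, and the listing IDs are assumptions, not names from the abstract, and tag normalization is an added convenience.

```python
from collections import defaultdict


class TagEngine:
    """Hypothetical in-memory tag engine; all names are illustrative."""

    def __init__(self, listings):
        # listings: dict mapping listing_id -> category name
        self.listings = dict(listings)
        self.tags = defaultdict(list)  # listing_id -> retained tags

    def add_tag(self, listing_id, tag):
        """Receive a user-provided tag and retain it for the listing."""
        if listing_id not in self.listings:
            raise KeyError(f"unknown listing: {listing_id}")
        normalized = tag.strip().lower()  # assumption: tags are case-insensitive
        if normalized and normalized not in self.tags[listing_id]:
            self.tags[listing_id].append(normalized)

    def tags_for(self, listing_id):
        """Serve the retained tags to a user viewing the listing."""
        return list(self.tags.get(listing_id, []))


engine = TagEngine({"A1": "Cameras", "B2": "Lenses"})
engine.add_tag("A1", "Waterproof ")
engine.add_tag("A1", "waterproof")  # duplicate after normalization, ignored
engine.add_tag("A1", "travel")
print(engine.tags_for("A1"))  # ['waterproof', 'travel']
```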
Abstract:
A system and method provide product research help to a consumer shopping on the internet. A consumer helper application for internet commerce integrates the techniques currently used by many internet consumers into a single application program. For example, the consumer helper application allows a user to take notes, store web site links, store web site annotations, obtain collaborative input, and perform other tasks commonly used by web site consumers. The consumer helper application stores all of the gathered information in a single place. Information provided by other consumers may be shared using the consumer helper application. The consumer helper application suggests specific items for a consumer to purchase based on the information the consumer has gathered into the application.
Abstract:
A method of forming parallel corpora comprises receiving sets of items in a first language and a second language, each of the sets having one or more associated descriptions and metadata. The metadata is collected from the two sets of items and aligned. The aligned metadata is mapped from the first language to the second language for each of the sets. The descriptions of two items are fetched, and the structural similarity of the descriptions is measured to assess whether the two items are likely to be translations of each other. For mapped items with structurally similar descriptions, the mapped item descriptions are formed into respective sentences in the first language and in the second language. The sentences form parallel corpora, which may be used to translate an item from the first language to the second language and to train a machine translation system.
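The abstract does not say how structural similarity is measured. One possible sketch, under the assumption that numbers and punctuation tend to survive translation while words do not, reduces each description to a language-independent signature and compares the signatures with `difflib.SequenceMatcher`. The function names and sample descriptions are hypothetical.

```python
import re
from difflib import SequenceMatcher


def signature(description):
    """Reduce a description to structure tokens: numbers, punctuation, word runs."""
    sig = []
    for token in re.findall(r"\d+|\w+|[^\w\s]", description):
        if token.isdigit():
            sig.append(token)          # keep numbers verbatim
        elif re.match(r"\w", token):
            if not sig or sig[-1] != "W":
                sig.append("W")        # collapse a run of words into one marker
        else:
            sig.append(token)          # keep punctuation verbatim
    return sig


def structural_similarity(a, b):
    """Ratio in [0, 1]; 1.0 means the two signatures are identical."""
    return SequenceMatcher(None, signature(a), signature(b)).ratio()


english = "Phone case, 2 pieces. Fits model 12."
german = "Handyhülle, 2 Stück. Passt für Modell 12."
unrelated = "Brand new, free shipping!"

print(structural_similarity(english, german))
print(structural_similarity(english, unrelated))
```

Collapsing word runs keeps the score from depending on how many words each language needs for the same phrase; a real system would likely combine this with the metadata alignment described above.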
Abstract:
In various example embodiments, a system and method to provide a query linguistic service are provided. An initial query term set is received. Phrase recognition is performed on the initial query term set to determine recognized phrases. Using the recognized phrases, one or more synonyms for each of the recognized phrases are determined. Results matching the initial query term set and any synonyms selected from the determined synonyms are then determined.
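The pipeline above (phrase recognition, synonym lookup, matching) might look like the following minimal sketch. The phrase dictionary, the synonym table, the greedy bigram recognizer, and the substring matching are all assumptions made for illustration; every synonym is treated as selected.

```python
# Hypothetical dictionaries; a real service would load these from data.
PHRASES = {("laptop", "bag"), ("cell", "phone")}
SYNONYMS = {
    "laptop bag": ["notebook case"],
    "cell phone": ["mobile phone", "cellphone"],
}


def recognize_phrases(terms):
    """Greedy left-to-right recognition of two-word phrases."""
    phrases, i = [], 0
    while i < len(terms):
        if i + 1 < len(terms) and (terms[i], terms[i + 1]) in PHRASES:
            phrases.append(f"{terms[i]} {terms[i + 1]}")
            i += 2
        else:
            phrases.append(terms[i])
            i += 1
    return phrases


def expand(terms):
    """Map each recognized phrase to its synonyms (possibly none)."""
    return {p: SYNONYMS.get(p, []) for p in recognize_phrases(terms)}


def search(terms, documents):
    """Return documents containing every phrase or one of its synonyms."""
    expansion = expand(terms)
    results = []
    for doc in documents:
        text = doc.lower()
        if all(any(v in text for v in [p, *syns]) for p, syns in expansion.items()):
            results.append(doc)
    return results


docs = ["Red notebook case with strap", "Leather laptop bag", "Red laptop stand"]
print(search(["laptop", "bag"], docs))
# ['Red notebook case with strap', 'Leather laptop bag']
```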
Abstract:
An item record in an item database contains an item description generated by a seller of an item. A server machine is configured to access the item database, analyze the item description, and extract descriptive information by inferring an attribute and a corresponding attribute value from the item description. The attribute and its attribute value constitute an attribute-value pair. The server machine uses the attribute-value pair to map the item record to a product record stored in a product database. The mapping of the item record to the product record is based on comparing the attribute-value pair of the item record to a reference attribute-value pair in the product record to identify the product record. The mapping is performed upon detection of a match between the attribute-value pairs.
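As a rough sketch of the mapping described above, the example below infers attribute-value pairs from a seller's description with regular expressions and maps the item to the first product whose reference pairs all match. The patterns, attribute names, and product IDs are hypothetical, and regex extraction is only one simple stand-in for the inference step.

```python
import re

# Hypothetical extraction patterns; each captures the attribute value in group 1.
ATTRIBUTE_PATTERNS = {
    "brand": re.compile(r"\b(canon|nikon|sony)\b", re.IGNORECASE),
    "storage": re.compile(r"\b(\d+)\s?gb\b", re.IGNORECASE),
}


def extract_pairs(description):
    """Infer attribute-value pairs from a free-text item description."""
    pairs = set()
    for attribute, pattern in ATTRIBUTE_PATTERNS.items():
        match = pattern.search(description)
        if match:
            pairs.add((attribute, match.group(1).lower()))
    return pairs


def map_to_product(description, products):
    """Return the id of the first product whose reference pairs all match."""
    item_pairs = extract_pairs(description)
    for product_id, reference_pairs in products.items():
        if reference_pairs and reference_pairs <= item_pairs:
            return product_id
    return None  # no match detected, so no mapping is made


products = {
    "P1": {("brand", "sony"), ("storage", "64")},
    "P2": {("brand", "canon")},
}
print(map_to_product("Brand new SONY player, 64 GB, boxed", products))  # P1
```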
Abstract:
In some embodiments, a method includes receiving an electronic document that comprises a plurality of sections. The method includes marking each of the plurality of sections as a content section or a non-content section using an attribute of the section that includes at least one of a width of the section, a density of hyperlinks in the section, a font size of text in the section, and whether a title of the electronic document overlaps with text in the section. The method also includes storing the marking of the plurality of sections of the electronic document in a machine-readable medium.
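A minimal sketch of such marking follows. The abstract only requires at least one of the four attributes; combining all four in a simple majority vote, the threshold values, and the `Section` fields are assumptions chosen for illustration. The markings are serialized to JSON as a stand-in for storage in a machine-readable medium.

```python
import json
from dataclasses import dataclass


@dataclass
class Section:
    text: str
    width_px: int
    link_count: int
    font_size_pt: float


def mark_section(section, title, min_width=400, max_link_density=0.2, min_font=10.0):
    """Mark a section using width, link density, font size, and title overlap."""
    words = section.text.lower().split()
    link_density = section.link_count / max(len(words), 1)  # links per word
    title_overlap = bool(set(title.lower().split()) & set(words))
    votes = [
        section.width_px >= min_width,
        link_density <= max_link_density,
        section.font_size_pt >= min_font,
        title_overlap,
    ]
    return "content" if sum(votes) >= 3 else "non-content"


title = "New camera ships in May"
sections = {
    "body": Section("The new camera ships in May with a larger sensor", 640, 0, 12.0),
    "nav": Section("Home News Sports Login", 160, 4, 9.0),
}
markings = {name: mark_section(s, title) for name, s in sections.items()}
stored = json.dumps(markings)  # stand-in for persisting the markings
print(stored)
```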
Abstract:
An apparatus and method for predicting a brand name of a product are disclosed herein. A product identification number for the product is converted into a normalized global trade item number (GTIN). For each of a plurality of GTIN prefixes corresponding to the normalized GTIN, brand names and a count of each brand name are identified using product information stored in a product catalog. A probability distribution of the brand names is determined from the brand names and their counts across the plurality of GTIN prefixes. A predicted brand name for the product is identified from among the brand names for the plurality of GTIN prefixes, the predicted brand name having the highest probability score in the probability distribution.
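The steps above can be sketched as follows, assuming normalization means left-padding to 14 digits, prefixes of 7 to 10 digits, and an unweighted sum of counts across prefixes. None of those choices comes from the abstract, the catalog entries are made up, and check digits are not validated.

```python
from collections import Counter


def normalize_gtin(product_id):
    """Strip non-digits and left-pad to a 14-digit GTIN (assumed normalization)."""
    digits = "".join(ch for ch in product_id if ch in "0123456789")
    if not digits or len(digits) > 14:
        raise ValueError(f"not a GTIN-compatible number: {product_id!r}")
    return digits.zfill(14)


def predict_brand(product_id, catalog, prefix_lengths=(7, 8, 9, 10)):
    """Return (predicted brand, probability distribution over brands)."""
    gtin = normalize_gtin(product_id)
    counts = Counter()
    for length in prefix_lengths:
        prefix = gtin[:length]
        for catalog_gtin, brand in catalog:
            if normalize_gtin(catalog_gtin).startswith(prefix):
                counts[brand] += 1  # one count per (prefix, catalog entry) hit
    total = sum(counts.values())
    if total == 0:
        return None, {}
    distribution = {brand: count / total for brand, count in counts.items()}
    return max(distribution, key=distribution.get), distribution


# Hypothetical catalog of (GTIN, brand) pairs.
catalog = [
    ("0012345678905", "Acme"),
    ("0012345600012", "Acme"),
    ("0012345900011", "Bolt"),
]
brand, distribution = predict_brand("012345678912", catalog)
print(brand, distribution)
```

In this sketch longer prefixes are not weighted more heavily than shorter ones; a weighting scheme would be a natural refinement.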
Abstract:
A method and a system are described for catalog generation based on divergent listings. For example, a listing module may receive a divergent listing and collect data corresponding to an item in the listing. Each divergent listing of an item for sale is divergent with respect to other listings of the same item for sale. A calculation module may calculate an entropy value based on the collected data, the entropy value indicating a degree of difference between the item in the listing and a product in a catalog. A catalog module may associate the listing to the product in the catalog based on the entropy value.
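The abstract does not define how the entropy value is calculated. As one hedged illustration, the sketch below uses the cross-entropy of the listing's tokens under a smoothed unigram distribution built from each catalog product's description, where a lower value indicates a smaller difference. The function names, the smoothing constant, and the sample data are assumptions.

```python
import math
from collections import Counter


def cross_entropy(listing_tokens, product_tokens, alpha=1.0):
    """Average bits per listing token under the product's add-alpha unigram model."""
    if not listing_tokens:
        raise ValueError("listing has no tokens")
    vocab = set(listing_tokens) | set(product_tokens)
    counts = Counter(product_tokens)
    denom = len(product_tokens) + alpha * len(vocab)
    bits = -sum(math.log2((counts[t] + alpha) / denom) for t in listing_tokens)
    return bits / len(listing_tokens)


def associate(listing, catalog):
    """Associate the listing with the catalog product of lowest entropy value."""
    tokens = listing.lower().split()
    scores = {
        product_id: cross_entropy(tokens, description.lower().split())
        for product_id, description in catalog.items()
    }
    return min(scores, key=scores.get), scores


catalog = {
    "P1": "apple iphone 12 64gb black",
    "P2": "samsung galaxy s21 128gb gray",
}
best, scores = associate("iPhone 12 black 64GB unlocked", catalog)
print(best, scores)
```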