-
Publication number: US20210248172A1
Publication date: 2021-08-12
Application number: US17245406
Application date: 2021-04-30
Applicant: eBay Inc.
Inventor: Daniel Lee Hurwitz, Ido Guy
Abstract: Methods, systems, and media for lot classification are disclosed. In one example, a classification system for identifying lot listings receives a description for a listing in a publication system, identifies a string in the listing, identifies a quantity word or digit in the string, and converts an identified quantity word into digit form. A normalized string is tokenized to produce tokens, the tokenizing of the normalized string including splitting the normalized string into a series of substrings using a sequence of delimiters. For each substring, an additional split is performed by separating any digit from any other adjacent character, unless that character is another digit, and maintaining an internal character order of each split substring to produce a flattened list of tokenized tokens.
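The tokenization steps described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the quantity-word table, the delimiter sequence, the lowercasing step, and all function names are assumptions, since the abstract does not specify them.

```python
import re

# Hypothetical quantity-word table; the abstract does not list which words are covered.
QUANTITY_WORDS = {
    "one": "1", "two": "2", "three": "3", "four": "4", "five": "5",
    "six": "6", "seven": "7", "eight": "8", "nine": "9", "ten": "10",
}

# Assumed delimiter sequence, applied one delimiter at a time, in order.
DELIMITERS = [" ", ",", "/", "-"]


def normalize(text):
    """Lowercase the string (an assumption) and rewrite quantity words as digits."""
    return re.sub(
        r"[a-z]+",
        lambda m: QUANTITY_WORDS.get(m.group(), m.group()),
        text.lower(),
    )


def split_on_delimiters(text, delimiters=DELIMITERS):
    """Split the string into substrings using each delimiter in turn."""
    parts = [text]
    for delim in delimiters:
        parts = [piece for part in parts for piece in part.split(delim) if piece]
    return parts


def split_digits(substring):
    """Separate digits from adjacent non-digit characters.

    Consecutive digits stay together, and the internal character
    order of the substring is preserved.
    """
    return re.findall(r"\d+|\D+", substring)


def tokenize(text):
    """Return the flattened token list for a listing string."""
    return [
        token
        for substring in split_on_delimiters(normalize(text))
        for token in split_digits(substring)
    ]


print(tokenize("Lot of Three 5pcs x10 cards"))
# ['lot', 'of', '3', '5', 'pcs', 'x', '10', 'cards']
```

Splitting digits from neighbouring letters means a fused form such as "5pcs" yields the same quantity token as "5 pcs", which is presumably what lets a downstream classifier treat both spellings alike.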
-
Publication number: US12001471B2
Publication date: 2024-06-04
Application number: US17245406
Application date: 2021-04-30
Applicant: eBay Inc.
Inventor: Daniel Lee Hurwitz, Ido Guy
CPC classification number: G06F16/358, G06F16/35, G06N20/00
Abstract: Methods, systems, and media for lot classification are disclosed. In one example, a classification system for identifying lot listings receives a description for a listing in a publication system, identifies a string in the listing, identifies a quantity word or digit in the string, and converts an identified quantity word into digit form. A normalized string is tokenized to produce tokens, the tokenizing of the normalized string including splitting the normalized string into a series of substrings using a sequence of delimiters. For each substring, an additional split is performed by separating any digit from any other adjacent character, unless that character is another digit, and maintaining an internal character order of each split substring to produce a flattened list of tokenized tokens.
-
Publication number: US11036780B2
Publication date: 2021-06-15
Application number: US15916207
Application date: 2018-03-08
Applicant: eBay Inc.
Inventor: Daniel Lee Hurwitz, Ido Guy
Abstract: Methods, systems, and media for lot classification are disclosed. In one example, a classification system for identifying lot listings receives a description for a listing in a publication system, identifies a string in the listing, identifies a quantity word or digit in the string, and converts an identified quantity word into digit form. A normalized string is tokenized to produce tokens, the tokenizing of the normalized string including splitting the normalized string into a series of substrings using a sequence of delimiters. For each substring, an additional split is performed by separating any digit from any other adjacent character, unless that character is another digit, and maintaining an internal character order of each split substring to produce a flattened list of tokenized tokens.
-
Publication number: US20240273129A1
Publication date: 2024-08-15
Application number: US18641994
Application date: 2024-04-22
Applicant: eBay Inc.
Inventor: Daniel Lee Hurwitz, Ido Guy
CPC classification number: G06F16/358, G06F16/35, G06N20/00
Abstract: Methods, systems, and media for lot classification are disclosed. In one example, a classification system for identifying lot listings receives a description for a listing in a publication system, identifies a string in the listing, identifies a quantity word or digit in the string, and converts an identified quantity word into digit form. A normalized string is tokenized to produce tokens, the tokenizing of the normalized string including splitting the normalized string into a series of substrings using a sequence of delimiters. For each substring, an additional split is performed by separating any digit from any other adjacent character, unless that character is another digit, and maintaining an internal character order of each split substring to produce a flattened list of tokenized tokens.
-
Publication number: US20190278865A1
Publication date: 2019-09-12
Application number: US15916207
Application date: 2018-03-08
Applicant: eBay Inc.
Inventor: Daniel Lee Hurwitz, Ido Guy
Abstract: Methods, systems, and media for lot classification are disclosed. In one example, a classification system for identifying lot listings receives a description for a listing in a publication system, identifies a string in the listing, identifies a quantity word or digit in the string, and converts an identified quantity word into digit form. A normalized string is tokenized to produce tokens, the tokenizing of the normalized string including splitting the normalized string into a series of substrings using a sequence of delimiters. For each substring, an additional split is performed by separating any digit from any other adjacent character, unless that character is another digit, and maintaining an internal character order of each split substring to produce a flattened list of tokenized tokens.