-
公开(公告)号:US10565519B2
公开(公告)日:2020-02-18
申请号:US14792585
申请日:2015-07-06
Applicant: Oath Inc.
Inventor: Thu Kyaw , Sang Chul Song , Vineet Mahajan , Elena Haliczer
Abstract: Computerized systems and methods are disclosed for performing contextual classification of objects using supervised and unsupervised training. In accordance with one implementation, content reviewers may review training objects and submit supervised training data for preprocessing and analysis. The supervised training data may be preprocessed to identify key terms and phrases, such as by stemming, tokenization, or n-gram analysis, and form vectorized objects. The vectorized objects may be used to train one or more models for subsequent classification of objects. In certain implementations, preprocessing or training, among other steps, may be performed in parallel over multiple machines to improve efficiency. The disclosed systems and methods may be used in a wide variety of applications, such as article classification and content moderation.