-
公开(公告)号:US10176500B1
公开(公告)日:2019-01-08
申请号:US13904419
申请日:2013-05-29
Applicant: A9.com, Inc.
Inventor: Shrinivas Mohan
IPC: G10L21/00 , G10L25/00 , G10L15/00 , G10L15/06 , G10L15/04 , G06K9/72 , G06K9/54 , G06K9/46 , G06K9/18 , G06K9/66 , G06K9/34 , G06K9/00 , G06Q10/00 , G06Q30/00 , G06F7/00 , G06F17/00 , G06F17/30 , G06F15/16 , G06Q30/02
Abstract: One or more content items can be received at a data recognition module. The data recognition module can utilize, individually or in any combination, image recognition (e.g., OCR, object recognition, etc.), audio recognition (e.g., speech recognition, music identification, etc.), and/or text recognition (e.g., text crawling) in order to identify or recognize at least a portion of the one or more content items. Based on the identified content portion(s), the one or more content items and/or their respective source(s) can be classified. In one example, an image containing a not yet machine-readable curse word can be included in a source webpage. The image can be received at the data recognition module. The curse word contained in the image can be recognized/identified using an OCR process. Based, at least in part, on the recognized/identified curse word, the image and/or the webpage can be classified as likely being associated with inappropriate material.