Invention Grant
- Patent Title: Content classification based on data recognition
-
Application No.: US13904419Application Date: 2013-05-29
-
Publication No.: US10176500B1Publication Date: 2019-01-08
- Inventor: Shrinivas Mohan
- Applicant: A9.com, Inc.
- Applicant Address: unknown Palo Alto
- Assignee: A9.COM, INC.
- Current Assignee: A9.COM, INC.
- Current Assignee Address: unknown Palo Alto
- Agency: Hogan Lovells US LLP
- Main IPC: G10L21/00
- IPC: G10L21/00 ; G10L25/00 ; G10L15/00 ; G10L15/06 ; G10L15/04 ; G06K9/72 ; G06K9/54 ; G06K9/46 ; G06K9/18 ; G06K9/66 ; G06K9/34 ; G06K9/00 ; G06Q10/00 ; G06Q30/00 ; G06F7/00 ; G06F17/00 ; G06F17/30 ; G06F15/16 ; G06Q30/02

Abstract:
One or more content items can be received at a data recognition module. The data recognition module can utilize, individually or in any combination, image recognition (e.g., OCR, object recognition, etc.), audio recognition (e.g., speech recognition, music identification, etc.), and/or text recognition (e.g., text crawling) in order to identify or recognize at least a portion of the one or more content items. Based on the identified content portion(s), the one or more content items and/or their respective source(s) can be classified. In one example, an image containing a not yet machine-readable curse word can be included in a source webpage. The image can be received at the data recognition module. The curse word contained in the image can be recognized/identified using an OCR process. Based, at least in part, on the recognized/identified curse word, the image and/or the webpage can be classified as likely being associated with inappropriate material.
Information query