Invention Application
- Patent Title: JOINT MULTIGRAM-BASED DETECTION OF SPELLING VARIANTS
- Patent Title (中): 联合多媒体基因检测发现变异
-
Application No.: US14468468Application Date: 2014-08-26
-
Publication No.: US20150234804A1Publication Date: 2015-08-20
- Inventor: Matthew Nicholas Stuttle , Alexander Gutkin
- Applicant: GOOGLE INC.
- Priority: IL230993 20140216
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G06N99/00 ; G06N7/00

Abstract:
Content processing includes receiving a set of a correctly spelled alert words and at least one spelling variant corresponding to each correctly spelled alert word; determining at least one alignment of joint multigrams for each correctly spelled alert word/corresponding spelling variant pair; training a model of correspondence between the set of received orthographic alert words and corresponding spelling variants using the determined alignments; and receiving a spelling variant observation from a content block. Using the trained model, the technology determines a probability that the received spelling variant observation corresponds to a received correctly spelled alert word. For a determined probability exceeding a configured threshold, the technology denies automatic acceptance of the content block.
Information query