- Patent title: Identifying corrupted text segments
- Application number: US16000306
- Filing date: 2018-06-05
- Publication number: US10318650B2
- Publication date: 2019-06-11
- Inventors: Chao Yuan Huang, Yi-Lin Tsai, Der-Joung Wang, Yen-Min Wu
- Applicant: International Business Machines Corporation
- Applicant address: US NY Armonk
- Patentee: International Business Machines Corporation
- Current patentee: International Business Machines Corporation
- Current patentee address: US NY Armonk
- Agent: Stephen R. Yoder
- Main classification: G06F17/30
- IPC classification: G06F17/30; G06F17/24; G06F17/27
Abstract:
A computer system for taking a corrective action upon determination of an existence of a corrupted text segment within a set of web pages. Determination includes: determining a language affinity indicator corresponding to text segments within the set of web pages; generating an indexing repository based on a set of text artifacts within the text segments; creating an occurrence table for the set of text artifacts; and determining compliance of the text artifacts and text segments based on the single language grouping on which the set of text segments are based.
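The steps named in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say how the language affinity indicator is computed or what "compliance" means in detail, so this sketch assumes a rough Unicode-script heuristic, treats whitespace-separated tokens as text artifacts, and flags an artifact when it contains the replacement character U+FFFD or a letter outside the dominant script. All function names (`script_of`, `language_affinity`, `build_index`, `find_corrupted`) are hypothetical.

```python
import re
import unicodedata
from collections import Counter, defaultdict


def script_of(ch):
    """Rough script label taken from the Unicode character name (assumption)."""
    try:
        return unicodedata.name(ch).split()[0]  # e.g. 'LATIN', 'CYRILLIC', 'CJK'
    except ValueError:
        return "UNKNOWN"


def language_affinity(segments):
    """Assumed affinity indicator: the most common script among all letters."""
    counts = Counter(
        script_of(ch) for seg in segments for ch in seg if ch.isalpha()
    )
    return counts.most_common(1)[0][0] if counts else "UNKNOWN"


def build_index(segments):
    """Build an indexing repository (artifact -> segment ids) and an occurrence table."""
    index = defaultdict(set)
    occurrences = Counter()
    for seg_id, seg in enumerate(segments):
        for artifact in re.findall(r"\S+", seg):  # artifacts = whitespace tokens
            index[artifact].add(seg_id)
            occurrences[artifact] += 1
    return index, occurrences


def find_corrupted(segments):
    """Return ids of segments holding at least one non-compliant artifact."""
    dominant = language_affinity(segments)
    index, _ = build_index(segments)
    flagged = set()
    for artifact, seg_ids in index.items():
        letters = [ch for ch in artifact if ch.isalpha()]
        non_compliant = "\ufffd" in artifact or any(
            script_of(ch) != dominant for ch in letters
        )
        if non_compliant:
            flagged |= seg_ids
    return sorted(flagged)


segments = [
    "Welcome to the product page",
    "Click here to download",
    "Pri\ufffdes are listed below",   # contains a replacement character
    "Спасибо for visiting",           # Cyrillic token in a Latin-script page set
]
corrupted = find_corrupted(segments)
```

With these sample segments, the third and fourth entries are the ones flagged. A real system would then take a corrective action on them; the abstract leaves that action unspecified, so the sketch stops at detection.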
Published/granted documents
- US20180285411A1 IDENTIFYING CORRUPTED TEXT SEGMENTS, publication/grant date: 2018-10-04