System and method to enhance phrase search with nested thesaurus parsing

    公开(公告)号:US11256691B1

    公开(公告)日:2022-02-22

    申请号:US15197307

    申请日:2016-06-29

    申请人: EMC Corporation

    摘要: In general, in one aspect, the invention relates to a method for servicing requests. The method includes receiving, from a client system, a request comprising a query, where the query includes a first plurality of terms. The method further includes generating, using a thesaurus library, a related query including a second plurality of terms, where at least one term in the second plurality of terms is present in the first plurality of terms. The method further includes issuing the query to a content repository to obtain a first result, issuing the related query to the content repository to obtain a second result, processing the first result and the second result to generate a final result, and providing the final result to the client system.

    METHOD AND SYSTEM FOR IDEOGRAM CHARACTER ANALYSIS

    公开(公告)号:US20170262474A1

    公开(公告)日:2017-09-14

    申请号:US15033309

    申请日:2015-09-30

    申请人: EMC Corporation

    摘要: Ideogram character analysis includes partitioning an original ideogram character into strokes, and mapping each stroke to a corresponding stroke identifier (id) to create an original stroke id sequence that includes stroke identifiers. A candidate ideogram character that has a candidate stroke id sequence within a threshold distance to the original stroke id sequence is selected. One or more embodiments may create new phrase by replacing the original ideogram character with the candidate ideogram character in a search phrase. One or more embodiments perform a search using the search phrase and the new phrase to obtain a result, and present the result. One or more embodiments may replace an original ideogram character in a character recognized document with the candidate ideogram character and store the character recognized document.