-
公开(公告)号:US10360248B1
公开(公告)日:2019-07-23
申请号:US15198897
申请日:2016-06-30
申请人: EMC Corporation
发明人: Lei Zhang , Chao Chen , Jingjing Liu , Kunwu Huang , Hongtao Dai , Ying Teng
摘要: In general, in one aspect, the invention relates to a method for servicing requests, the method includes receiving, from a client system, a first request comprising a query, determining a first user associated with the first request, modifying the query to obtain a first modified query, where the first modified query includes a first permission definition token associated with the first user, processing the modified query to obtain a first object from a content repository, and providing the first object to the client system.
-
公开(公告)号:US10241998B1
公开(公告)日:2019-03-26
申请号:US15197338
申请日:2016-06-29
申请人: EMC Corporation
发明人: Lei Zhang , Chao Chen , Jingjing Liu , Kunwu Huang , Hongtao Dai , Ying Teng
摘要: A method for tokenizing documents. The method includes obtaining a document comprising text to be tokenized, isolating a first string of consecutive characters in the document, searching, in a token tree, for an expression that matches the first string, making a determination that a matching expression exists in the token tree and, based on the determination, storing the matching expression as an extracted token.
-
公开(公告)号:US11256691B1
公开(公告)日:2022-02-22
申请号:US15197307
申请日:2016-06-29
申请人: EMC Corporation
发明人: Kunwu Huang , Lei Zhang , Chao Chen , Jingjing Liu , Hongtao Dai , Ying Teng
IPC分类号: G06F16/24 , G06F16/2453 , G06F16/248 , G06F16/9535 , G06F40/247 , G06F16/95
摘要: In general, in one aspect, the invention relates to a method for servicing requests. The method includes receiving, from a client system, a request comprising a query, where the query includes a first plurality of terms. The method further includes generating, using a thesaurus library, a related query including a second plurality of terms, where at least one term in the second plurality of terms is present in the first plurality of terms. The method further includes issuing the query to a content repository to obtain a first result, issuing the related query to the content repository to obtain a second result, processing the first result and the second result to generate a final result, and providing the final result to the client system.
-
公开(公告)号:US20170262474A1
公开(公告)日:2017-09-14
申请号:US15033309
申请日:2015-09-30
申请人: EMC Corporation
发明人: Chao Chen , Kunwu Huang , Hongtao Dai , Jingjing Liu
CPC分类号: G06F16/5846 , G06F16/9038 , G06F16/93 , G06F17/2223 , G06F17/24 , G06K9/00416 , G06K2209/011
摘要: Ideogram character analysis includes partitioning an original ideogram character into strokes, and mapping each stroke to a corresponding stroke identifier (id) to create an original stroke id sequence that includes stroke identifiers. A candidate ideogram character that has a candidate stroke id sequence within a threshold distance to the original stroke id sequence is selected. One or more embodiments may create new phrase by replacing the original ideogram character with the candidate ideogram character in a search phrase. One or more embodiments perform a search using the search phrase and the new phrase to obtain a result, and present the result. One or more embodiments may replace an original ideogram character in a character recognized document with the candidate ideogram character and store the character recognized document.
-
-
-