- 专利标题: Method and electronic device for generating semantic representation of document to determine data security risk
-
申请号: US17160369申请日: 2021-01-27
-
公开(公告)号: US11687647B2公开(公告)日: 2023-06-27
- 发明人: Madhusudana Shashanka , Bonnie Arogyam Varghese , Shankar Subramaniam , Karthik Krishnan , Rency Joseph
- 申请人: CONCENTRIC SOFTWARE, INC.
- 申请人地址: US CA San Jose
- 专利权人: CONCENTRIC SOFTWARE, INC.
- 当前专利权人: CONCENTRIC SOFTWARE, INC.
- 当前专利权人地址: US CA Saratoga
- 主分类号: G06F21/55
- IPC分类号: G06F21/55
摘要:
A method and an electronic device (100) are disclosed for generating semantic representation of a document to determine data security risk associated with the document. The method includes receiving, by a document semantics controller (160) of the electronic device (100), a document in an electronic form and determining, by the document semantics controller (160), raw text. Further, the method includes generating, by the document semantics controller (160), a plurality of sentence blocks using the raw text and determining, by the document semantics controller (160), embeddings for the plurality of sentence blocks. Further, the method includes determining, by the document semantics controller (160), the semantic representation of the document based on the embeddings for each of the sentence blocks; and generating, by the document semantics controller (160), the semantic representation of the document to determine the data security risk associated with the document.
公开/授权文献
信息查询