-
Publication number: US20190392250A1
Publication date: 2019-12-26
Application number: US16281501
Filing date: 2019-02-21
Applicant: NETAPP, INC.
Inventor: Deepti Aggarwal, Jayanta Basak, Siddhartha Nandi
Abstract: Methods and systems for document classification are provided. One method includes generating, by a processor, a plurality of topics using content of a plurality of electronic documents, where each topic includes a plurality of words associated with the plurality of electronic documents; reducing, by the processor, the plurality of topics to a subset of topics to represent the plurality of electronic documents based on a parameter indicating a property of each subset topic and separation between the subset topics; automatically generating, by the processor, a tag for each subset topic based on the tag's position within the subset topic, wherein each tag is an attribute of each subset topic; storing, by the processor, the subset of topics with corresponding tags in a model data structure; and updating, by the processor, the model data structure based on one of a new topic and a new tag associated with an electronic document.
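Illustrative sketch (not taken from the patent): the abstract does not say how topics are scored, how separation is measured, or which position yields the tag. The Python below assumes each topic is a word list ordered by weight, uses a caller-supplied `quality` score as the "property" parameter, uses Jaccard distance as the separation measure, and takes the first word as the tag. All function names and the sample data are hypothetical.

```python
from typing import Dict, List

Topic = List[str]  # assumption: words ordered by descending weight in the topic


def jaccard_distance(a: Topic, b: Topic) -> float:
    """Separation between two non-empty topics: 1 - |A & B| / |A | B|."""
    sa, sb = set(a), set(b)
    return 1.0 - len(sa & sb) / len(sa | sb)


def reduce_topics(topics: List[Topic], quality: List[float],
                  min_separation: float) -> List[int]:
    """Greedily keep the best-scoring topics that stay far enough apart."""
    order = sorted(range(len(topics)), key=lambda i: quality[i], reverse=True)
    kept: List[int] = []
    for i in order:
        if all(jaccard_distance(topics[i], topics[j]) >= min_separation
               for j in kept):
            kept.append(i)
    return kept


def tag_for(topic: Topic) -> str:
    """Choose the tag by position: here, the highest-ranked word."""
    return topic[0]


def build_model(topics: List[Topic], quality: List[float],
                min_separation: float = 0.5) -> Dict[str, Topic]:
    """Model data structure: tag -> topic words, for the reduced subset only."""
    return {tag_for(topics[i]): topics[i]
            for i in reduce_topics(topics, quality, min_separation)}


# Hypothetical topics and quality scores, for illustration only.
topics = [
    ["invoice", "payment", "vendor", "amount"],
    ["payment", "invoice", "amount", "due"],
    ["server", "disk", "latency", "volume"],
]
quality = [0.9, 0.7, 0.8]
model = build_model(topics, quality)
print(model)
```

In this sample the second topic shares three of five distinct words with the first (Jaccard distance 0.4), so it falls below the 0.5 separation threshold and is dropped; the model keeps the topics tagged `invoice` and `server`.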
-
Publication number: US20230066617A1
Publication date: 2023-03-02
Application number: US17980446
Filing date: 2022-11-03
Applicant: NETAPP, INC.
Inventor: Deepti Aggarwal, Jayanta Basak
IPC: G06F21/62
Abstract: Methods and systems for securing unstructured data are provided. One method includes generating, by a processor, a schema from unstructured data, the schema including one or more relationships between named entities of the unstructured data; identifying, by the processor, a plurality of semantic relationships between the named entities; determining, by the processor, a sensitive relationship from the plurality of semantic relationships; and anonymizing, by the processor, sensitive data associated with the sensitive relationship by replacing a first portion of the sensitive data with generalized information.
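Illustrative sketch (not taken from the patent): the abstract does not describe how entities are extracted, how sensitivity is decided, or what form the generalized information takes. The Python below assumes the named entities and their relationships have already been extracted upstream, treats a caller-supplied set of predicate labels as the sensitive relationships, and generalizes by replacing the subject entity with its entity type. The `Relation` type, the `anonymize` function, and the sample data are all hypothetical.

```python
from typing import List, NamedTuple, Set


class Relation(NamedTuple):
    subject: str       # named entity text as it appears in the document
    subject_type: str  # entity type, e.g. "PERSON" (assumed to come from NER)
    predicate: str     # semantic relationship label
    obj: str           # related named entity text


def anonymize(text: str, relations: List[Relation], sensitive: Set[str]) -> str:
    """Replace the subject of every sensitive relation with its entity type."""
    for rel in relations:
        if rel.predicate in sensitive:
            # Generalize: "Alice Smith" becomes "[PERSON]" throughout the text.
            text = text.replace(rel.subject, f"[{rel.subject_type}]")
    return text


# Hypothetical document and extracted relations, for illustration only.
text = "Alice Smith was diagnosed with diabetes. Alice Smith works at Acme."
relations = [
    Relation("Alice Smith", "PERSON", "diagnosed_with", "diabetes"),
    Relation("Alice Smith", "PERSON", "works_at", "Acme"),
]
result = anonymize(text, relations, {"diagnosed_with"})
print(result)
```

Only one part of the sensitive pair is generalized here (the person, not the diagnosis), which mirrors the abstract's "first portion" wording; which portion to generalize is a design choice of this sketch.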
-
Publication number: US10970595B2
Publication date: 2021-04-06
Application number: US16281501
Filing date: 2019-02-21
Applicant: NETAPP, INC.
Inventor: Deepti Aggarwal, Jayanta Basak, Siddhartha Nandi
Abstract: Methods and systems for document classification are provided. One method includes generating, by a processor, a plurality of topics using content of a plurality of electronic documents, where each topic includes a plurality of words associated with the plurality of electronic documents; reducing, by the processor, the plurality of topics to a subset of topics to represent the plurality of electronic documents based on a parameter indicating a property of each subset topic and separation between the subset topics; automatically generating, by the processor, a tag for each subset topic based on the tag's position within the subset topic, wherein each tag is an attribute of each subset topic; storing, by the processor, the subset of topics with corresponding tags in a model data structure; and updating, by the processor, the model data structure based on one of a new topic and a new tag associated with an electronic document.
-
Publication number: US11520929B2
Publication date: 2022-12-06
Application number: US17122892
Filing date: 2020-12-15
Applicant: NETAPP, INC.
Inventor: Deepti Aggarwal, Jayanta Basak
Abstract: Methods and systems for securing unstructured data are provided. One method includes generating, by a processor, a schema from unstructured data, the schema including one or more relationships between named entities of the unstructured data; identifying, by the processor, a plurality of semantic relationships between the named entities; determining, by the processor, a sensitive relationship from the plurality of semantic relationships; and anonymizing, by the processor, sensitive data associated with the sensitive relationship by replacing a first portion of the sensitive data with generalized information.
-
Publication number: US12045374B2
Publication date: 2024-07-23
Application number: US17980446
Filing date: 2022-11-03
Applicant: NETAPP, INC.
Inventor: Deepti Aggarwal, Jayanta Basak
CPC classification number: G06F21/6254
Abstract: Methods and systems for securing unstructured data are provided. One method includes generating, by a processor, a schema from unstructured data, the schema including one or more relationships between named entities of the unstructured data; identifying, by the processor, a plurality of semantic relationships between the named entities; determining, by the processor, a sensitive relationship from the plurality of semantic relationships; and anonymizing, by the processor, sensitive data associated with the sensitive relationship by replacing a first portion of the sensitive data with generalized information.
-
Publication number: US20220188454A1
Publication date: 2022-06-16
Application number: US17122892
Filing date: 2020-12-15
Applicant: NETAPP, INC.
Inventor: Deepti Aggarwal, Jayanta Basak
IPC: G06F21/62
Abstract: Methods and systems for securing unstructured data are provided. One method includes generating, by a processor, a schema from unstructured data, the schema including one or more relationships between named entities of the unstructured data; identifying, by the processor, a plurality of semantic relationships between the named entities; determining, by the processor, a sensitive relationship from the plurality of semantic relationships; and anonymizing, by the processor, sensitive data associated with the sensitive relationship by replacing a first portion of the sensitive data with generalized information.