-
Publication No.: US11163840B2
Publication Date: 2021-11-02
Application No.: US15988526
Filing Date: 2018-05-24
Applicant: Open Text SA ULC
Inventor: Martin Brousseau , Steve Pettigrew
IPC: G06F17/00 , G06F16/9535 , G06F16/955 , G06F40/30 , G06F40/295
Abstract: A source content processor receives content from a crawler and calls a text mining engine. The text mining engine mines the content and provides metadata about the content. The source content processor applies a source content filtering rule to the content utilizing the metadata from the text mining engine. The source content filtering rule is previously built based on at least one of a named entity, a category, or a sentiment. The source content processor determines whether to persist the content according to a result from applying the source content filtering rule to the content and either stores the content in a data store or deletes the content from the data ingestion pipeline such that the content is not persisted anywhere. Embodiments disclosed herein can significantly reduce the amount of irrelevant content passing through the data ingestion pipeline, prior to data persistence.
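The flow in this abstract (crawl, mine, apply a rule, then persist or drop) can be illustrated with a minimal Python sketch. It is not the patented implementation: `Metadata`, `toy_text_mining_engine`, `entity_rule`, and `process_content` are hypothetical names, the keyword matching only stands in for a real text mining engine, and a plain dictionary stands in for the data store.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Set, Tuple


@dataclass
class Metadata:
    """Metadata a text mining engine might return (hypothetical shape)."""
    entities: Set[str] = field(default_factory=set)
    categories: Set[str] = field(default_factory=set)
    sentiment: str = "neutral"


def toy_text_mining_engine(text: str) -> Metadata:
    """Stand-in for the text mining engine: naive keyword matching only."""
    lowered = text.lower()
    entities = {name for name in ("Open Text", "Acme") if name.lower() in lowered}
    categories = {"finance"} if "revenue" in lowered else set()
    if "terrible" in lowered:
        sentiment = "negative"
    elif "great" in lowered:
        sentiment = "positive"
    else:
        sentiment = "neutral"
    return Metadata(entities, categories, sentiment)


# A filtering rule is modeled as a predicate over the mined metadata.
Rule = Callable[[Metadata], bool]


def entity_rule(required_entity: str) -> Rule:
    """Build a rule that keeps content mentioning the given named entity."""
    return lambda metadata: required_entity in metadata.entities


def process_content(
    doc_id: str,
    text: str,
    rule: Rule,
    data_store: Dict[str, Tuple[str, Metadata]],
) -> bool:
    """Mine the content, apply the rule, and persist only on a match.

    Returns True if the content was stored. Content that fails the rule is
    never written anywhere, so it is dropped before persistence.
    """
    metadata = toy_text_mining_engine(text)
    if rule(metadata):
        data_store[doc_id] = (text, metadata)
        return True
    return False


store: Dict[str, Tuple[str, Metadata]] = {}
rule = entity_rule("Open Text")
process_content("a", "Open Text reported great revenue", rule, store)
process_content("b", "Weather was terrible today", rule, store)
```

In this sketch only document `a` ends up in `store`; rules based on a category or a sentiment could be written as further predicates over the same `Metadata` object.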
-
22.
Publication No.: US10902095B2
Publication Date: 2021-01-26
Application No.: US16658929
Filing Date: 2019-10-21
Applicant: Open Text SA ULC
Inventor: Alexander Lilko , Martin Brousseau
Abstract: To resolve a conflict between CMIS secondary types and certain ECM features such as content server categories, and allow the underlying ECM system to be fully CMIS-compliant, an ECM-independent ETL tool comprising a CMIS-compliant, repository-specific connector is provided. Operating on an integration services server at an integration tier between an application tier and a storage tier where the repository resides, the connector is particularly configured to support CMIS secondary types and is specific to the repository. On startup, the connector can import any category definition from the repository. The category definition contains properties associated with a category in the repository. When the category is attached to a document, the properties are viewable via a special category object type and a category identifier for the category. Any application can be adapted to leverage the ECM-independent ETL tool disclosed herein.
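The connector behavior described in this abstract (import category definitions on startup, then expose an attached category's properties through a category object type and category identifier) might be sketched as follows. This is an illustration under stated assumptions, not the patented connector and not a real CMIS client API: `CategoryDefinition`, `FakeRepository`, `Connector`, and the `"category"` object type name are all hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Tuple


@dataclass(frozen=True)
class CategoryDefinition:
    """A category definition and the property names it declares."""
    category_id: str
    name: str
    properties: Tuple[str, ...]


class FakeRepository:
    """In-memory stand-in for the ECM repository (hypothetical)."""

    def __init__(
        self,
        definitions: List[CategoryDefinition],
        attachments: Dict[str, Dict[str, Dict[str, Any]]],
    ) -> None:
        self._definitions = definitions
        # attachments maps doc_id -> category_id -> property values
        self._attachments = attachments

    def list_category_definitions(self) -> List[CategoryDefinition]:
        return list(self._definitions)

    def attached_categories(self, doc_id: str) -> Dict[str, Dict[str, Any]]:
        return self._attachments.get(doc_id, {})


class Connector:
    """Repository-specific connector that imports definitions on startup."""

    CATEGORY_OBJECT_TYPE = "category"  # hypothetical object type name

    def __init__(self, repository: FakeRepository) -> None:
        self._repository = repository
        # "On startup": import every category definition once.
        self._definitions = {
            d.category_id: d for d in repository.list_category_definitions()
        }

    def category_view(self, doc_id: str, category_id: str) -> Dict[str, Any]:
        """Return an attached category's properties for viewing."""
        definition = self._definitions[category_id]
        attached = self._repository.attached_categories(doc_id)
        if category_id not in attached:
            raise KeyError(f"{category_id} is not attached to {doc_id}")
        values = attached[category_id]
        return {
            "objectType": self.CATEGORY_OBJECT_TYPE,
            "categoryId": category_id,
            "properties": {p: values.get(p) for p in definition.properties},
        }


repo = FakeRepository(
    [CategoryDefinition("cat-1", "Invoice", ("vendor", "amount"))],
    {"doc-1": {"cat-1": {"vendor": "Acme", "amount": 120}}},
)
connector = Connector(repo)
view = connector.category_view("doc-1", "cat-1")
```

Here `view` carries the category identifier and only the properties declared in the imported definition, which is the general shape an application on the other side of the connector would consume.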
-
23.
Publication No.: US10073956B2
Publication Date: 2018-09-11
Application No.: US15471823
Filing Date: 2017-03-28
Applicant: Open Text SA ULC
Inventor: Alexander Lilko , Martin Brousseau
CPC classification number: G06F21/10 , G06F16/254 , G06F21/604 , G06F21/6218 , H04L63/101
Abstract: To resolve a conflict between CMIS secondary types and certain ECM features such as content server categories, and allow the underlying ECM system to be fully CMIS-compliant, an ECM-independent ETL tool comprising a CMIS-compliant, repository-specific connector is provided. Operating on an integration services server at an integration tier between an application tier and a storage tier where the repository resides, the connector is particularly configured to support CMIS secondary types and is specific to the repository. On startup, the connector can import any category definition from the repository. The category definition contains properties associated with a category in the repository. When the category is attached to a document, the properties are viewable via a special category object type and a category identifier for the category. Any application can be adapted to leverage the ECM-independent ETL tool disclosed herein.
-
-