INTEGRATION SERVICES SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR ECM-INDEPENDENT ETL TOOLS

    Publication number: US20250156505A1

    Publication date: 2025-05-15

    Application number: US19023301

    Filing date: 2025-01-16

    Abstract: To resolve a conflict between CMIS secondary types and certain ECM features such as content server categories, and to allow the underlying ECM system to be fully CMIS-compliant, an ECM-independent ETL tool comprising a CMIS-compliant, repository-specific connector is provided. Operating on an integration services server at an integration tier between an application tier and a storage tier where the repository resides, the connector is particularly configured to support CMIS secondary types and is specific to the repository. On startup, the connector can import any category definition from the repository. The category definition contains properties associated with a category in the repository. When the category is attached to a document, the properties are viewable via a special category object type and a category identifier for the category. Any application can be adapted to leverage the ECM-independent ETL tool disclosed herein.
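The startup import and category lookup described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `CATEGORY_TYPE_ID`, `CategoryDefinition`, `InMemoryRepository`, and `Connector`, and the dictionary layout of a document, are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Hypothetical identifier standing in for the "special category object type".
CATEGORY_TYPE_ID = "example:category"


@dataclass(frozen=True)
class CategoryDefinition:
    """Assumed shape of a category definition held by the repository."""
    category_id: str
    name: str
    property_names: Tuple[str, ...]


class InMemoryRepository:
    """Stand-in for a real repository; returns fixed category definitions."""

    def list_category_definitions(self) -> List[CategoryDefinition]:
        return [CategoryDefinition("cat-42", "Invoice", ("vendor", "amount"))]


class Connector:
    """Repository-specific connector that caches definitions on startup."""

    def __init__(self, repository: InMemoryRepository) -> None:
        self._repository = repository
        self._definitions: Dict[str, CategoryDefinition] = {}

    def start(self) -> None:
        # On startup, import every category definition from the repository.
        for definition in self._repository.list_category_definitions():
            self._definitions[definition.category_id] = definition

    def category_properties(self, document: dict, category_id: str) -> dict:
        # Properties become viewable through the category object type
        # plus the category identifier, but only if the category is attached.
        definition = self._definitions[category_id]
        attached = document.get("categories", {}).get(category_id)
        if attached is None:
            raise KeyError(f"category {category_id} is not attached")
        return {
            "type_id": CATEGORY_TYPE_ID,
            "category_id": category_id,
            "properties": {n: attached.get(n) for n in definition.property_names},
        }


connector = Connector(InMemoryRepository())
connector.start()
doc = {"name": "inv.pdf", "categories": {"cat-42": {"vendor": "Acme", "amount": 10}}}
view = connector.category_properties(doc, "cat-42")
```

In this sketch the application never touches repository-specific category structures directly; it asks the connector for a category view by identifier, which is the separation the abstract attributes to the integration tier.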

    Systems and methods for intelligent content filtering and persistence

    Publication number: US11803600B2

    Publication date: 2023-10-31

    Application number: US17510583

    Filing date: 2021-10-26

    CPC classification number: G06F16/9535 G06F16/9558 G06F40/295 G06F40/30

    Abstract: A source content processor receives content from a crawler and calls a text mining engine. The text mining engine mines the content and provides metadata about the content. The source content processor applies a source content filtering rule to the content utilizing the metadata from the text mining engine. The source content filtering rule is previously built based on at least one of a named entity, a category, or a sentiment. The source content processor determines whether to persist the content according to a result from applying the source content filtering rule to the content and either stores the content in a data store or deletes the content from the data ingestion pipeline such that the content is not persisted anywhere. Embodiments disclosed herein can significantly reduce the amount of irrelevant content passing through the data ingestion pipeline, prior to data persistence.
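The filter-then-persist decision described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: `Metadata`, `FilterRule`, and `process` are hypothetical names, the sentiment score is assumed to range from -1.0 to 1.0, and the text mining engine is represented by any callable that returns a `Metadata` object.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Set


@dataclass
class Metadata:
    """Assumed shape of the metadata a text mining engine returns."""
    entities: Set[str] = field(default_factory=set)
    categories: Set[str] = field(default_factory=set)
    sentiment: float = 0.0  # assumed range: -1.0 (negative) to 1.0 (positive)


@dataclass
class FilterRule:
    """Rule built ahead of time from named entities, categories, or sentiment."""
    required_entities: Set[str] = field(default_factory=set)
    required_categories: Set[str] = field(default_factory=set)
    min_sentiment: Optional[float] = None

    def matches(self, md: Metadata) -> bool:
        # Each configured criterion must be satisfied; unset criteria are skipped.
        if self.required_entities and not (self.required_entities & md.entities):
            return False
        if self.required_categories and not (self.required_categories & md.categories):
            return False
        if self.min_sentiment is not None and md.sentiment < self.min_sentiment:
            return False
        return True


def process(content: str,
            mine: Callable[[str], Metadata],
            rule: FilterRule,
            store: List[str]) -> bool:
    """Persist content only if the rule matches; otherwise drop it unstored."""
    metadata = mine(content)
    if rule.matches(metadata):
        store.append(content)
        return True
    return False  # content leaves the pipeline without being persisted
```

The point of the design is ordering: the rule runs on mined metadata before any write, so rejected content never reaches the data store.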

    FLEXIBLE AND SCALABLE ARTIFICIAL INTELLIGENCE AND ANALYTICS PLATFORM WITH ADVANCED CONTENT ANALYTICS AND DATA INGESTION

    Publication number: US20190279101A1

    Publication date: 2019-09-12

    Application number: US16296015

    Filing date: 2019-03-07

    Abstract: Disclosed is a flexible and scalable artificial intelligence and analytics platform with advanced content analytics and content ingestion. Disparate contents can be ingested into a content analytics system of the platform through a content ingestion pipeline operated by a sophisticated text mining engine. Prior to persistence, editorial metadata can be extracted and semantic metadata inferred to gain insights across the disparate contents. The editorial metadata and the semantic metadata can be dynamically mapped, as the disparate contents are crawled from disparate sources, to an internal ingestion pipeline document conforming to a uniform mapping schema that specifies master metadata of interest. For persistence, the semantic metadata in the internal ingestion pipeline document can be mapped to metadata tables conforming to a single common data model of a central repository. In this way, ingested metadata can be leveraged across the platform, for instance, for trend analysis, mood detection, model building, etc.
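The mapping of source-specific metadata onto a uniform schema, as described in this abstract, might look like the following sketch. The field names in `MASTER_FIELDS`, the per-source keys in `SOURCE_MAPPINGS`, and the function `to_pipeline_document` are hypothetical; the abstract does not specify the schema.

```python
from typing import Any, Dict

# Hypothetical master metadata of interest in the uniform mapping schema.
MASTER_FIELDS = ("title", "author", "published")

# Hypothetical per-source mappings from source keys to master field names.
SOURCE_MAPPINGS: Dict[str, Dict[str, str]] = {
    "web": {"og:title": "title", "article:author": "author", "date": "published"},
    "feed": {"headline": "title", "creator": "author", "pubDate": "published"},
}


def to_pipeline_document(source: str,
                         editorial: Dict[str, Any],
                         semantic: Dict[str, Any]) -> Dict[str, Any]:
    """Map editorial metadata to master fields and carry semantic metadata along."""
    mapping = SOURCE_MAPPINGS[source]
    # Start with every master field present so all documents share one shape.
    document: Dict[str, Any] = {name: None for name in MASTER_FIELDS}
    for key, value in editorial.items():
        target = mapping.get(key)
        if target is not None:  # keys outside the schema are ignored
            document[target] = value
    document["semantic"] = dict(semantic)
    return document


web_doc = to_pipeline_document(
    "web",
    {"og:title": "Q3 results", "article:author": "J. Doe", "viewport": "x"},
    {"sentiment": "positive"},
)
feed_doc = to_pipeline_document("feed", {"headline": "Q3 results"}, {})
```

Both calls yield documents with the same keys regardless of source, which is what allows a later persistence step to target a single common data model.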

    INTEGRATION SERVICES SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR ECM-INDEPENDENT ETL TOOLS

    Publication number: US20170199989A1

    Publication date: 2017-07-13

    Application number: US15471823

    Filing date: 2017-03-28

    Abstract: To resolve a conflict between CMIS secondary types and certain ECM features such as content server categories, and to allow the underlying ECM system to be fully CMIS-compliant, an ECM-independent ETL tool comprising a CMIS-compliant, repository-specific connector is provided. Operating on an integration services server at an integration tier between an application tier and a storage tier where the repository resides, the connector is particularly configured to support CMIS secondary types and is specific to the repository. On startup, the connector can import any category definition from the repository. The category definition contains properties associated with a category in the repository. When the category is attached to a document, the properties are viewable via a special category object type and a category identifier for the category. Any application can be adapted to leverage the ECM-independent ETL tool disclosed herein.

    SYSTEMS AND METHODS FOR INTELLIGENT CONTENT FILTERING AND PERSISTENCE

    Publication number: US20240012863A1

    Publication date: 2024-01-11

    Application number: US18472948

    Filing date: 2023-09-22

    CPC classification number: G06F16/9535 G06F16/9558 G06F40/30 G06F40/295

    Abstract: A source content processor receives content from a crawler and calls a text mining engine. The text mining engine mines the content and provides metadata about the content. The source content processor applies a source content filtering rule to the content utilizing the metadata from the text mining engine. The source content filtering rule is previously built based on at least one of a named entity, a category, or a sentiment. The source content processor determines whether to persist the content according to a result from applying the source content filtering rule to the content and either stores the content in a data store or deletes the content from the data ingestion pipeline such that the content is not persisted anywhere. Embodiments disclosed herein can significantly reduce the amount of irrelevant content passing through the data ingestion pipeline, prior to data persistence.
