-
11.
Publication number: US20190266180A1
Publication date: 2019-08-29
Application number: US16410472
Application date: 2019-05-13
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/31, G06F16/33, G06F16/332, G06F16/951, G06F17/24, G06F16/248, G06F16/36, G06F16/338
Abstract: Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.
-
12.
Publication number: US10331714B2
Publication date: 2019-06-25
Application number: US14079406
Application date: 2013-11-13
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/00, G06F16/31, G06F16/36, G06F16/248, G06F16/338, G06F16/951, G06F16/332, G06F16/33, G06F17/24
Abstract: Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.
-
13.
Publication number: US20250156505A1
Publication date: 2025-05-15
Application number: US19023301
Application date: 2025-01-16
Applicant: Open Text SA ULC
Inventor: Alexander Lilko, Martin Brousseau
Abstract: To resolve a conflict between CMIS secondary types and certain ECM features such as content server categories, and allow the underlying ECM system to be fully CMIS-compliant, an ECM-independent ETL tool comprising a CMIS-compliant, repository-specific connector is provided. Operating on an integration services server at an integration tier between an application tier and a storage tier where the repository resides, the connector is particularly configured to support CMIS secondary types and specific to the repository. On startup, the connector can import any category definition from the repository. The category definition contains properties associated with a category in the repository. When the category is attached to a document, the properties are viewable via a special category object type and a category identifier for the category. Any application can be adapted to leverage the ECM-independent ETL tool disclosed herein.
-
14.
Publication number: US11803600B2
Publication date: 2023-10-31
Application number: US17510583
Application date: 2021-10-26
Applicant: Open Text SA ULC
Inventor: Martin Brousseau, Steve Pettigrew
IPC: G06F17/00, G06F16/9535, G06F16/955, G06F40/30, G06F40/295
CPC classification number: G06F16/9535, G06F16/9558, G06F40/295, G06F40/30
Abstract: A source content processor receives content from a crawler and calls a text mining engine. The text mining engine mines the content and provides metadata about the content. The source content processor applies a source content filtering rule to the content utilizing the metadata from the text mining engine. The source content filtering rule is previously built based on at least one of a named entity, a category, or a sentiment. The source content processor determines whether to persist the content according to a result from applying the source content filtering rule to the content and either stores the content in a data store or deletes the content from the data ingestion pipeline such that the content is not persisted anywhere. Embodiments disclosed herein can significantly reduce the amount of irrelevant content through the data ingestion pipeline, prior to data persistence.
-
15.
Publication number: US20230333919A1
Publication date: 2023-10-19
Application number: US18339245
Application date: 2023-06-22
Applicant: OPEN TEXT SA ULC
Inventor: Norddin Habti, Steve Pettigrew, Martin Brousseau, Lalith Subramanian
CPC classification number: G06F9/541, G06F16/221, G06F16/2456, G06F16/252, G06F16/27, G06F16/84, G06N5/04, G06N20/00
Abstract: Disclosed is a flexible and scalable artificial intelligence and analytics platform with advanced content analytics and content ingestion. Disparate contents can be ingested into a content analytics system of the platform through a content ingestion pipeline operated by a sophisticated text mining engine. Prior to persistence, editorial metadata can be extracted and semantic metadata inferred to gain insights across the disparate contents. The editorial metadata and the semantic metadata can be dynamically mapped, as the disparate contents are crawled from disparate sources, to an internal ingestion pipeline document conforming to a uniform mapping schema that specifies master metadata of interest. For persistence, the semantic metadata in the internal ingestion pipeline document can be mapped to metadata tables conforming to a single common data model of a central repository. In this way, ingested metadata can be leveraged across the platform, for instance, for trend analysis, mood detection, model building, etc.
-
16.
Publication number: US20230297602A1
Publication date: 2023-09-21
Application number: US18322511
Application date: 2023-05-23
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/31, G06F16/36, G06F16/248, G06F16/338, G06F16/951, G06F16/332, G06F16/33
CPC classification number: G06F16/316, G06F16/248, G06F16/3325, G06F16/3344, G06F16/338, G06F16/36, G06F16/951
Abstract: Methods, systems and computer-readable media enable various techniques related to semantic navigation. One aspect is a technique for displaying semantically derived facets in the search engine interface. Each of the facets comprises faceted search results. Each of the faceted search results is displayed in association with user interface elements for including or excluding the faceted search result as additional search terms to subsequently refine the search query. Another aspect automatically infers new metadata from the content and from existing metadata and then automatically annotates the content with the new metadata to improve recall and navigation. Another aspect identifies semantic annotations by determining semantic connections between the semantic annotations and then dynamically generating a topic page based on the semantic connections.
-
17.
Publication number: US20190279101A1
Publication date: 2019-09-12
Application number: US16296015
Application date: 2019-03-07
Applicant: OPEN TEXT SA ULC
Inventor: Norddin Habti, Steve Pettigrew, Martin Brousseau, Lalith Subramanian
Abstract: Disclosed is a flexible and scalable artificial intelligence and analytics platform with advanced content analytics and content ingestion. Disparate contents can be ingested into a content analytics system of the platform through a content ingestion pipeline operated by a sophisticated text mining engine. Prior to persistence, editorial metadata can be extracted and semantic metadata inferred to gain insights across the disparate contents. The editorial metadata and the semantic metadata can be dynamically mapped, as the disparate contents are crawled from disparate sources, to an internal ingestion pipeline document conforming to a uniform mapping schema that specifies master metadata of interest. For persistence, the semantic metadata in the internal ingestion pipeline document can be mapped to metadata tables conforming to a single common data model of a central repository. In this way, ingested metadata can be leveraged across the platform, for instance, for trend analysis, mood detection, model building, etc.
-
18.
Publication number: US20170199989A1
Publication date: 2017-07-13
Application number: US15471823
Application date: 2017-03-28
Applicant: OPEN TEXT SA ULC
Inventor: Alexander Lilko, Martin Brousseau
CPC classification number: G06F21/10, G06F17/30563, G06F21/604, G06F21/6218, H04L63/101
Abstract: To resolve a conflict between CMIS secondary types and certain ECM features such as content server categories, and allow the underlying ECM system to be fully CMIS-compliant, an ECM-independent ETL tool comprising a CMIS-compliant, repository-specific connector is provided. Operating on an integration services server at an integration tier between an application tier and a storage tier where the repository resides, the connector is particularly configured to support CMIS secondary types and specific to the repository. On startup, the connector can import any category definition from the repository. The category definition contains properties associated with a category in the repository. When the category is attached to a document, the properties are viewable via a special category object type and a category identifier for the category. Any application can be adapted to leverage the ECM-independent ETL tool disclosed herein.
-
19.
Publication number: US20240012863A1
Publication date: 2024-01-11
Application number: US18472948
Application date: 2023-09-22
Applicant: Open Text SA ULC
Inventor: Martin Brousseau, Steve Pettigrew
IPC: G06F16/9535, G06F16/955, G06F40/30, G06F40/295
CPC classification number: G06F16/9535, G06F16/9558, G06F40/30, G06F40/295
Abstract: A source content processor receives content from a crawler and calls a text mining engine. The text mining engine mines the content and provides metadata about the content. The source content processor applies a source content filtering rule to the content utilizing the metadata from the text mining engine. The source content filtering rule is previously built based on at least one of a named entity, a category, or a sentiment. The source content processor determines whether to persist the content according to a result from applying the source content filtering rule to the content and either stores the content in a data store or deletes the content from the data ingestion pipeline such that the content is not persisted anywhere. Embodiments disclosed herein can significantly reduce the amount of irrelevant content through the data ingestion pipeline, prior to data persistence.
-
20.
Publication number: US11361007B2
Publication date: 2022-06-14
Application number: US16410472
Application date: 2019-05-13
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/00, G06F16/31, G06F16/36, G06F16/248, G06F16/338, G06F16/951, G06F16/332, G06F16/33
Abstract: Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.
-