-
Publication No.: US10282372B2
Publication date: 2019-05-07
Application No.: US15059125
Filing date: 2016-03-02
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/00, G06F16/31, G06F16/36, G06F16/248, G06F16/338, G06F16/951, G06F16/332, G06F16/33, G06F17/24
Abstract: Methods, systems and computer-readable media enable various techniques related to semantic navigation. One aspect is a technique for displaying semantically derived facets in the search engine interface. Each of the facets comprises faceted search results. Each of the faceted search results is displayed in association with user interface elements for including or excluding the faceted search result as additional search terms to subsequently refine the search query. Another aspect automatically infers new metadata from the content and from existing metadata and then automatically annotates the content with the new metadata to improve recall and navigation. Another aspect identifies semantic annotations by determining semantic connections between the semantic annotations and then dynamically generating a topic page based on the semantic connections.
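The include/exclude refinement described in this abstract can be illustrated with a minimal sketch. It is not taken from the patent: the `Query` class, the `refine` and `matches` helpers, and the sample documents are all hypothetical, and facet values are assumed to be plain strings attached to each document as annotations.

```python
from dataclasses import dataclass, field


@dataclass
class Query:
    """Hypothetical search query with facet values to include or exclude."""
    text: str
    include: set = field(default_factory=set)
    exclude: set = field(default_factory=set)


def refine(query, facet_value, mode):
    """Return a new query with one facet value added as an include or exclude term."""
    include, exclude = set(query.include), set(query.exclude)
    if mode == "include":
        include.add(facet_value)
        exclude.discard(facet_value)
    elif mode == "exclude":
        exclude.add(facet_value)
        include.discard(facet_value)
    else:
        raise ValueError("mode must be 'include' or 'exclude'")
    return Query(query.text, include, exclude)


def matches(doc, query):
    """A document matches if it has the text, every included value, and no excluded value."""
    annotations = set(doc["annotations"])
    return (
        query.text.lower() in doc["text"].lower()
        and query.include <= annotations
        and not (query.exclude & annotations)
    )


# Hypothetical annotated documents.
docs = [
    {"text": "Merger talks in Montreal", "annotations": ["Montreal", "Finance"]},
    {"text": "Merger rumours in Toronto", "annotations": ["Toronto", "Finance"]},
]
q = refine(Query("merger"), "Toronto", "exclude")
results = [d["text"] for d in docs if matches(d, q)]
```

Each call to `refine` returns a new query rather than mutating the old one, so a user interface could keep the earlier query around for an undo step.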
-
Publication No.: US11726840B2
Publication date: 2023-08-15
Application No.: US16296015
Filing date: 2019-03-07
Applicant: OPEN TEXT SA ULC
Inventor: Norddin Habti, Steve Pettigrew, Martin Brousseau, Lalith Subramanian
IPC: G06F16/20, G06F9/54, G06F16/27, G06N20/00, G06F16/22, G06F16/25, G06F16/2455, G06F16/84, G06N5/04
CPC classification number: G06F9/541, G06F16/221, G06F16/2456, G06F16/252, G06F16/27, G06F16/84, G06N5/04, G06N20/00
Abstract: Disclosed is a flexible and scalable artificial intelligence and analytics platform with advanced content analytics and content ingestion. Disparate contents can be ingested into a content analytics system of the platform through a content ingestion pipeline operated by a sophisticated text mining engine. Prior to persistence, editorial metadata can be extracted and semantic metadata inferred to gain insights across the disparate contents. The editorial metadata and the semantic metadata can be dynamically mapped, as the disparate contents are crawled from disparate sources, to an internal ingestion pipeline document conforming to a uniform mapping schema that specifies master metadata of interest. For persistence, the semantic metadata in the internal ingestion pipeline document can be mapped to metadata tables conforming to a single common data model of a central repository. In this way, ingested metadata can be leveraged across the platform, for instance, for trend analysis, mood detection, model building, etc.
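As a rough illustration of mapping metadata from disparate sources onto one uniform schema before persistence, the sketch below may help. Everything in it is an assumption made for illustration: the `MAPPING_SCHEMA` layout, the source names `web` and `cms`, the field names, and the keyword-based `infer_semantic_metadata` stand-in for a text mining engine do not come from the patent.

```python
# Hypothetical uniform mapping schema: master field -> per-source field name.
MAPPING_SCHEMA = {
    "title": {"web": "og_title", "cms": "name"},
    "author": {"web": "byline", "cms": "created_by"},
    "published": {"web": "date", "cms": "created_at"},
}


def infer_semantic_metadata(text):
    """Stand-in for a text mining engine: flags a few hard-coded keywords as topics."""
    keywords = ("merger", "lawsuit", "recall")
    return {"topics": [k for k in keywords if k in text.lower()]}


def to_pipeline_document(source, raw):
    """Map source-specific fields onto the master metadata, then attach inferred metadata."""
    editorial = {
        master: raw.get(per_source[source])
        for master, per_source in MAPPING_SCHEMA.items()
        if source in per_source
    }
    return {
        "source": source,
        "editorial": editorial,
        "semantic": infer_semantic_metadata(raw.get("body", "")),
    }


# Hypothetical record crawled from a content management system.
doc = to_pipeline_document(
    "cms",
    {
        "name": "Q3 update",
        "created_by": "jdoe",
        "created_at": "2019-03-07",
        "body": "The merger closed without a lawsuit.",
    },
)
```

Because every source is reduced to the same `editorial` and `semantic` keys, a later persistence step could write these documents to a single set of tables without per-source branching.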
-
Publication No.: US20220277030A1
Publication date: 2022-09-01
Application No.: US17745525
Filing date: 2022-05-16
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/31, G06F16/36, G06F16/248, G06F16/338, G06F16/951, G06F16/332, G06F16/33
Abstract: Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.
-
Publication No.: US12236288B2
Publication date: 2025-02-25
Application No.: US18339245
Filing date: 2023-06-22
Applicant: OPEN TEXT SA ULC
Inventor: Norddin Habti, Steve Pettigrew, Martin Brousseau, Lalith Subramanian
Abstract: Disclosed is a flexible and scalable artificial intelligence and analytics platform with advanced content analytics and content ingestion. Disparate contents can be ingested into a content analytics system of the platform through a content ingestion pipeline operated by a sophisticated text mining engine. Prior to persistence, editorial metadata can be extracted and semantic metadata inferred to gain insights across the disparate contents. The editorial metadata and the semantic metadata can be dynamically mapped, as the disparate contents are crawled from disparate sources, to an internal ingestion pipeline document conforming to a uniform mapping schema that specifies master metadata of interest. For persistence, the semantic metadata in the internal ingestion pipeline document can be mapped to metadata tables conforming to a single common data model of a central repository. In this way, ingested metadata can be leveraged across the platform, for instance, for trend analysis, mood detection, model building, etc.
-
Publication No.: US11977570B2
Publication date: 2024-05-07
Application No.: US17745525
Filing date: 2022-05-16
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/00, G06F16/248, G06F16/31, G06F16/33, G06F16/332, G06F16/338, G06F16/36, G06F16/951
CPC classification number: G06F16/338, G06F16/248, G06F16/316, G06F16/3325, G06F16/3344, G06F16/36, G06F16/951
Abstract: Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.
-
Publication No.: US20190266180A1
Publication date: 2019-08-29
Application No.: US16410472
Filing date: 2019-05-13
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/31, G06F16/33, G06F16/332, G06F16/951, G06F17/24, G06F16/248, G06F16/36, G06F16/338
Abstract: Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.
-
Publication No.: US10331714B2
Publication date: 2019-06-25
Application No.: US14079406
Filing date: 2013-11-13
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/00, G06F16/31, G06F16/36, G06F16/248, G06F16/338, G06F16/951, G06F16/332, G06F16/33, G06F17/24
Abstract: Content of different formats may be sourced from various data sources such as content servers and ingested into a data integration server by an ingestion broker embodied on a non-transitory computer readable medium. The ingestion broker may normalize the content of different formats into a uniform representation that can be indexed and delivered across multiple digital channels for a variety of applications. The normalized content may be analyzed and semantic metadata may be determined from the normalized content. The normalized content can be semantically enriched by associating the semantic metadata and the like with the content. The semantic metadata can be stored in a semantic index that can be used for searching via the data integration server. During search, the semantic metadata can be instantiated as facets for user navigation and refinement of search criteria and additional semantic relationships can be assigned to the words in the normalized content.
-
Publication No.: US11803600B2
Publication date: 2023-10-31
Application No.: US17510583
Filing date: 2021-10-26
Applicant: Open Text SA ULC
Inventor: Martin Brousseau, Steve Pettigrew
IPC: G06F17/00, G06F16/9535, G06F16/955, G06F40/30, G06F40/295
CPC classification number: G06F16/9535, G06F16/9558, G06F40/295, G06F40/30
Abstract: A source content processor receives content from a crawler and calls a text mining engine. The text mining engine mines the content and provides metadata about the content. The source content processor applies a source content filtering rule to the content utilizing the metadata from the text mining engine. The source content filtering rule is previously built based on at least one of a named entity, a category, or a sentiment. The source content processor determines whether to persist the content according to a result from applying the source content filtering rule to the content and either stores the content in a data store or deletes the contents from the data ingestion pipeline such that the content is not persisted anywhere. Embodiments disclosed herein can significantly reduce the amount of irrelevant content through the data ingestion pipeline, prior to data persistence.
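The filter-before-persist flow in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: `mine` is a keyword-based stand-in for a text mining engine, and `make_rule`, `process`, the entity names, and the list used as a data store are all hypothetical.

```python
def mine(text):
    """Stand-in for the text mining engine; returns hypothetical metadata."""
    lowered = text.lower()
    return {
        "entities": [name for name in ("Open Text", "Acme") if name.lower() in lowered],
        "category": "finance" if "earnings" in lowered else "other",
        "sentiment": "negative" if "loss" in lowered else "neutral",
    }


def make_rule(entity=None, category=None, sentiment=None):
    """Build a filtering rule from any of a named entity, a category, or a sentiment."""
    def rule(metadata):
        checks = []
        if entity is not None:
            checks.append(entity in metadata["entities"])
        if category is not None:
            checks.append(metadata["category"] == category)
        if sentiment is not None:
            checks.append(metadata["sentiment"] == sentiment)
        # A rule with no criteria matches nothing in this sketch.
        return bool(checks) and all(checks)
    return rule


def process(crawled, rule, data_store):
    """Persist only content that passes the rule; everything else is dropped."""
    for text in crawled:
        if rule(mine(text)):
            data_store.append(text)
    return data_store


# Hypothetical crawled content; only the first item satisfies the rule.
store = process(
    ["Acme reports earnings", "Weather is mild today"],
    make_rule(entity="Acme", category="finance"),
    [],
)
```

Content that fails the rule is never appended to the store, which mirrors the abstract's point that irrelevant content is discarded before persistence rather than cleaned up afterwards.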
-
Publication No.: US20230333919A1
Publication date: 2023-10-19
Application No.: US18339245
Filing date: 2023-06-22
Applicant: OPEN TEXT SA ULC
Inventor: Norddin Habti, Steve Pettigrew, Martin Brousseau, Lalith Subramanian
CPC classification number: G06F9/541, G06F16/221, G06F16/2456, G06F16/252, G06F16/27, G06F16/84, G06N5/04, G06N20/00
Abstract: Disclosed is a flexible and scalable artificial intelligence and analytics platform with advanced content analytics and content ingestion. Disparate contents can be ingested into a content analytics system of the platform through a content ingestion pipeline operated by a sophisticated text mining engine. Prior to persistence, editorial metadata can be extracted and semantic metadata inferred to gain insights across the disparate contents. The editorial metadata and the semantic metadata can be dynamically mapped, as the disparate contents are crawled from disparate sources, to an internal ingestion pipeline document conforming to a uniform mapping schema that specifies master metadata of interest. For persistence, the semantic metadata in the internal ingestion pipeline document can be mapped to metadata tables conforming to a single common data model of a central repository. In this way, ingested metadata can be leveraged across the platform, for instance, for trend analysis, mood detection, model building, etc.
-
Publication No.: US20230297602A1
Publication date: 2023-09-21
Application No.: US18322511
Filing date: 2023-05-23
Applicant: Open Text SA ULC
Inventor: Pascal Dimassimo, Steve Pettigrew, Martin Brousseau, Charles-Olivier Simard, Eric Williams, Francis Lacroix, Alex Dowgailenko, Agostino Deligia, Jean-Michel Texier
IPC: G06F16/31, G06F16/36, G06F16/248, G06F16/338, G06F16/951, G06F16/332, G06F16/33
CPC classification number: G06F16/316, G06F16/248, G06F16/3325, G06F16/3344, G06F16/338, G06F16/36, G06F16/951
Abstract: Methods, systems and computer-readable media enable various techniques related to semantic navigation. One aspect is a technique for displaying semantically derived facets in the search engine interface. Each of the facets comprises faceted search results. Each of the faceted search results is displayed in association with user interface elements for including or excluding the faceted search result as additional search terms to subsequently refine the search query. Another aspect automatically infers new metadata from the content and from existing metadata and then automatically annotates the content with the new metadata to improve recall and navigation. Another aspect identifies semantic annotations by determining semantic connections between the semantic annotations and then dynamically generating a topic page based on the semantic connections.