IDENTIFICATION OF CHANGES BETWEEN DOCUMENT VERSIONS

    公开(公告)号:US20210271718A1

    公开(公告)日:2021-09-02

    申请号:US16806438

    申请日:2020-03-02

    摘要: One embodiment provides a method, including: obtaining at least two documents, wherein one of the at least two documents comprises a revision different than another of the at least two documents; identifying, within each of the at least two documents, portions corresponding to groups of text containing a conceptual unit; assigning at least a subset of the identified portions to a category type corresponding to a topic of a given portion, wherein the assigning comprises (i) generating a semantic tag for the identified portions in the subset and (ii) tagging the identified portions in the subset with the semantic tag; and determining changes between the at least two documents, wherein the determining comprises (iii) aligning given portions across the at least two documents based upon a relationship between the given portions across the at least two documents, (iv) identifying semantic differences between the aligned portions, and (v) identifying any remaining unaligned portions.

    WATER WASTAGE DETECTION SYSTEM
    2.
    发明申请

    公开(公告)号:US20190162572A1

    公开(公告)日:2019-05-30

    申请号:US15825133

    申请日:2017-11-29

    IPC分类号: G01F15/075 G01F15/00 E03B7/07

    摘要: An indication of water flowing into a fixture and down a drain of the fixture is received. A timer is started. A flow rate and attributes of the water flowing into the fixture are monitored. Attributes of the water flowing through the drain are monitored. A determination is made that the attributes of water flowing into the fixture match, within a threshold, the attributes of water flowing through the drain. A first duration of time of water being wasted is determined. A determination is made that water flowing into the fixture has stopped. A total duration of time of water being wasted, based on at least the first duration of time, is determined. A volume of water being wasted is determined based on the total duration of time and the flow rate.

    Contrasting document-embedded structured data and generating summaries thereof

    公开(公告)号:US11500840B2

    公开(公告)日:2022-11-15

    申请号:US16804399

    申请日:2020-02-28

    摘要: Methods, systems, and computer program products for contrasting document-embedded structured data and generating summaries thereof are provided herein. A computer-implemented method includes extracting two or more tables from two or more input documents, wherein each of the two or more input documents comprises structured data and unstructured data; normalizing the two or more extracted tables using one or more alignment techniques; determining at least one of (i) one or more differences and (ii) one or more similarities across the two or more extracted tables by performing a comparison of the two or more normalized tables; deriving one or more insights from the comparison by applying at least one analytical model to the at least one of the one or more determined differences and one or more determined similarities; and outputting at least a portion of the one or more insights to at least one user.

    Contrasting Document-Embedded Structured Data and Generating Summaries Thereof

    公开(公告)号:US20210271654A1

    公开(公告)日:2021-09-02

    申请号:US16804399

    申请日:2020-02-28

    摘要: Methods, systems, and computer program products for contrasting document-embedded structured data and generating summaries thereof are provided herein. A computer-implemented method includes extracting two or more tables from two or more input documents, wherein each of the two or more input documents comprises structured data and unstructured data; normalizing the two or more extracted tables using one or more alignment techniques; determining at least one of (i) one or more differences and (ii) one or more similarities across the two or more extracted tables by performing a comparison of the two or more normalized tables; deriving one or more insights from the comparison by applying at least one analytical model to the at least one of the one or more determined differences and one or more determined similarities; and outputting at least a portion of the one or more insights to at least one user.

    Data quality-based confidence computations for KPIs derived from time-series data

    公开(公告)号:US11314584B1

    公开(公告)日:2022-04-26

    申请号:US17105036

    申请日:2020-11-25

    摘要: A system, computer program product, and method are presented for providing confidence values for replacement data for data that has issues indicative of errors, where the data issues, the replacement data, and confidence values are related to one or more KPIs. The method includes identifying one or more potentially erroneous data instances and determining one or more predicted replacement values for the potentially erroneous data instances. The method further includes determining a confidence value for each predicted replacement value and resolving the one or more potentially erroneous data instances with one predicted replacement value of the one or more predicted replacement values. The method also includes generating an explanatory basis for the resolution of the one or more potentially erroneous data instances.

    EXPLANATION FOR TIME SERIES FORECASTING MODELS

    公开(公告)号:US20220083897A1

    公开(公告)日:2022-03-17

    申请号:US17017836

    申请日:2020-09-11

    摘要: A method, system, and computer program product for explaining predictions made by black box time series models. The method may include identifying a black box time series model. The method may also include predicting one or more time instances using the black box time series model. The method may also include selecting a predicted time instance from the predicted data. The method may also include receiving training data for the black box time series model. The method may also include generating a set of white box time series models similar to the black box time series model. The method may also include selecting a preferred white box time series model. The method may also include analyzing behavior of the preferred white box time series model. The method may also include generating an explanation illustrating why the black box time series model forecasted the predicted time instance.

    Identification of changes between document versions

    公开(公告)号:US11630869B2

    公开(公告)日:2023-04-18

    申请号:US16806438

    申请日:2020-03-02

    摘要: One embodiment provides a method, including: obtaining at least two documents, wherein one of the at least two documents comprises a revision different than another of the at least two documents; identifying, within each of the at least two documents, portions corresponding to groups of text containing a conceptual unit; assigning at least a subset of the identified portions to a category type corresponding to a topic of a given portion, wherein the assigning comprises (i) generating a semantic tag for the identified portions in the subset and (ii) tagging the identified portions in the subset with the semantic tag; and determining changes between the at least two documents, wherein the determining comprises (iii) aligning given portions across the at least two documents based upon a relationship between the given portions across the at least two documents, (iv) identifying semantic differences between the aligned portions, and (v) identifying any remaining unaligned portions.

    DOCUMENT REVISION CHANGE SUMMARIZATION
    10.
    发明申请

    公开(公告)号:US20190286741A1

    公开(公告)日:2019-09-19

    申请号:US15922720

    申请日:2018-03-15

    IPC分类号: G06F17/30 G06F17/27 G06K9/00

    摘要: One embodiment provides a method, including: obtaining at least two documents, wherein one of the at least two documents comprises a different revision of another of the at least two documents; identifying a structure of each of the at least two documents by parsing each of the at least two documents to extract text from each of the at least two documents; aligning sections of the at least two documents, wherein the aligning comprises matching a section from one of the at least two documents and a corresponding section from another of the at least two documents; identifying at least one difference between the at least two documents; assigning a semantic label to the identified at least one difference; and providing a summary of the identified at least one difference by compressing the text surrounding the identified at least one difference using the assigned semantic label.