MACHINE LEARNING-POWERED FRAMEWORK TO TRANSFORM OVERLOADED TEXT DOCUMENTS

    公开(公告)号:WO2022271324A1

    公开(公告)日:2022-12-29

    申请号:PCT/US2022/029514

    申请日:2022-05-17

    发明人: LI, Ji

    摘要: Systems and methods for providing a machine learning-powered framework to transform overloaded text documents is provided. The system generates a plurality of candidate templates offline. During runtime, the system accesses a text document and identifies segmentation data. The segmentation data indicates a plurality of segments derived from the text document. The system accesses a plurality of candidate templates, whereby each candidate template comprises a plurality of pages having a different background element that shares a common theme. The plurality of candidate templates is ranked based on at least the segmentation data. The network generates multiple presentation pages for each of a predetermined number of top ranked candidate templates by incorporating each of the plurality of segments into a corresponding page of the plurality of pages for each of the top ranked candidate templates. The multiple presentation pages are presented for each of the top ranked candidate templates as a recommendation.

    ARTIFICIAL REALITY APPLICATION LIFECYCLE
    3.
    发明申请

    公开(公告)号:WO2023278101A1

    公开(公告)日:2023-01-05

    申请号:PCT/US2022/032288

    申请日:2022-06-05

    摘要: Aspects of the present disclosure are directed to an artificial reality (XR) application system controlling applications in an artificial reality environment. In various cases, these controls include automatically suggesting XR applications by determining an XR context and identifying applications that match the XR context. These applications can be suggested to a user, who can authorize their execution, setting permissions for the application. In some cases, applications can be divided into components which can be progressively downloaded. By providing application suggestions relevant to the current context and progressively downloading application components, applications can appear ambient, rather than relying on users to constantly download, install, or activate applications. Permissions for applications may be revoked permanently or for certain situations - either through user permissions selections or automatically in response to determined user intents. When multiple applications are simultaneously authorized to execute, the XR application system can employ a ranking system to prevent overcrowding.

    SELF-SUPERVISED DOCUMENT-TO-DOCUMENT SIMILARITY SYSTEM

    公开(公告)号:WO2022271304A1

    公开(公告)日:2022-12-29

    申请号:PCT/US2022/029096

    申请日:2022-05-13

    摘要: Examples provide a self-supervised language model for document-to-document similarity scoring and ranking long documents of arbitrary length in an absence of similarity labels. In a first stage of a two-staged hierarchical scoring, a sentence similarity matrix is created for each paragraph in the candidate document. A sentence similarity score is calculated based on the sentence similarity matrix. In the second stage, a paragraph similarity matrix is constructed based on aggregated sentence similarity scores associated with the first candidate document. A total similarity score for the document is calculated based on the normalize the paragraph similarity matrix for each candidate document in a collection of documents. The model is trained using a masked language model and intra-and-inter document sampling. The documents are ranked based on the similarity scores for the documents.

    ADVANCED RESPONSE PROCESSING IN WEB DATA COLLECTION

    公开(公告)号:WO2022268808A1

    公开(公告)日:2022-12-29

    申请号:PCT/EP2022/066874

    申请日:2022-06-21

    摘要: Advanced response processing in web data collection discloses processor-implemented apparatuses, methods, and systems of processing unstructured raw HTML responses collected in the context of a data collection service, the method comprising, in one embodiment, receiving raw unstructured HTML documents and extracting text data with associated meta information that may comprise style and formatting information. In some embodiments data field tags and values may be assigned to the text blocks extracted, classifying the data based on the processing of Machine Learning algorithms. Additionally, blocks of extracted data may be grouped and re-grouped together and presented as a single data point. In another embodiment the system may aggregate and present the text data with the associated meta information in a structured format. In certain embodiments the Machine Learning model may be a model trained on a pre-created training data set labeled manually or in an automatic fashion.

    SYSTEM AND METHOD FOR CROWDSOURCING A VIDEO SUMMARY FOR CREATING AN ENHANCED VIDEO SUMMARY

    公开(公告)号:WO2022240409A1

    公开(公告)日:2022-11-17

    申请号:PCT/US2021/032168

    申请日:2021-05-13

    申请人: CLIPR CO.

    摘要: System and method for crowdsourcing a video summary for creating an enhanced video summary are disclosed. The method includes receiving videos, analysing the videos, creating the video summary of the videos using a building block model, storing the video summary in a video library database, crowdsourcing the video summary to at least one of the plurality of users, enabling the at least one of the plurality of users to review the video summary and identify at least one new characteristic, enabling the at least one of the plurality of users to share the at least one new characteristic on the platform, comparing at least one existing characteristic of the building block model with the corresponding new characteristic, reconciling the video summary along with at least one inserted new characteristic, creating a new building block model, editing the video summary for creating the enhanced video summary.