-
51.
公开(公告)号:US20240346238A1
公开(公告)日:2024-10-17
申请号:US18638075
申请日:2024-04-17
Applicant: GONG.io Ltd.
Inventor: Eyal BEN-DAVID , Inbal HOREV , Raz NUSSBAUM , Adi KOPILOV , Nadav Shai Oved SHALEV , Shlomi MEDALION
IPC: G06F40/166 , G06F16/34
CPC classification number: G06F40/166 , G06F16/345
Abstract: Techniques for efficiently generating a brief of a call summary is provided. The method includes ingesting at least one simplified transcript, wherein a simplified transcript is a summarization of a transcript of a call and includes a plurality of bullet points of at least one main subject; representing each bullet point of the plurality of bullet points of the simplified transcript as an embedded vector using an embedding technique; determining at least one grouping of the plurality of bullet points based on the embedded vector, wherein the grouping includes at least one bullet point; feeding the at least one grouping into a trained rephrasing model to generate a rephrased content for each of the at least one grouping; and generating a summarized brief based on the rephrased content of the at least one grouping, wherein the summarized brief is generated as natural language textual data below a predetermined length.
-
公开(公告)号:US20240346237A1
公开(公告)日:2024-10-17
申请号:US18637896
申请日:2024-04-17
Applicant: YAMAHA HATSUDOKI KABUSHIKI KAISHA
Inventor: Haruyoshi HINO
IPC: G06F40/166 , G06F16/332 , G06F16/9538 , G06T11/00 , G06T13/80
CPC classification number: G06F40/166 , G06F16/3329 , G06F16/9538 , G06T11/00 , G06T13/80
Abstract: Using communicative human language information inputted through the User Interface (UI), an LLM (Language Learning Model) acquires communicative human language information to include in the content. Simultaneously, using the communicative human language information inputted through the UI, the LLM acquires information for selecting visualization software capable of generating visual information to include in the content. Based on the acquired communicative human language information, the visualization software is selected. According to the communicative human language information inputted through the UI, the selected visualization software is operated based on the text information acquired from the LLM's output. Visual information for inclusion in the content is acquired, and content containing at least a part of the acquired communicative human language information and at least a part of the acquired visual information is generated and outputted.
-
公开(公告)号:US20240346235A1
公开(公告)日:2024-10-17
申请号:US18300813
申请日:2023-04-14
Applicant: International Business Machines Corporation
Inventor: Odellia Boni , Michal Shmueli-Scheuer , Kshitij Fadnis , Pankaj Dhoolia
IPC: G06F40/166 , G06F16/23 , G06F16/958 , G06F40/14 , G06F40/197 , G06F40/30
CPC classification number: G06F40/166 , G06F16/2365 , G06F16/986 , G06F40/14 , G06F40/197 , G06F40/30
Abstract: Systems and techniques that facilitate context-aware edit management of webpage-based applications are provided. In various embodiments, a system can detect an edit made to a webpage. In various aspects, the system can determine whether a webpage-based application associated with the webpage is consistent with the edit, based on a registry that respectively maps outputs of the webpage-based application to source texts and corresponding semantic contexts of a pre-edit version of the webpage.
-
公开(公告)号:US12119028B2
公开(公告)日:2024-10-15
申请号:US17967364
申请日:2022-10-17
Applicant: Adobe Inc.
Inventor: Xue Bai , Justin Jonathan Salamon , Aseem Omprakash Agarwala , Hijung Shin , Haoran Cai , Joel Richard Brandt , Lubomira Assenova Dontcheva , Cristin Ailidh Fraser
IPC: G11B27/036 , G06F40/166 , G10L15/26 , G10L25/57 , G11B27/34 , G06F3/0482 , G06F3/04845 , G06F3/0485
CPC classification number: G11B27/036 , G06F40/166 , G10L15/26 , G10L25/57 , G11B27/34 , G06F3/0482 , G06F3/04845 , G06F3/0485
Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for identifying candidate boundaries for video segments, video segment selection using those boundaries, and text-based video editing of video segments selected via transcript interactions. In an example implementation, boundaries of detected sentences and words are extracted from a transcript, the boundaries are retimed into an adjacent speech gap to a location where voice or audio activity is a minimum, and the resulting boundaries are stored as candidate boundaries for video segments. As such, a transcript interface presents the transcript, interprets input selecting transcript text as an instruction to select a video segment with corresponding boundaries selected from the candidate boundaries, and interprets commands that are traditionally thought of as text-based operations (e.g., cut, copy, paste) as an instruction to perform a corresponding video editing operation using the selected video segment.
-
55.
公开(公告)号:US12118313B2
公开(公告)日:2024-10-15
申请号:US18166303
申请日:2023-02-08
Applicant: KABUSHIKI KAISHA TOSHIBA
Inventor: Yuka Kobayashi , Takami Yoshida , Kenji Iwata , Tsuyoshi Kushima , Hisayoshi Nagae
IPC: G06F17/00 , G06F40/166 , G06F40/242 , G06F40/289
CPC classification number: G06F40/289 , G06F40/166 , G06F40/242
Abstract: An information processing device includes at least one hardware processor. The hardware processor selects one or more pieces of partial document data from document data. The hardware processor extracts, from the partial document data, first information being a word or a phrase for specifying a first attribute of the partial document data. The hardware processor extracts, from the partial document data, second information being a word or a phrase for specifying a second attribute of the partial document data. The hardware processor calculates a first feature value representing a feature of the first information. The hardware processor calculates a second feature value representing a feature of the second information. The hardware processor analyzes the document data on the basis of the first feature value and the second feature value.
-
56.
公开(公告)号:US12118272B2
公开(公告)日:2024-10-15
申请号:US17990235
申请日:2022-11-18
Applicant: Suki AI, Inc.
Inventor: Matt Pallakoff
IPC: G06F3/16 , G06F3/0486 , G06F3/0488 , G06F40/166 , G10L15/22 , G16H10/60 , G06F3/04842
CPC classification number: G06F3/167 , G06F3/0486 , G06F3/0488 , G06F40/166 , G10L15/22 , G16H10/60 , G06F3/04842 , G10L2015/223
Abstract: Systems and methods to accept speech input and edit a note upon receipt of an indication to edit are disclosed. Exemplary implementations may: effectuate presentation of a graphical user interface that includes a note, the note including note sections, the note sections including a first note section, the individual note sections including body fields; obtain user input from the client computing platform, the user input representing an indication to edit a first body field of the first note section; obtain audio information representing sound captured by an audio section of the client computing platform, the audio information including value definition information specifying one or more values to be included in the individual body fields; perform speech recognition on the audio information to obtain a first value; and populate the first body field with the first value so that the first value is included in the first body field.
-
公开(公告)号:US12112742B2
公开(公告)日:2024-10-08
申请号:US17536890
申请日:2021-11-29
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Jongsun Lee , Jongyoub Ryu , Seonghan Ryu , Eunji Lee , Jaechul Yang , Hyungtak Choi
IPC: G10L15/22 , G06F40/166 , G06F40/30 , G10L15/18
CPC classification number: G10L15/1815 , G06F40/166 , G06F40/30 , G10L15/22 , G10L2015/223
Abstract: Provided are an electronic device for correcting a speech input, and an operating method thereof. The method may include receiving a first speech signal; obtaining first text; obtaining an intent of the first speech signal and a confidence score of the intent, by inputting the first text to a natural language understanding model; identifying a plurality of correction candidate semantic elements capable of being correction targets in the first text; receiving a second speech signal; obtaining second text; identifying whether the second speech signal is a speech signal for correcting the first text; comparing the plurality of correction candidate semantic elements in the first text with a semantic element in the second text, based on the confidence score; and correcting at least one of the plurality of correction candidate semantic elements in the first text.
-
公开(公告)号:US12112130B2
公开(公告)日:2024-10-08
申请号:US17518471
申请日:2021-11-03
Applicant: Adobe Inc.
Inventor: Sharmila Reddy Nangi , Niyati Himanshu Chhaya , Hyman Chung , Harshit Nyati , Nikhil Kaushik , Sopan Khosla
IPC: G06F40/253 , G06F3/04847 , G06F40/151 , G06F40/166 , G06F40/30
CPC classification number: G06F40/253 , G06F3/04847 , G06F40/151 , G06F40/166 , G06F40/30
Abstract: A text style transfer system is described that generates different stylized versions of input text by rewriting the input text according to a target style. To do so, the text style transfer system employs a variational autoencoder to derive separate content and style representations for the input text, where the content representation specifies semantic information conveyed by the input text and the style representation specifies one or more style attributes expressed by the input text. The style representation using counterfactual reasoning to identify different transfer strengths for applying the target style to the input text. Each transfer strength represents a minimum change to the input text that achieves a different expression of the target style. The transfer strengths are then used to generate style representation variants, which are each concatenated with the content representation of the input text to generate the plurality of different stylized versions of the input text.
-
公开(公告)号:US12111953B2
公开(公告)日:2024-10-08
申请号:US17287640
申请日:2019-10-25
Applicant: SERVICENOW CANADA INC.
Inventor: Elena Busila , Jerome Pasquero , Patrick Lazarus
IPC: G06F21/62 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20
CPC classification number: G06F21/6254 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20
Abstract: Systems and methods for privacy and sensitive data protection. An image of a document is received at a pre-processing stage and image pre-processing is applied to the image to ensure that the resulting image is sufficient for further processing. Pre-processing may involve processing relating to image quality and image orientation. The image is then passed to an initial processing stage. At the initial processing stage, the relevant data in the document are located and bounding boxes are placed around the data. The resulting image is then passed to a processing stage. At this stage, the type of data within the bounding boxes is determined and suitable replacement data is generated. The replacement data is then inserted into the image to thereby remove and replace the sensitive data in the image.
-
公开(公告)号:US20240331702A1
公开(公告)日:2024-10-03
申请号:US18743562
申请日:2024-06-14
Applicant: SoundHound AI IP, LLC.
Inventor: Kiersten L. BRADLEY , Ethan COEYTAUX , Ziming YIN
IPC: G10L15/26 , G06F40/134 , G06F40/166 , G06F40/284 , G10L15/02 , G10L15/06 , G10L15/07
CPC classification number: G10L15/26 , G06F40/134 , G06F40/166 , G06F40/284 , G10L15/02 , G10L15/063 , G10L15/07 , G10L2015/0631
Abstract: Methods and systems for enabling an efficient review of meeting content via a metadata-enriched, speaker-attributed transcript are disclosed. By incorporating speaker diarization and other metadata, the system can provide a structured and effective way to review and/or edit the transcript. One type of metadata can be image or video data to represent the meeting content. Furthermore, the present subject matter utilizes a multimodal diarization model to identify and label different speakers. The system can synchronize various sources of data, e.g., audio channel data, voice feature vectors, acoustic beamforming, image identification, and extrinsic data, to implement speaker diarization.
-
-
-
-
-
-
-
-
-