Patent search ipc:"G06F40/166" Page 6

51.

Invention publication
SYSTEM AND METHOD FOR GENERATING A BRIEF OF CONVERSATION SUMMARIES USING A LARGE LANGUAGE MODEL (under examination, published)

Publication (announcement) number: US20240346238A1

Publication (announcement) date: 2024-10-17

Application number: US18638075

Application date: 2024-04-17

Applicant: GONG.io Ltd.

Inventor: Eyal BEN-DAVID , Inbal HOREV , Raz NUSSBAUM , Adi KOPILOV , Nadav Shai Oved SHALEV , Shlomi MEDALION

IPC: G06F40/166 , G06F16/34

CPC classification number: G06F40/166 , G06F16/345

Abstract: Techniques for efficiently generating a brief of a call summary are provided. The method includes ingesting at least one simplified transcript, wherein a simplified transcript is a summarization of a transcript of a call and includes a plurality of bullet points of at least one main subject; representing each bullet point of the plurality of bullet points of the simplified transcript as an embedded vector using an embedding technique; determining at least one grouping of the plurality of bullet points based on the embedded vector, wherein the grouping includes at least one bullet point; feeding the at least one grouping into a trained rephrasing model to generate a rephrased content for each of the at least one grouping; and generating a summarized brief based on the rephrased content of the at least one grouping, wherein the summarized brief is generated as natural language textual data below a predetermined length.

52.

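Invention publication
CONTENT GENERATION SYSTEM (under examination, published)

Publication (announcement) number: US20240346237A1

Publication (announcement) date: 2024-10-17

Application number: US18637896

Application date: 2024-04-17

Applicant: YAMAHA HATSUDOKI KABUSHIKI KAISHA

Inventor: Haruyoshi HINO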

IPC: G06F40/166 , G06F16/332 , G06F16/9538 , G06T11/00 , G06T13/80

CPC classification number: G06F40/166 , G06F16/3329 , G06F16/9538 , G06T11/00 , G06T13/80

Abstract: Using communicative human language information inputted through the User Interface (UI), an LLM (Language Learning Model) acquires communicative human language information to include in the content. Simultaneously, using the communicative human language information inputted through the UI, the LLM acquires information for selecting visualization software capable of generating visual information to include in the content. Based on the acquired communicative human language information, the visualization software is selected. According to the communicative human language information inputted through the UI, the selected visualization software is operated based on the text information acquired from the LLM's output. Visual information for inclusion in the content is acquired, and content containing at least a part of the acquired communicative human language information and at least a part of the acquired visual information is generated and outputted.

53.

Invention publication
CONTEXT-AWARE EDIT MANAGEMENT OF WEBPAGE-BASED APPLICATIONS (under examination, published)

Publication (announcement) number: US20240346235A1

Publication (announcement) date: 2024-10-17

Application number: US18300813

Application date: 2023-04-14

Applicant: International Business Machines Corporation

Inventor: Odellia Boni , Michal Shmueli-Scheuer , Kshitij Fadnis , Pankaj Dhoolia

IPC: G06F40/166 , G06F16/23 , G06F16/958 , G06F40/14 , G06F40/197 , G06F40/30

CPC classification number: G06F40/166 , G06F16/2365 , G06F16/986 , G06F40/14 , G06F40/197 , G06F40/30

Abstract: Systems and techniques that facilitate context-aware edit management of webpage-based applications are provided. In various embodiments, a system can detect an edit made to a webpage. In various aspects, the system can determine whether a webpage-based application associated with the webpage is consistent with the edit, based on a registry that respectively maps outputs of the webpage-based application to source texts and corresponding semantic contexts of a pre-edit version of the webpage.

54.

Invention grant
Video segment selection and editing using transcript interactions (in force)

Publication (announcement) number: US12119028B2

Publication (announcement) date: 2024-10-15

Application number: US17967364

Application date: 2022-10-17

Applicant: Adobe Inc.

Inventor: Xue Bai , Justin Jonathan Salamon , Aseem Omprakash Agarwala , Hijung Shin , Haoran Cai , Joel Richard Brandt , Lubomira Assenova Dontcheva , Cristin Ailidh Fraser

IPC: G11B27/036 , G06F40/166 , G10L15/26 , G10L25/57 , G11B27/34 , G06F3/0482 , G06F3/04845 , G06F3/0485

CPC classification number: G11B27/036 , G06F40/166 , G10L15/26 , G10L25/57 , G11B27/34 , G06F3/0482 , G06F3/04845 , G06F3/0485

Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for identifying candidate boundaries for video segments, video segment selection using those boundaries, and text-based video editing of video segments selected via transcript interactions. In an example implementation, boundaries of detected sentences and words are extracted from a transcript, the boundaries are retimed into an adjacent speech gap to a location where voice or audio activity is a minimum, and the resulting boundaries are stored as candidate boundaries for video segments. As such, a transcript interface presents the transcript, interprets input selecting transcript text as an instruction to select a video segment with corresponding boundaries selected from the candidate boundaries, and interprets commands that are traditionally thought of as text-based operations (e.g., cut, copy, paste) as an instruction to perform a corresponding video editing operation using the selected video segment.
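
Illustrative note: the retiming step described in this abstract (moving a word or sentence boundary into the adjacent speech gap, at the point where audio activity is lowest) can be sketched as follows. This is a minimal sketch and not the patented implementation. The function name `retime_boundary`, the per-frame `activity` scores, and the use of frame indices for the gap are all assumptions made for illustration.

```python
def retime_boundary(activity, gap_start, gap_end):
    """Pick the frame inside a speech gap where audio activity is lowest.

    activity  -- hypothetical per-frame activity scores (higher = louder)
    gap_start -- index of the first frame of the gap (inclusive)
    gap_end   -- index of the last frame of the gap (inclusive)
    """
    if not 0 <= gap_start <= gap_end < len(activity):
        raise ValueError("gap must lie inside the activity track")
    # min() returns the earliest frame when several frames tie for the minimum.
    return min(range(gap_start, gap_end + 1), key=lambda i: activity[i])


# Made-up activity track: speech, a short gap (frames 2..4), then speech again.
activity = [0.9, 0.8, 0.3, 0.05, 0.2, 0.7, 0.9]
candidate_boundary = retime_boundary(activity, 2, 4)
```

The returned frame index would then be stored as a candidate boundary for a video segment; converting it to a timestamp depends on the frame rate, which the abstract does not specify.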

55.

Invention grant
Information processing device, information processing method, and computer program product (in force)

Publication (announcement) number: US12118313B2

Publication (announcement) date: 2024-10-15

Application number: US18166303

Application date: 2023-02-08

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventor: Yuka Kobayashi , Takami Yoshida , Kenji Iwata , Tsuyoshi Kushima , Hisayoshi Nagae

IPC: G06F17/00 , G06F40/166 , G06F40/242 , G06F40/289

CPC classification number: G06F40/289 , G06F40/166 , G06F40/242

Abstract: An information processing device includes at least one hardware processor. The hardware processor selects one or more pieces of partial document data from document data. The hardware processor extracts, from the partial document data, first information being a word or a phrase for specifying a first attribute of the partial document data. The hardware processor extracts, from the partial document data, second information being a word or a phrase for specifying a second attribute of the partial document data. The hardware processor calculates a first feature value representing a feature of the first information. The hardware processor calculates a second feature value representing a feature of the second information. The hardware processor analyzes the document data on the basis of the first feature value and the second feature value.

56.

Invention grant
Systems and methods to accept speech input and edit a note upon receipt of an indication to edit (in force)

Publication (announcement) number: US12118272B2

Publication (announcement) date: 2024-10-15

Application number: US17990235

Application date: 2022-11-18

Applicant: Suki AI, Inc.

Inventor: Matt Pallakoff

IPC: G06F3/16 , G06F3/0486 , G06F3/0488 , G06F40/166 , G10L15/22 , G16H10/60 , G06F3/04842

CPC classification number: G06F3/167 , G06F3/0486 , G06F3/0488 , G06F40/166 , G10L15/22 , G16H10/60 , G06F3/04842 , G10L2015/223

Abstract: Systems and methods to accept speech input and edit a note upon receipt of an indication to edit are disclosed. Exemplary implementations may: effectuate presentation of a graphical user interface that includes a note, the note including note sections, the note sections including a first note section, the individual note sections including body fields; obtain user input from the client computing platform, the user input representing an indication to edit a first body field of the first note section; obtain audio information representing sound captured by an audio section of the client computing platform, the audio information including value definition information specifying one or more values to be included in the individual body fields; perform speech recognition on the audio information to obtain a first value; and populate the first body field with the first value so that the first value is included in the first body field.

57.

Invention grant
Electronic device for correcting speech input of user and operating method thereof (in force)

Publication (announcement) number: US12112742B2

Publication (announcement) date: 2024-10-08

Application number: US17536890

Application date: 2021-11-29

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Jongsun Lee , Jongyoub Ryu , Seonghan Ryu , Eunji Lee , Jaechul Yang , Hyungtak Choi

IPC: G10L15/22 , G06F40/166 , G06F40/30 , G10L15/18

CPC classification number: G10L15/1815 , G06F40/166 , G06F40/30 , G10L15/22 , G10L2015/223

Abstract: Provided are an electronic device for correcting a speech input, and an operating method thereof. The method may include receiving a first speech signal; obtaining first text; obtaining an intent of the first speech signal and a confidence score of the intent, by inputting the first text to a natural language understanding model; identifying a plurality of correction candidate semantic elements capable of being correction targets in the first text; receiving a second speech signal; obtaining second text; identifying whether the second speech signal is a speech signal for correcting the first text; comparing the plurality of correction candidate semantic elements in the first text with a semantic element in the second text, based on the confidence score; and correcting at least one of the plurality of correction candidate semantic elements in the first text.

58.

Invention grant
Counterfactual text stylization (in force)

Publication (announcement) number: US12112130B2

Publication (announcement) date: 2024-10-08

Application number: US17518471

Application date: 2021-11-03

Applicant: Adobe Inc.

Inventor: Sharmila Reddy Nangi , Niyati Himanshu Chhaya , Hyman Chung , Harshit Nyati , Nikhil Kaushik , Sopan Khosla

IPC: G06F40/253 , G06F3/04847 , G06F40/151 , G06F40/166 , G06F40/30

CPC classification number: G06F40/253 , G06F3/04847 , G06F40/151 , G06F40/166 , G06F40/30

Abstract: A text style transfer system is described that generates different stylized versions of input text by rewriting the input text according to a target style. To do so, the text style transfer system employs a variational autoencoder to derive separate content and style representations for the input text, where the content representation specifies semantic information conveyed by the input text and the style representation specifies one or more style attributes expressed by the input text. The style representation is processed using counterfactual reasoning to identify different transfer strengths for applying the target style to the input text. Each transfer strength represents a minimum change to the input text that achieves a different expression of the target style. The transfer strengths are then used to generate style representation variants, which are each concatenated with the content representation of the input text to generate the plurality of different stylized versions of the input text.

59.

Invention grant
Sensitive data detection and replacement (in force)

Publication (announcement) number: US12111953B2

Publication (announcement) date: 2024-10-08

Application number: US17287640

Application date: 2019-10-25

Applicant: SERVICENOW CANADA INC.

Inventor: Elena Busila , Jerome Pasquero , Patrick Lazarus

IPC: G06F21/62 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20

CPC classification number: G06F21/6254 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20

Abstract: Systems and methods for privacy and sensitive data protection. An image of a document is received at a pre-processing stage and image pre-processing is applied to the image to ensure that the resulting image is sufficient for further processing. Pre-processing may involve processing relating to image quality and image orientation. The image is then passed to an initial processing stage. At the initial processing stage, the relevant data in the document are located and bounding boxes are placed around the data. The resulting image is then passed to a processing stage. At this stage, the type of data within the bounding boxes is determined and suitable replacement data is generated. The replacement data is then inserted into the image to thereby remove and replace the sensitive data in the image.

60.

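Invention publication
METHOD AND SYSTEM FOR CONVERSATION TRANSCRIPTION WITH METADATA (under examination, published)

Publication (announcement) number: US20240331702A1

Publication (announcement) date: 2024-10-03

Application number: US18743562

Application date: 2024-06-14

Applicant: SoundHound AI IP, LLC.

Inventor: Kiersten L. BRADLEY , Ethan COEYTAUX , Ziming YIN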

IPC: G10L15/26 , G06F40/134 , G06F40/166 , G06F40/284 , G10L15/02 , G10L15/06 , G10L15/07

CPC classification number: G10L15/26 , G06F40/134 , G06F40/166 , G06F40/284 , G10L15/02 , G10L15/063 , G10L15/07 , G10L2015/0631

Abstract: Methods and systems for enabling an efficient review of meeting content via a metadata-enriched, speaker-attributed transcript are disclosed. By incorporating speaker diarization and other metadata, the system can provide a structured and effective way to review and/or edit the transcript. One type of metadata can be image or video data to represent the meeting content. Furthermore, the present subject matter utilizes a multimodal diarization model to identify and label different speakers. The system can synchronize various sources of data, e.g., audio channel data, voice feature vectors, acoustic beamforming, image identification, and extrinsic data, to implement speaker diarization.
