-
公开(公告)号:US20230394860A1
公开(公告)日:2023-12-07
申请号:US17832642
申请日:2022-06-04
Applicant: Zoom Video Communications, Inc.
Inventor: Renjie Tao , Ling Tsou
IPC: G06V30/19 , G06V10/62 , G06V20/62 , G06V20/40 , G06V10/70 , G06V30/14 , G06F16/783 , G06F16/738
CPC classification number: G06V30/19013 , G06V10/62 , G06V20/62 , G06V20/41 , G06V10/768 , G06V30/1444 , G06F16/7844 , G06F16/738 , H04L65/403
Abstract: Methods and systems provide for video-based search results within a communication session. In one embodiment, the system receives video content of a communication session with a number of participants; extracts, via optical character recognition (“OCR”), textual content from the frames of the video content, each piece of textual content including a timestamp representing a temporal location of the frame within the video content; receives, from a client device associated with a user, a request to search for specified text within the video content; in response to receiving the request, determines one or more matching pieces of textual content which match to the specified text; and presents, to the client device, the matching pieces of textual content.
-
公开(公告)号:US20240414017A1
公开(公告)日:2024-12-12
申请号:US18206137
申请日:2023-06-06
Applicant: Zoom Video Communications, Inc.
Inventor: Bilung LEE , Vijay Venkataswamy Parthasarathy , Renjie Tao , Sasank Vemuri
Abstract: Some examples involve an artificial intelligence (AI) system for handling a query about a conversation, such as a conversation between attendees of a videoconferencing meeting. As one example, the system can receive a query from a user about a conversation between attendees of a videoconferencing meeting. The system can determine a relevant portion of the conversation based on the query, determine an intent of the query by providing the query as input to an intent detection model, and select a machine-learning model from among a group of machine-learning models based on the intent of the query. The system can then provide the relevant portion of the conversation as input to the selected machine-learning model. The machine-learning model can generate an output based on the relevant portion of the conversation. The system can transmit the output to the user in a response to the query.
-
公开(公告)号:US20230394854A1
公开(公告)日:2023-12-07
申请号:US17832634
申请日:2022-06-04
Applicant: Zoom Video Communications, Inc.
Inventor: Ravi Teja Polavaram , Renjie Tao , Ling Tsou , Tong Wang , Yun Zhang
IPC: G06V20/70 , H04L65/403 , G06V20/40 , G10L15/26 , G10L25/57 , G06V30/19 , G06F40/253 , G06F40/295 , G10L15/04
CPC classification number: G06V20/70 , H04L65/403 , G06V20/41 , G06V20/49 , G10L15/26 , G10L25/57 , G06V30/19 , G06F40/253 , G06F40/295 , G10L15/04
Abstract: Methods and systems provide for providing video-based chapter generation for a communication session. In one embodiment, the system receives a transcript and video content of a communication session between participants, the transcript including timestamps for a number of utterances associated with speaking participants; processes the video content to extract one or more pieces of textual content visible within the frames of the video content; segments frames of the video content into a number of contiguous topic segments; determines a title for each topic segment from one or more of: the transcript, and the extracted textual content; assigns a category label for each topic segment from a prespecified list of category labels; and transmits, to one or more client devices, the list of topic segments with determined title and assigned category label for each of the merged topic segments.
-
公开(公告)号:US12198433B2
公开(公告)日:2025-01-14
申请号:US18104138
申请日:2023-01-31
Applicant: Zoom Video Communications, Inc.
Inventor: Andrew Miller-Smith , Renjie Tao , Ling Tsou
IPC: G06K9/00 , G06V10/762 , G06V20/40 , G06V20/70 , G06V30/19
Abstract: Methods and systems provide for search results within segmented communication session content. In one embodiment, the system receives a transcript and video content of a communication session between participants, the transcript including timestamps for a number of utterances associated with speaking participants; processes the video content to extract textual content visible within the frames of the video content; segments frames of the video content into a number of contiguous topic segments; determines a title for each topic segment; assigns a category label for each topic segment; receives a request from a user to search for specified text within the video content; determines one or more titles or category labels for which a prediction of relatedness with the specified text is present; and presents content from at least one topic segment associated with the one or more titles or category labels for which a prediction of relatedness is present.
-
公开(公告)号:US20230394861A1
公开(公告)日:2023-12-07
申请号:US17832635
申请日:2022-06-04
Applicant: Zoom Video Communications, Inc.
Inventor: Renjie Tao , Ling Tsou
CPC classification number: G06V30/19173 , G06V20/41 , G06V10/82 , G06V30/15 , G06V20/62 , G06V30/1448 , G06V20/46
Abstract: Methods and systems provide for providing extraction of textual content from video of a communication session. In one embodiment, the system receives video content of a communication session which includes a number of participants. The system then extracts frames from the video content, and classifies the frames of the video content. The system identifies one or more distinguishing frames containing text. For each distinguishing frame containing text, the system detects a title within the frame, crops a title area with the title within the frame, and extracts, via optical character recognition (“OCR”), the title from the cropped title area of the frame. The system extracts, via OCR, textual content from the distinguishing frames containing text, and then transmits the extracted textual content and extracted titles to one or more client devices.
-
公开(公告)号:US20240037941A1
公开(公告)日:2024-02-01
申请号:US18104138
申请日:2023-01-31
Applicant: Zoom Video Communications, Inc.
Inventor: Andrew Miller-Smith , Renjie Tao , Ling Tsou
IPC: G06V20/40 , G06V10/762 , G06V30/19 , G06V20/70
CPC classification number: G06V20/41 , G06V20/49 , G06V10/762 , G06V30/19 , G06V20/70
Abstract: Methods and systems provide for search results within segmented communication session content. In one embodiment, the system receives a transcript and video content of a communication session between participants, the transcript including timestamps for a number of utterances associated with speaking participants; processes the video content to extract textual content visible within the frames of the video content; segments frames of the video content into a number of contiguous topic segments; determines a title for each topic segment; assigns a category label for each topic segment; receives a request from a user to search for specified text within the video content; determines one or more titles or category labels for which a prediction of relatedness with the specified text is present; and presents content from at least one topic segment associated with the one or more titles or category labels for which a prediction of relatedness is present.
-
公开(公告)号:US20230394851A1
公开(公告)日:2023-12-07
申请号:US17832636
申请日:2022-06-04
Applicant: Zoom Video Communications, Inc.
Inventor: Renjie Tao , Ling Tsou
Abstract: Methods and systems provide for providing video frame type classification in a communication session. In one embodiment, the system receives video content of a communication session with a number of participants; extracts frames from the video content; classifies the frames of the video content based on image analysis; and transmits, to one or more client devices, the classification of the frames of the video content.
-
公开(公告)号:US20230394827A1
公开(公告)日:2023-12-07
申请号:US17832637
申请日:2022-06-04
Applicant: Zoom Video Communications, Inc.
Inventor: Renjie Tao , Ling Tsou
CPC classification number: G06V20/46 , G06V40/1347
Abstract: Methods and systems provide title detection for presented slides. In one embodiment, the system receives video content of a communication session with a number of participants; extracts frames from the video content; classifies the frames of the video content; identifies one or more distinguishing frames containing a presentation slide; for each distinguishing frame containing a presentation slide, detects a title within the frame; and transmits, to one or more client devices, the titles for each of the distinguishing frames comprising a presentation slide.
-
公开(公告)号:US20250061713A1
公开(公告)日:2025-02-20
申请号:US18937782
申请日:2024-11-05
Applicant: Zoom Video Communications, Inc.
Inventor: Andrew Miller-Smith , Renjie Tao , Ling Tsou
IPC: G06V20/40 , G06V10/762 , G06V20/70 , G06V30/19
Abstract: Methods and systems provide for video-based and transcript-based segmentation of communication session content. The method may include obtaining a transcript associated with video content of a communication session and performing video-based segmentation on the video content to determine a category label from a list of category labels for each video frame of the video content. The video content may include topic segments of consecutive frames associated with a same category label. The method may further include performing transcript-based segmentation to divide one of the topic segments into a first topic segment associated with a first title and a second topic segment associated with a second title.
-
公开(公告)号:US20240330792A1
公开(公告)日:2024-10-03
申请号:US18228286
申请日:2023-07-31
Applicant: Zoom Video Communications, Inc.
Inventor: Bilung Lee , Vijay Venkataswamy Parthasarathy , Renjie Tao , Bing Zhao
IPC: G06Q10/0631 , H04L51/52
CPC classification number: G06Q10/06311 , H04L51/52
Abstract: Systems and methods for recommending communication channels and generating content for next-step communication are disclosed. A communication analytics platform accesses project metadata and communication data associated with a project. The communication analytics platform determines a recommendation of one or more communication channels for next-step communication for the project based on the project metadata and the communication data. The communication analytics platform generates content for the next-step communication using a generative artificial intelligence (AI) model based on the project metadata and the communication data. The communication analytics platform provides the recommendation of one or more communication channels and the generated content to a user associated with the project.
-
-
-
-
-
-
-
-
-