GENERATING TRANSCRIPTIONS OF AUDIO DATA FOR PRESENTATION AT A CLIENT DEVICE

    公开(公告)号:US20250166633A1

    公开(公告)日:2025-05-22

    申请号:US18511095

    申请日:2023-11-16

    Applicant: Google LLC

    Abstract: Systems and methods for generating transcriptions of audio data for presentation at a client device are provided. One or more audio streams, provided by one or more audio sources of one or more client devices of a plurality of users, are received by a broadcasting system. Sensory modality information comprising one or more of auditory, visual, or haptic characteristics of a first user is determined. First audio data from the one or more audio streams corresponding to the first user and additional audio data from the one or more audio streams corresponding to other users are determined using one or more machine learning models. At least one of a first transcription of the first audio data or one or more additional transcription of one or more of the additional audio data are provided for presentation at a first client device according to the auditory, visual, or haptic characteristics.

Patent Agency Ranking