SYSTEM AND METHOD FOR TRANSCRIPTION WORKFLOW
    Invention publication

    Publication number: US20240021204A1

    Publication date: 2024-01-18

    Application number: US17664536

    Application date: 2022-05-23

    CPC classification number: G10L15/32 G10L15/30 G06F16/685 G10L15/22 G10L15/16

    Abstract: Systems, methods, and computer-readable storage media for making assignments to different speech-to-text engines based on previous transcription scores. An exemplary system can train a model by receiving a first digital audio recording, randomly assigning speech-to-text engines to transcribe the first digital audio recording, scoring the resulting transcriptions, and scoring the engines based on their performance. The system can then generate a model for selecting a speech-to-text engine from among the speech-to-text engines. When a second digital audio recording is received, the system can assign, by executing the model, at least one selected speech-to-text engine from the speech-to-text engines to transcribe the second digital audio recording.
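
    Illustrative sketch: The train-then-select flow described in this abstract might look roughly like the following. This is a minimal sketch, not the patented method: the engine names, the toy scoring function, and the "best average past score" rule are all assumptions made for illustration, and a real model would likely also condition on features of the recording.

```python
import random
from collections import defaultdict
from statistics import mean


def collect_training_scores(recordings, engines, score_fn, rng, per_recording=2):
    """Randomly assign engines to each training recording and log their scores."""
    history = defaultdict(list)
    for recording in recordings:
        # Random assignment: each recording goes to a random subset of engines.
        for engine in rng.sample(engines, k=per_recording):
            history[engine].append(score_fn(engine, recording))
    return history


def build_selector(history):
    """Return a simple 'model': rank engines by their average past score."""
    averages = {engine: mean(scores) for engine, scores in history.items()}

    def select(top_n=1):
        ranked = sorted(averages, key=averages.get, reverse=True)
        return ranked[:top_n]

    return select


# Hypothetical engines and a toy scoring function (stand-ins, not from the source).
ENGINES = ["engine_a", "engine_b", "engine_c"]
TOY_QUALITY = {"engine_a": 0.70, "engine_b": 0.90, "engine_c": 0.55}


def toy_score(engine, recording):
    # A real scorer would compare the transcription against a reference.
    return TOY_QUALITY[engine]


rng = random.Random(0)
history = collect_training_scores(["rec1", "rec2", "rec3", "rec4"], ENGINES, toy_score, rng)
select_engine = build_selector(history)

# Engine(s) chosen for a newly received (second) recording.
best = select_engine()
```

    Here `best` holds the engine with the highest average score among the engines that were actually sampled during training.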

    METHOD, SERVICE SERVER AND COMPUTER-READABLE MEDIUM FOR MATCHING MUSIC USAGE LOG AND COPYRIGHT HOLDER

    Publication number: US20240012855A1

    Publication date: 2024-01-11

    Application number: US18346930

    Application date: 2023-07-05

    CPC classification number: G06F16/683 G06F16/686 G06F21/6218

    Abstract: The present invention relates to a method, a service server, and a computer-readable medium for matching a music usage log (cue sheet) and a copyright holder. The service server includes a master DB including a plurality of right logs. It receives a usage log for music usage from a streaming server, derives a preliminary right log by preprocessing the usage log, and calculates a music name matching rate, an artist name matching rate, and an album name matching rate based on the preliminary right log and the right log in order to match the preliminary right log and the right log. It then adds the record value of the preliminary right log to the cumulative record value of the matched right log.
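
    Illustrative sketch: The matching step described in this abstract could be approximated as below. This is a sketch under stated assumptions: the abstract does not say how the matching rates are computed or combined, so the use of `difflib.SequenceMatcher`, the simple average of the three rates, the 0.8 threshold, and all field names are hypothetical.

```python
from difflib import SequenceMatcher

FIELDS = ("music", "artist", "album")


def normalize(text):
    """Preprocess a field: lowercase and collapse whitespace."""
    return " ".join(text.lower().split())


def matching_rate(a, b):
    """Similarity in [0, 1] between two normalized strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def match_usage_log(usage_log, master_db, threshold=0.8):
    """Match a usage log to a right log and accumulate its record value."""
    # Preprocessing yields the 'preliminary right log'.
    preliminary = {field: normalize(usage_log[field]) for field in FIELDS}
    best_log, best_rate = None, 0.0
    for right_log in master_db:
        rates = [matching_rate(preliminary[f], right_log[f]) for f in FIELDS]
        combined = sum(rates) / len(rates)  # assumed combination rule
        if combined > best_rate:
            best_log, best_rate = right_log, combined
    if best_log is not None and best_rate >= threshold:
        best_log["cumulative_record"] += usage_log["record_value"]
        return best_log
    return None


# Hypothetical master DB and usage log.
master_db = [
    {"music": "Blue Sky", "artist": "The Examples", "album": "First Album", "cumulative_record": 10},
    {"music": "Red Road", "artist": "Another Band", "album": "Second Album", "cumulative_record": 4},
]
usage_log = {"music": "  blue  sky", "artist": "the examples", "album": "First Album", "record_value": 3}
matched = match_usage_log(usage_log, master_db)
```

    After this call, the first right log's cumulative record value has grown by the usage log's record value, and the second is unchanged.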

    METHODS FOR CONFORMING AUDIO AND SHORT-FORM VIDEO

    Publication number: US20230421841A1

    Publication date: 2023-12-28

    Application number: US17850470

    Application date: 2022-06-27

    CPC classification number: H04N21/4394 G06F16/683 H04N21/8456

    Abstract: Systems and methods are provided herein for conforming audio to a video to avoid discordance. This may be accomplished by a system receiving a video and selection of an audio asset. The system may identify a plurality of break points in the audio asset based on one or more characteristics of the audio asset. A first portion of the audio asset may be generated based on one or more characteristics of the received video (e.g., length of the video), wherein the first portion of the audio asset begins and/or ends at a break point of the plurality of break points. The system may then generate a media item comprising the video and the first portion of the audio asset.
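
    Illustrative sketch: One simple way to pick an audio portion that fits a video is to choose the pair of break points whose span is closest to the video length. This is only a sketch: the break-point times are hypothetical, how break points are detected is not shown, and requiring both the start and the end to fall on a break point is an assumption (the abstract allows either).

```python
from itertools import combinations


def choose_audio_portion(break_points, video_length):
    """Return (start, end) break points whose span best matches the video length."""
    candidates = combinations(sorted(break_points), 2)
    return min(candidates, key=lambda span: abs((span[1] - span[0]) - video_length))


# Hypothetical break points (seconds into the audio asset).
break_points = [0.0, 7.5, 15.0, 31.0, 46.5, 60.0]
portion = choose_audio_portion(break_points, video_length=15.0)
```

    With these values, `portion` is `(0.0, 15.0)`, the only pair whose span equals the 15-second video length.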

    AUTOMATED REMOTE MUSIC IDENTIFICATION DEVICE AND SYSTEM

    Publication number: US20230419927A1

    Publication date: 2023-12-28

    Application number: US17847029

    Application date: 2022-06-22

    Applicant: Deepen Shah

    Inventor: Deepen Shah

    CPC classification number: G10H1/0008 G06F16/634 G06F16/683 G10H2240/141

    Abstract: A sound recording device and system able to automatically record and transmit sound data corresponding to a preset recording interval time and match the sound data to specific sounds and songs. A sound recording device includes a housing, a processor, a microphone, and cellular and wireless communication devices. The processor includes an interval timer module and a location module. The interval timer module is configured to count down a predetermined time interval corresponding to a recording length of time during which the microphone is set to record at least one input sound signal. The location module is configured to determine or receive location data and store it in a location database. The wireless communication device is configured to receive the at least one input sound signal from the processor and transmit the at least one sound signal and the location data to a network or computer device.

    Automated speech-to-text processing and analysis of call data apparatuses, methods and systems

    Publication number: US11838440B2

    Publication date: 2023-12-05

    Application number: US17676676

    Application date: 2022-02-21

    CPC classification number: H04M3/2218 G06F16/64 G06F16/685 G06F16/687 G10L15/26

    Abstract: The present invention discloses a system, apparatus, and method that obtains audio and metadata information from voice calls, generates textual transcripts from those calls, and makes the resulting data searchable via a user interface. The system converts audio data from one or more sources (such as a telecommunications provider) into searchable, usable text transcripts. One use is law enforcement and intelligence work. Another use is in call centers, to improve quality and track customer service history. Searches can be performed for callers, callees, keywords, and/or other information in calls across the system. The system can also generate automatic alerts based on callers, callees, keywords, phone numbers, and/or other information. Further, the system generates and provides analytic information on the use of the phone system, the semantic content of the calls, and the connections between callers and phone numbers called, which can aid analysts in detecting patterns of behavior and in looking for patterns of equipment use or failure.
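
    Illustrative sketch: Keyword search and alerting over call transcripts, as described in this abstract, could be prototyped with an inverted index. This is a minimal sketch, not the disclosed system: the call records, phone numbers, field names, and the simple word tokenizer are all hypothetical, and speech-to-text conversion is assumed to have already happened.

```python
import re
from collections import defaultdict

WORD = re.compile(r"[a-z0-9']+")


def build_index(calls):
    """Map each transcript word to the set of call IDs that contain it."""
    index = defaultdict(set)
    for call_id, call in calls.items():
        for word in WORD.findall(call["transcript"].lower()):
            index[word].add(call_id)
    return index


def search(index, keyword):
    """Return the sorted IDs of calls whose transcript contains the keyword."""
    return sorted(index.get(keyword.lower(), set()))


def alerts(calls, watched_keywords, watched_numbers):
    """Return IDs of calls that mention a watched keyword or involve a watched number."""
    hits = []
    for call_id, call in sorted(calls.items()):
        words = set(WORD.findall(call["transcript"].lower()))
        parties = {call["caller"], call["callee"]}
        if words & watched_keywords or parties & watched_numbers:
            hits.append(call_id)
    return hits


# Hypothetical call records (already transcribed).
calls = {
    "c1": {"caller": "555-0100", "callee": "555-0199", "transcript": "Please refund my order"},
    "c2": {"caller": "555-0123", "callee": "555-0100", "transcript": "The router keeps failing"},
}
index = build_index(calls)
refund_calls = search(index, "refund")
flagged = alerts(calls, watched_keywords={"failing"}, watched_numbers={"555-0199"})
```

    In this example, `refund_calls` contains only `c1`, while `flagged` contains both calls: `c1` because of the watched callee number and `c2` because of the watched keyword.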

    CONTENT PLAYBACK SYSTEM
    Invention publication

    Publication number: US20230376530A1

    Publication date: 2023-11-23

    Application number: US18247915

    Application date: 2021-10-27

    Inventor: Stephen Robbins

    CPC classification number: G06F16/639 G06F16/683 G06F16/686 G06F16/7867

    Abstract: A content playback system is described having a local media store configured to store a plurality of media files and a playback unit configured to play the stored media files. The system further has a metadata extraction unit configured to extract metadata for each of the plurality of media files stored in the local media store, and a remote server. The remote server is configured to receive the extracted metadata from the metadata extraction unit and, based on the extracted metadata and a media database available to the remote server, generate a user database including identification information of media items contained in the plurality of media files stored in the local media store. The system is further configured to provide a user interface for interacting with the user database. This may enable a user to browse content stored in the local media store via the generated user interface.
