METHOD, APPARATUS, AND PROGRAM FOR IMPROVING RECOGNITION ACCURACY OF ACOUSTIC DATA

    Publication No.: US20250029631A1

    Publication Date: 2025-01-23

    Application No.: US18711933

    Application Date: 2022-11-11

    Applicant: COCHL INC

    Abstract: A method of improving recognition accuracy of acoustic data is disclosed. The method may include configuring one or more acoustic frames based on acoustic data, processing each of the one or more acoustic frames as an input of an acoustic recognition model to output predicted values corresponding to each acoustic frame, identifying one or more recognized acoustic frames through threshold analysis based on the predicted values corresponding to each acoustic frame, identifying a converted acoustic frame through time series analysis based on the one or more recognized acoustic frames, and converting a predicted value corresponding to the converted acoustic frame.
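
The threshold-then-time-series steps in this abstract can be illustrated with a minimal sketch. The abstract does not say how the time series analysis selects frames to convert, so the rule used here is an assumption: a short run of unrecognized frames flanked on both sides by recognized frames is treated as a missed detection and its predicted values are raised to the threshold. The names `recognized_mask`, `smooth_predictions`, and `max_gap` are hypothetical.

```python
from typing import List, Tuple


def recognized_mask(predictions: List[float], threshold: float) -> List[bool]:
    """Threshold analysis: a frame is 'recognized' if its score reaches the threshold."""
    return [p >= threshold for p in predictions]


def smooth_predictions(
    predictions: List[float], threshold: float = 0.5, max_gap: int = 1
) -> Tuple[List[float], List[int]]:
    """Assumed time series rule: fill short gaps between recognized frames.

    Returns the converted predictions and the indices of converted frames.
    """
    mask = recognized_mask(predictions, threshold)
    converted = list(predictions)
    converted_idx: List[int] = []
    n = len(mask)
    i = 0
    while i < n:
        if mask[i]:
            i += 1
            continue
        # Find the end of this run of unrecognized frames: [i, j).
        j = i
        while j < n and not mask[j]:
            j += 1
        flanked = i > 0 and j < n  # recognized frames on both sides
        if flanked and (j - i) <= max_gap:
            for k in range(i, j):
                converted[k] = threshold  # convert the predicted value
                converted_idx.append(k)
        i = j
    return converted, converted_idx


scores = [0.9, 0.2, 0.8, 0.1, 0.1, 0.7]
new_scores, changed = smooth_predictions(scores, threshold=0.5, max_gap=1)
print(new_scores, changed)
```

With `max_gap=1`, only the single low frame at index 1 is converted; the two-frame gap at indices 3 and 4 is left as-is.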

    AUTHENTICATION APPARATUS, AUTHENTICATION METHOD, AND RECORDING MEDIUM

    Publication No.: US20250029619A1

    Publication Date: 2025-01-23

    Application No.: US18686492

    Application Date: 2021-09-08

    Abstract: An authentication apparatus includes: a calculation unit that calculates, from an air conduction sound signal indicating an air conduction sound of a voice of a target person and a bone conduction sound signal indicating a bone conduction sound of the voice of the target person, an air conduction feature quantity that is a feature quantity of the air conduction sound signal and a bone conduction feature quantity that is a feature quantity of the bone conduction sound signal, and that calculates a target feature quantity that is a feature quantity of the voice of the target person by combining the air conduction feature quantity and the bone conduction feature quantity; and an authentication unit that authenticates the target person on the basis of the target feature quantity.
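
As a rough illustration of the calculation and authentication units, the sketch below combines two already-extracted feature vectors and compares the result with an enrolled template. The abstract only says the features are combined; concatenation, cosine similarity, the `threshold` value, and all function names here are assumptions, and feature extraction from the raw signals is not shown.

```python
import math
from typing import List, Sequence


def combine_features(air: Sequence[float], bone: Sequence[float]) -> List[float]:
    """Assumed combination rule: concatenate the two feature vectors."""
    return list(air) + list(bone)


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def authenticate(
    air: Sequence[float],
    bone: Sequence[float],
    enrolled: Sequence[float],
    threshold: float = 0.8,
) -> bool:
    """Accept the target person if the combined feature is close to the enrolled one."""
    target = combine_features(air, bone)
    if len(target) != len(enrolled):
        raise ValueError("enrolled template has a different dimension")
    return cosine_similarity(target, enrolled) >= threshold


enrolled_template = [1.0, 0.0, 0.0, 1.0]
print(authenticate([1.0, 0.0], [0.0, 1.0], enrolled_template))
print(authenticate([0.0, 1.0], [1.0, 0.0], enrolled_template))
```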

    METHOD FOR OPERATING A SPEECH DIALOGUE SYSTEM

    Publication No.: US20250029610A1

    Publication Date: 2025-01-23

    Application No.: US18708454

    Application Date: 2022-10-14

    Abstract: A method for operating a speech dialogue system involves determining a vehicle context and information relating to the vehicle context and checking whether the information has a validity duration shorter than a predefined reference value and is thus to be graded as urgent. If the information is urgent, it is further checked whether the information has a validity value exceeding a predetermined threshold value for the user. If the threshold value is also exceeded, a speech output adjusted to the current communication status and directed towards a vehicle user is automatically carried out.
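
The two-stage check described in this abstract can be sketched as a small predicate. The field names, units (seconds), and numeric values below are hypothetical; the abstract does not say how the validity duration or validity value is computed, and the adjustment of the speech output to the communication status is not modeled.

```python
from dataclasses import dataclass


@dataclass
class ContextInfo:
    message: str
    validity_duration_s: float  # how long the information stays valid
    validity_value: float       # value of the information for the user


def should_announce(
    info: ContextInfo, reference_duration_s: float, user_threshold: float
) -> bool:
    """Announce only if the information is urgent AND valuable enough for the user."""
    is_urgent = info.validity_duration_s < reference_duration_s
    if not is_urgent:
        return False
    return info.validity_value > user_threshold


info = ContextInfo("Free parking space ahead", validity_duration_s=20.0, validity_value=0.8)
print(should_announce(info, reference_duration_s=60.0, user_threshold=0.5))
```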

    DETECTING AND ASSIGNING ACTION ITEMS TO CONVERSATION PARTICIPANTS IN REAL-TIME AND DETECTING COMPLETION THEREOF

    Publication No.: US20250029609A1

    Publication Date: 2025-01-23

    Application No.: US18908467

    Application Date: 2024-10-07

    Abstract: Described herein is a system for automatically detecting and assigning action items in a real-time conversation and determining whether such action items have been completed. The system detects, during a meeting, a plurality of action items and an utterance that corresponds to a completed action item. Responsive to detecting the utterance, the system generates a similarity score with respect to a first action item of the plurality of action items. The system compares the similarity score to a first threshold. Responsive to determining that the similarity score does not exceed the first threshold, the system generates a second similarity score with respect to a second action item of the plurality of action items. The system compares the second similarity score to a second threshold, which exceeds the first threshold. Responsive to determining that the second similarity score exceeds the second threshold, the system marks the second action item as completed.
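
The escalating-threshold comparison can be sketched as follows. The abstract does not specify the similarity measure, so a token-level Jaccard overlap stands in for it here; that choice, the function names, and the threshold values are assumptions. The abstract also does not say what happens when the first score does exceed the first threshold; this sketch assumes the first item is then the match.

```python
from typing import Callable, Optional


def jaccard(a: str, b: str) -> float:
    """Stand-in similarity: overlap of lowercase word sets."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def match_completed_item(
    utterance: str,
    first_item: str,
    second_item: str,
    first_threshold: float,
    second_threshold: float,
    similarity: Callable[[str, str], float] = jaccard,
) -> Optional[str]:
    """Return the action item the utterance completes, or None."""
    if second_threshold <= first_threshold:
        raise ValueError("second threshold must exceed the first")
    if similarity(utterance, first_item) > first_threshold:
        return first_item  # assumed behavior, not stated in the abstract
    # First item did not match: the second item must clear a stricter bar.
    if similarity(utterance, second_item) > second_threshold:
        return second_item
    return None


done = match_completed_item(
    "i sent the budget report",
    "book the meeting room",
    "send the budget report",
    first_threshold=0.3,
    second_threshold=0.4,
)
print(done)
```

Here the utterance shares only "the" with the first item, so the first comparison fails and the second item is checked against the higher threshold.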

    APPARATUS FOR VOICE RECOGNITION AND METHOD THEREOF

    Publication No.: US20250029602A1

    Publication Date: 2025-01-23

    Application No.: US18512252

    Application Date: 2023-11-17

    Inventor: Sung Soo Park

    Abstract: In embodiments, a voice recognition apparatus, and a method thereof, includes a microphone that extracts an utterance of a user, a memory that stores a scenario matching intent extracted from the utterance, and a processor that searches for the scenario based on the utterance and performs a voice recognition function. The processor can extract a first intent from a first utterance and extract a second intent from a second utterance. The processor can separate the first intent and the second intent into partial intent units by using separators, and generate a final intent by combining partial intents of the first intent and the second intent such that duplicate partial intents are deleted depending on definitions of the separators.
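
A minimal sketch of the split-and-merge step, assuming intents are dot-separated strings and that a duplicate partial intent is simply dropped on its second occurrence. The abstract says deletion depends on the definitions of the separators, which it does not spell out, so the single `.` separator, the intent strings, and the function name are all hypothetical.

```python
from typing import List


def merge_intents(first_intent: str, second_intent: str, sep: str = ".") -> str:
    """Split both intents into partial intents and join them without duplicates."""
    merged: List[str] = []
    seen = set()
    for part in first_intent.split(sep) + second_intent.split(sep):
        if part and part not in seen:  # skip empty and already-seen partial intents
            seen.add(part)
            merged.append(part)
    return sep.join(merged)


final_intent = merge_intents("navigation.search", "search.restaurant")
print(final_intent)  # navigation.search.restaurant
```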

    METHOD AND APPARATUS FOR TRAINING ENCODER

    Publication No.: US20250029599A1

    Publication Date: 2025-01-23

    Application No.: US18635857

    Application Date: 2024-04-15

    Inventor: Sung Woong HWANG

    Abstract: A method and an apparatus for training a speech transformation model are provided. The method and the apparatus are capable of generating natural speech suited to the context and improving pronunciation accuracy by training a first encoder (e.g., an encoder of the flow-based model) and a second encoder (e.g., an encoder of the Tacotron 2 model) in parallel.
