REVERSIBLE SPEECH-TO-SPEECH TRANSLATION FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS

    Publication No.: US20240428020A1

    Publication Date: 2024-12-26

    Application No.: US18212408

    Filing Date: 2023-06-21

    Abstract: Disclosed are apparatuses, systems, and techniques that may use machine learning for reversible translations of speech utterances. The techniques include training and using duplex neural networks (NNs) having a first subnetwork and a second subnetwork that are mirror images of each other. Training data for training the duplex NNs may include a target output that includes a first speech utterance in a first language, a first training input that includes the target output distorted by a first noise, and a second training input that includes a second speech utterance in a second language. The duplex NNs may be trained to identify, using the first training input and the second training input, at least one of the target output or the first noise.
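
The training data layout described in the abstract can be illustrated with a minimal sketch. It is not the method claimed in the application. It assumes that utterances are already encoded as lists of floating-point features and that the noise is additive and Gaussian, neither of which the abstract states. The names TrainingExample, make_training_example, and noise_std are hypothetical and chosen only for illustration.

```python
import random
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TrainingExample:
    """One training example, following the abstract's three components."""
    target_output: List[float]  # first utterance, first language (clean)
    first_input: List[float]    # target output distorted by the noise
    second_input: List[float]   # second utterance, second language
    noise: List[float]          # the noise itself, an alternative training target


def make_training_example(
    first_lang: List[float],
    second_lang: List[float],
    noise_std: float = 0.1,      # assumed noise scale, not from the source
    seed: Optional[int] = None,
) -> TrainingExample:
    """Build one example; additive Gaussian noise is an assumption."""
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, noise_std) for _ in first_lang]
    noisy = [x + n for x, n in zip(first_lang, noise)]
    return TrainingExample(
        target_output=list(first_lang),
        first_input=noisy,
        second_input=list(second_lang),
        noise=noise,
    )


# Toy feature vectors standing in for encoded speech in two languages.
example = make_training_example([0.2, -0.1, 0.4], [0.3, 0.0, -0.2, 0.1], seed=0)
```

Under these assumptions, a model trained on such examples would receive first_input and second_input and be asked to recover target_output, noise, or both, which corresponds to the abstract's "at least one of the target output or the first noise".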
