-
公开(公告)号:US20250030930A1
公开(公告)日:2025-01-23
申请号:US18356006
申请日:2023-07-20
Applicant: OPEN TEXT HOLDINGS, INC.
Inventor: Amit Kumar Gupta , Sree Harsha Vardhan Reddy Munagala
IPC: H04N21/845 , G10L15/26
Abstract: Systems and methods for the design, deployment and utilization of targeted multimedia communications based upon audiences are disclosed. More specifically, embodiments may allow the targeting of communications to users in multiple media formats from the same multimedia communication templates and the delivery of such communications to users through multiple communication channels.
-
公开(公告)号:US20250029631A1
公开(公告)日:2025-01-23
申请号:US18711933
申请日:2022-11-11
Applicant: COCHL INC
Inventor: Yoonchang HAN , Jeongsoo PARK , Subin LEE , Ilyoung JEONG , Hyungui LIM , Donmoon LEE
IPC: G10L25/51
Abstract: In an embodiment of the present invention for solving the above-described problem, a method of improving recognition accuracy of acoustic data is disclosed. The method may include configuring one or more acoustic frames based on acoustic data, processing each of the one or more acoustic frames as an input of an acoustic recognition model to output predicted values corresponding to each acoustic frame, identifying one or more recognized acoustic frames through threshold analysis based on the predicted values corresponding to each acoustic frame, identifying a converted acoustic frame through time series analysis based on the one or more recognized acoustic frames, and converting a predicted value corresponding to the converted acoustic frame.
-
3.
公开(公告)号:US20250029627A1
公开(公告)日:2025-01-23
申请号:US18908353
申请日:2024-10-07
Inventor: Huanbin ZOU , Zhicheng LI , Jun ZHAO
IPC: G10L21/0232 , G10L21/0264 , G10L25/18 , G10L25/60
Abstract: Embodiments of the present disclosure disclose a method and an apparatus for processing audio data, a device, and a storage medium, applied to a cloud server in cloud technologies. The method includes: obtaining original noise audio data to be processed and a target scenario parameter associated with the original noise audio data; determining, based on the target scenario parameter, a target noise reduction strength parameter for noise reduction processing; and performing noise reduction processing on the original noise audio data based on the target noise reduction strength parameter, to obtain target enhanced audio data.
-
公开(公告)号:US20250029619A1
公开(公告)日:2025-01-23
申请号:US18686492
申请日:2021-09-08
Applicant: NEC Corporation
Inventor: Ling GUO , Hitoshi YAMAMOTO
Abstract: An authentication apparatus includes: a calculation unit that calculates, from an air conduction sound signal indicating an air conduction sound of a voice of a target person and a bone conduction sound signal indicating a bone conduction sound of the voice of the target person, an air conduction feature quantity that is a feature quantity of the air conduction sound signal and a bone conduction feature quantity that is a feature quantity of the bone conduction sound signal, and that calculates a target feature quantity that is a feature quantity of the voice of the target person by combining the air conduction feature quantity and the bone conduction feature quantity; and an authentication unit that authenticates the target person on the basis of the target feature quantity.
-
公开(公告)号:US20250029610A1
公开(公告)日:2025-01-23
申请号:US18708454
申请日:2022-10-14
Applicant: MERCEDES-BENZ GROUP AG
Inventor: Ute EHRLICH , Jakob ZIMMERMANN
IPC: G10L15/22 , G10L13/033
Abstract: A method for operating a speech dialogue system involves determining a vehicle context and information relating to the vehicle context and checking whether the information has a validity duration shorter than a predefined reference value and is thus to be graded as urgent. If the information is urgent, it is further checked whether the information has a validity value exceeding a predetermined threshold value for the user. If the threshold value is also exceeded, a speech output adjusted to the current communication status and directed towards a vehicle user is automatically carried out.
-
公开(公告)号:US20250029609A1
公开(公告)日:2025-01-23
申请号:US18908467
申请日:2024-10-07
Applicant: Outreach Corporation
Inventor: Rohit Ganpat Mane , Abhishek Abhishek , Krishnamohan Reddy Nareddy , Rajiv Garg
Abstract: Described herein is a system for automatically detecting and assigning action items in a real-time conversation and determining whether such action items have been completed. The system detects, during a meeting, a plurality of action items and an utterance that corresponds to a completed action item. Responsive to detecting the utterance, the system generates a similarity score with respect to a first action item of the plurality of action items. The system compares the similarity score to a first threshold. Responsive to determining that the similarity score does not exceed the first threshold, the system generates a second similarity score with respect to a second action item of the plurality of action items. The system compares the second similarity score to a second threshold, which exceeds the first threshold. Responsive to determining that the second similarity score exceeds the second threshold, the system marks the second action item as completed.
-
公开(公告)号:US20250029602A1
公开(公告)日:2025-01-23
申请号:US18512252
申请日:2023-11-17
Applicant: Hyundai Motor Company , Kia Corporation
Inventor: Sung Soo Park
IPC: G10L15/18
Abstract: In embodiments, a voice recognition apparatus, and a method thereof, includes a microphone that extracts an utterance of a user, a memory that stores a scenario matching intent extracted from the utterance, and a processor that searches for the scenario based on the utterance and performs a voice recognition function. The processor can extract a first intent from a first utterance and extract a second intent from a second utterance. The processor can separate the first intent and the second intent into partial intent units by using separators, and generate a final intent by combining partial intents of the first intent and the second intent such that duplicate partial intents are deleted depending on definitions of the separators.
-
公开(公告)号:US20250029599A1
公开(公告)日:2025-01-23
申请号:US18635857
申请日:2024-04-15
Applicant: HYUNDAI MOTOR COMPANY , KIA CORPORATION
Inventor: Sung Woong HWANG
Abstract: A method and an apparatus for training a speech transformation model are provided. The method and the apparatus are capable of generating a natural speech suitable for context and improving accuracy of pronunciation by training the first encoder (e.g., a encoder of the flow-based model) and the second encoder (e.g., a encoder of the Tacotron 2 model) in parallel.
-
公开(公告)号:US20250029591A1
公开(公告)日:2025-01-23
申请号:US18906743
申请日:2024-10-04
Applicant: Sound United, LLC
Inventor: Bradley M. Starobin , Matthew Lyons , Stuart W. Lumsden , Michael DiTullo
Abstract: A system and method for quieting unwanted sound. As a non-limiting example, various aspects of this disclosure provide a system and method, for example implemented in a premises-based or home audio system, for quieting unwanted sound at a particular location.
-
公开(公告)号:US20250029581A1
公开(公告)日:2025-01-23
申请号:US18905447
申请日:2024-10-03
Applicant: YAMAHA CORPORATION
Inventor: Yoshimasa ISOZAKI , Katsunori SUZUKI , Takahiro TERADA , Takashi MORI , Jun ISHII , Ibuki HANDA , Takuya FUJISHIMA , Koji YATAKA , Yukio WAKUI , Soichi TAKIGAWA , Akira MAEZAWA , Haruki OHKAWA , Yoshikatsu MATSUBARA , Yasuhiko OBA , Fukutaro OKUYAMA , Tomoya SASAKI , Rei FURUKAWA
Abstract: A control device in one embodiment includes a first transmission unit, a first receiving unit, and a first generation unit. The first transmission unit is configured to transmit first performance data including contents of playing a keyboard instrument at a first communication base to a second communication base. The first receiving unit is configured to receive second performance data from the second communication base. The first generation unit is configured to generate a drive signal to produce a sound in accordance with the second performance data and output the drive signal to a sound generation device at the first communication base. At least one of the first performance data and the second performance data includes a key position signal indicating a key press amount on the keyboard instrument.
-
-
-
-
-
-
-
-
-