-
Publication No.: US11631419B2
Publication date: 2023-04-18
Application No.: US17175220
Filing date: 2021-02-12
IPC classes: G10L21/00, G10L17/14, G10L25/21, G10L25/57, G10L25/60, G10L25/84, H04N7/18, H04R1/40, H04R3/00, G06Q50/10
Abstract: A recording device records a video and an imaging time, and a voice. Based on the voice, a sound parameter calculator calculates a sound parameter for specifying magnitude of the voice in a monitoring area at the imaging time for each of pixels and for each of certain times. A sound parameter storage unit stores the sound parameter. A sound parameter display controller superimposes a voice heat map on a captured image of the monitoring area and displays the superimposed image on a monitor. At this time, the sound parameter display controller displays the voice heat map based on a cumulative time value of magnitude of the voice, according to designation of a time range.
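Illustration (not from the patent): the cumulative heat map described in this abstract can be pictured as a per-pixel sum of sound magnitudes over the designated time range. The sketch below assumes the stored sound parameters are available as one 2-D grid per time slot; the function name `cumulative_heat_map` and the data layout are hypothetical.

```python
from typing import List, Sequence

Frame = List[List[float]]  # assumed layout: per-pixel sound magnitude for one time slot


def cumulative_heat_map(frames: Sequence[Frame], times: Sequence[float],
                        start: float, end: float) -> Frame:
    """Sum per-pixel magnitudes over all frames whose time lies in [start, end]."""
    if len(frames) != len(times):
        raise ValueError("frames and times must have the same length")
    if not frames:
        return []
    rows, cols = len(frames[0]), len(frames[0][0])
    total = [[0.0] * cols for _ in range(rows)]
    for frame, t in zip(frames, times):
        if start <= t <= end:  # keep only the designated time range
            for r in range(rows):
                for c in range(cols):
                    total[r][c] += frame[r][c]
    return total
```

The resulting grid would then be color-mapped and overlaid on the captured image; that rendering step is outside this sketch.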
-
Publication No.: US20230098137A1
Publication date: 2023-03-30
Application No.: US17491511
Filing date: 2021-09-30
Inventors: Patrick EHLEN, Victor BARRES
IPC classes: G06F21/62, G10L21/00, G06F40/284, G06F40/205, G10L15/26, G06N20/20
Abstract: A method and apparatus for redacting sensitive information from audio is provided. The method comprises identifying, using a plurality of classifiers, each corresponding to a plurality of sensitive items, a sensitive item (SI) token from a plurality of tokens comprised in a transcribed text of an audio. The SI token corresponds to one of the plurality of sensitive items, each of the plurality of tokens is a transcription of a spoken word in the audio, and each of the plurality of tokens is associated with a corresponding timestamp indicating a chronologic position of the spoken word in the audio. A redaction timespan is determined for the SI token from a first timestamp for the SI token and a second timestamp for a non-SI token immediately after the SI token, and the audio for the redaction timespan is redacted.
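Illustration (not from the patent): the redaction timespan in this abstract runs from the SI token's timestamp to the timestamp of the next non-SI token. A minimal sketch follows, assuming each token carries a start time in seconds and a flag already set by the classifiers. The `Token` type, the merging of consecutive SI tokens into one span, and the use of the audio's end time when no non-SI token follows are assumptions of this sketch.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Token:
    text: str
    start: float      # timestamp of the spoken word, in seconds
    sensitive: bool   # assumed to be set by the upstream classifiers


def redaction_spans(tokens: List[Token], audio_end: float) -> List[Tuple[float, float]]:
    """Return (begin, end) spans covering each run of sensitive tokens."""
    spans = []
    i = 0
    while i < len(tokens):
        if not tokens[i].sensitive:
            i += 1
            continue
        begin = tokens[i].start
        j = i + 1
        while j < len(tokens) and tokens[j].sensitive:
            j += 1  # extend over consecutive sensitive tokens (assumption)
        # End at the next non-sensitive token, or at the end of the audio (assumption).
        end = tokens[j].start if j < len(tokens) else audio_end
        spans.append((begin, end))
        i = j
    return spans
```

Each returned span could then be muted or overwritten with a tone in the audio track.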
-
Publication No.: US11606634B2
Publication date: 2023-03-14
Application No.: US17669633
Filing date: 2022-02-11
Inventors: Hwa-Sung Kim, Dong Hyun Sohn, Ji-Eun Lee
Abstract: A home appliance includes an electrical equipment compartment disposed in an upper portion of the home appliance, and including an upper side that is open, an electrical equipment compartment cover to cover the open upper side of the electrical equipment compartment, and including a speaker hole, a microphone accommodating portion which protrudes upward from an upper side of the electrical equipment compartment cover and including an accommodating space and a front portion that includes microphone holes laterally spaced apart from each other and which face toward a front of the home appliance, a microphone unit including a printed circuit board (PCB) disposed in the accommodation space behind the microphone holes and including microphone chips mounted on the PCB, and a speaker unit disposed in the electrical equipment compartment to correspond to the speaker hole.
-
Publication No.: US20230075670A1
Publication date: 2023-03-09
Application No.: US18055810
Filing date: 2022-11-15
Inventors: Shaofei XUE, Biao TIAN
Abstract: Embodiments of the disclosure provide methods and apparatuses for processing audio data. The method can include: acquiring audio data by an audio capturing device, determining feature information of an enclosure in which the audio capturing device is located, and reverberating the feature information into the audio data.
-
Publication No.: US11538074B1
Publication date: 2022-12-27
Application No.: US17141644
Filing date: 2021-01-05
Applicant: WALGREEN CO.
IPC classes: H04M3/00, H04M5/00, H04L12/66, G06Q30/02, H04M3/42, G06Q50/22, G10L15/22, H04M3/493, G10L15/26, H04M3/51, G10L21/00, G10L25/00
Abstract: The method and system may provide a seamless handoff of user information from a drugstore to a call agent. When a customer communicates with a drugstore device regarding a drugstore-related inquiry, the drugstore device attempts to identify an answer to the drugstore-related inquiry. When the drugstore device does not identify an answer to the drugstore-related inquiry, the drugstore device initiates communication between the customer and a contact center. A transcribed version of the communication may be stored in a database accessible by the contact center along with additional user information for the customer related to the customer's experiences with the drugstore. The user information may be provided to a call agent's contact center device for display and in this manner, the call agent may be made aware of the communication to avoid asking repeat questions and to quickly and efficiently answer the customer's drugstore-related inquiry.
-
Publication No.: US11495215B1
Publication date: 2022-11-08
Application No.: US16710811
Filing date: 2019-12-11
Inventors: Minhua Wu, Shiva Sundaram, Tae Jin Park, Kenichi Kumatani
Abstract: Techniques for speech processing using a deep neural network (DNN) based acoustic model front-end are described. A new modeling approach directly models multi-channel audio data received from a microphone array using a first model (e.g., multi-geometry/multi-channel DNN) that includes a frequency aligned network (FAN) architecture. Thus, the first model may perform spatial filtering to generate a first feature vector by processing individual frequency bins separately, such that multiple frequency bins are not combined. The first feature vector may be used similarly to beamformed features generated by an acoustic beamformer. A second model (e.g., feature extraction DNN) processes the first feature vector and transforms it to a second feature vector having a lower dimensional representation. A third model (e.g., classification DNN) processes the second feature vector to perform acoustic unit classification and generate text data. The DNN front-end enables improved performance despite a reduction in microphones.
-
Publication No.: US11470201B2
Publication date: 2022-10-11
Application No.: US16943994
Filing date: 2020-07-30
Applicant: DELL PRODUCTS L.P.
IPC classes: G10L15/00, G10L25/90, G10L15/06, G10L21/00, H04M1/253, G10L13/00, H04M7/00, H04L65/1063, H04L65/1069
Abstract: Systems and methods are provided that may be implemented in a real time manner by an information handling system (the “client system”) to monitor one or more characteristics of a voice over internet protocol (VOIP) discussion, to use these monitored VOIP characteristics to identify one or more condition/s in real time as they are identified to occur during the current VOIP discussion, and to determine to take one or more automatic actions based on the identified VOIP condition/s so as to inform and/or alert a current human user of the client system to the occurrence of the identified VOIP condition/s as they occur.
-
Publication No.: US11468906B2
Publication date: 2022-10-11
Application No.: US17075325
Filing date: 2020-10-20
Applicant: NAVER CORPORATION
Inventors: Woo-sik Byun, Sang Don Kim
IPC classes: G10L21/00, G06F3/16, G11B27/031
Abstract: A multiple sound source mixing method includes dividing a plurality of sound source data into segments each with a desired length; sequentially inputting sound source data of a corresponding segment for each segment through a desired number of nodes with respect to the plurality of sound source data and mixing the input sound source data into a single piece of sound source data; and concatenating the sound source data mixed for the respective segments.
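Illustration (not from the patent): the divide, mix, and concatenate flow in this abstract can be sketched as below. The sketch assumes each source is a list of float samples at a common sample rate, uses sample-wise averaging as a stand-in mixing operator, and treats missing samples as silence. It does not model the "nodes" mentioned in the abstract, and `mix_in_segments` is a hypothetical name.

```python
from typing import List, Sequence


def mix_in_segments(sources: Sequence[Sequence[float]], segment_len: int) -> List[float]:
    """Mix several sources segment by segment, then concatenate the mixed segments."""
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    if not sources:
        return []
    length = max(len(s) for s in sources)
    mixed: List[float] = []
    for begin in range(0, length, segment_len):
        end = min(begin + segment_len, length)
        segment = []
        for i in range(begin, end):
            # Sources shorter than the longest one contribute silence (assumption).
            total = sum(s[i] if i < len(s) else 0.0 for s in sources)
            segment.append(total / len(sources))  # averaging is an assumed mixing rule
        mixed.extend(segment)  # concatenate the mixed segment onto the output
    return mixed
```

Working segment by segment keeps memory bounded by the segment length rather than the full track length, which is one plausible reason for the segmentation.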
-
Publication No.: US11450321B2
Publication date: 2022-09-20
Application No.: US17023511
Filing date: 2020-09-17
Applicant: Voicify, LLC
Abstract: Among other things, requests are received from voice assistant devices expressed in accordance with different corresponding protocols of one or more voice assistant frameworks. Each of the requests represents a voiced input by a user to the corresponding voice assistant device. The received requests are re-expressed in accordance with a common request protocol. Based on the received requests, responses to the requests are expressed in accordance with a common response protocol. Each of the responses is re-expressed according to a protocol of the framework with respect to which the corresponding request was expressed. The responses are sent to the voice assistant devices for presentation to the users.
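Illustration (not from the patent): the re-expression into and out of a common protocol described in this abstract resembles an adapter pattern. In the sketch below, the two frameworks `"a"` and `"b"`, their payload shapes, and the echo-style business logic are all hypothetical and do not correspond to any real voice assistant API.

```python
from typing import Any, Callable, Dict

Payload = Dict[str, Any]

# Hypothetical inbound adapters: framework-specific request -> common request.
def from_a(raw: Payload) -> Payload:
    return {"framework": "a", "utterance": raw["query"]["text"]}

def from_b(raw: Payload) -> Payload:
    return {"framework": "b", "utterance": raw["input"]}

# Hypothetical outbound adapters: common response -> framework-specific response.
def to_a(common: Payload) -> Payload:
    return {"speech": {"text": common["text"]}}

def to_b(common: Payload) -> Payload:
    return {"output": common["text"]}

INBOUND: Dict[str, Callable[[Payload], Payload]] = {"a": from_a, "b": from_b}
OUTBOUND: Dict[str, Callable[[Payload], Payload]] = {"a": to_a, "b": to_b}


def handle(framework: str, raw: Payload) -> Payload:
    """Normalize the request, answer it once, and re-express the answer."""
    request = INBOUND[framework](raw)                      # common request protocol
    response = {"text": f"You said: {request['utterance']}"}  # common response protocol
    return OUTBOUND[request["framework"]](response)        # back to the originating protocol
```

With this shape, the response logic is written once against the common protocol, and supporting another framework means adding one inbound and one outbound adapter.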
-
Publication No.: US11450109B2
Publication date: 2022-09-20
Application No.: US16653904
Filing date: 2019-10-15
Inventors: Nils B. Lahr, Garrick C. Barr
IPC classes: G06K9/00, G06V20/40, G06F16/78, G06F16/182, G06F16/958, G06F16/583, G06F16/783, G06F16/955, H04N7/173, H04N21/432, H04N21/433, H04N21/442, H04N21/472, H04N21/81, H04N21/8358, H04N21/84, G06T7/215, G06F3/04842, H04L67/02, G06T1/00, G06F3/0482, H04L67/06, H04L67/10, G06K9/62, G10L21/00, H04N21/222, H04N21/239, H04N21/431, H04N21/4627, H04N21/4782
Abstract: Systems and methods for replacing original media bookmarks of at least a portion of a digital media file with replacement bookmarks are described. A media fingerprint engine detects the location of the original fingerprints associated with the portion of the digital media file and a region analysis algorithm characterizes regions of the media file spanning the location of the original bookmarks by data class types. The replacement bookmarks are associated with the data class types and are overwritten or otherwise are substituted for the original bookmarks. The replacement bookmarks then are subjected to a fingerprint matching algorithm that incorporates media timeline and media related metadata.
-