Patent search: ipc:G10L21/00 (page 1)

1.

Granted invention patent
Voice monitoring system and voice monitoring method (in force)

Publication No.: US11631419B2

Publication date: 2023-04-18

Application No.: US17175220

Filing date: 2021-02-12

Applicant: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.

Inventors: Ryota Fujii, Hiroyuki Matsumoto, Hiroaki Hayashi, Kazunori Hayashi

IPC classification: G10L21/00, G10L17/14, G10L25/21, G10L25/57, G10L25/60, G10L25/84, H04N7/18, H04R1/40, H04R3/00, G06Q50/10

Abstract: A recording device records a video and an imaging time, and a voice. Based on the voice, a sound parameter calculator calculates a sound parameter for specifying magnitude of the voice in a monitoring area at the imaging time for each of pixels and for each of certain times. A sound parameter storage unit stores the sound parameter. A sound parameter display controller superimposes a voice heat map on a captured image of the monitoring area and displays the superimposed image on a monitor. At this time, the sound parameter display controller displays the voice heat map based on a cumulative time value of magnitude of the voice, according to designation of a time range.
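
The cumulative heat map described in this abstract can be illustrated with a minimal sketch. It assumes the stored sound parameters are available as a list of (time, grid) pairs, where each grid holds one voice magnitude per pixel. The function name `cumulative_voice_heatmap` and the data layout are hypothetical and not taken from the patent.

```python
def cumulative_voice_heatmap(frames, t_start, t_end):
    """Sum per-pixel voice magnitudes over the designated time range.

    frames: list of (time, grid) pairs; grid is a list of rows of magnitudes.
    This layout is an assumption made for illustration only.
    """
    # Keep only the grids whose time falls inside the designated range.
    selected = [grid for t, grid in frames if t_start <= t <= t_end]
    if not selected:
        return []
    rows, cols = len(selected[0]), len(selected[0][0])
    # Accumulate the magnitude of each pixel across the selected times.
    return [[sum(g[r][c] for g in selected) for c in range(cols)]
            for r in range(rows)]


# Hypothetical data: four time steps over a 2x2 pixel grid.
frames = [
    (0, [[1, 2], [3, 4]]),
    (1, [[10, 20], [30, 40]]),
    (2, [[100, 200], [300, 400]]),
    (3, [[1000, 2000], [3000, 4000]]),
]
heatmap = cumulative_voice_heatmap(frames, t_start=1, t_end=2)
```

The resulting grid could then be color-mapped and overlaid on the captured image; that rendering step is outside this sketch.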

2.

Invention patent application
METHOD AND APPARATUS FOR REDACTING SENSITIVE INFORMATION FROM AUDIO (in force)

Publication No.: US20230098137A1

Publication date: 2023-03-30

Application No.: US17491511

Filing date: 2021-09-30

Applicant: C/o Uniphore Technologies Inc.

Inventors: Patrick EHLEN, Victor BARRES

IPC classification: G06F21/62, G10L21/00, G06F40/284, G06F40/205, G10L15/26, G06N20/20

Abstract: A method and apparatus for redacting sensitive information from audio is provided. The method comprises identifying, using a plurality of classifiers, each corresponding to a plurality of sensitive items, a sensitive item (SI) token from a plurality of tokens comprised in a transcribed text of an audio. The SI token corresponds to one of the plurality of sensitive items, each of the plurality of tokens is a transcription of a spoken word in the audio, and each of the plurality of tokens is associated with a corresponding timestamp indicating a chronologic position of the spoken word in the audio. A redaction timespan is determined for the SI token from a first timestamp for the SI token and a second timestamp for a non-SI token immediately after the SI token, and the audio for the redaction timespan is redacted.
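
The timespan rule in this abstract (start at the SI token's timestamp, end at the timestamp of the next non-SI token) can be sketched as follows. This is only an illustration under stated assumptions: tokens are already flagged as sensitive (the classifiers are not modeled), consecutive SI tokens are merged into one span, a trailing SI run is closed at the end of the audio, and redaction is done by zeroing samples. The names `Token`, `redaction_timespans`, and `redact` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    start: float      # timestamp of the spoken word, in seconds
    sensitive: bool   # assumed to be set by upstream classifiers


def redaction_timespans(tokens, audio_end):
    """Return (start, end) spans covering each run of sensitive tokens."""
    spans, run_start = [], None
    for tok in tokens:
        if tok.sensitive and run_start is None:
            run_start = tok.start                   # first timestamp: the SI token
        elif not tok.sensitive and run_start is not None:
            spans.append((run_start, tok.start))    # second timestamp: next non-SI token
            run_start = None
    if run_start is not None:
        # Assumption: an SI run at the very end is redacted up to the audio end.
        spans.append((run_start, audio_end))
    return spans


def redact(samples, sample_rate, spans):
    """Silence the samples that fall inside each redaction span."""
    out = list(samples)
    for start, end in spans:
        lo = int(start * sample_rate)
        hi = min(len(out), int(end * sample_rate))
        for i in range(lo, hi):
            out[i] = 0
    return out


tokens = [
    Token("my", 0.0, False), Token("card", 0.5, False), Token("is", 1.0, False),
    Token("4111", 1.5, True), Token("1111", 2.0, True), Token("thanks", 3.0, False),
]
spans = redaction_timespans(tokens, audio_end=4.0)
redacted = redact([1] * 8, sample_rate=2, spans=spans)
```

Ending the span at the next non-SI token's timestamp means only word start times are needed, not word durations.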

3.

Granted invention patent
Home appliance having speech recognition function (in force)

Publication No.: US11606634B2

Publication date: 2023-03-14

Application No.: US17669633

Filing date: 2022-02-11

Applicant: Samsung Electronics Co., Ltd.

Inventors: Hwa-Sung Kim, Dong Hyun Sohn, Ji-Eun Lee

IPC classification: G10L21/00, G10L25/00, H04R1/04, G10L15/26, G10L15/28, H04R1/02

Abstract: A home appliance includes an electrical equipment compartment disposed in an upper portion of the home appliance, and including an upper side that is open, an electrical equipment compartment cover to cover the open upper side of the electrical equipment compartment, and including a speaker hole, a microphone accommodating portion which protrudes upward from an upper side of the electrical equipment compartment cover and including an accommodating space and a front portion that includes microphone holes laterally spaced apart from each other and which face toward a front of the home appliance, a microphone unit including a printed circuit board (PCB) disposed in the accommodation space behind the microphone holes and including microphone chips mounted on the PCB, and a speaker unit disposed in the electrical equipment compartment to correspond to the speaker hole.

4.

Invention patent application
METHOD AND APPARATUS FOR AUDIO DATA PROCESSING (in force)

Publication No.: US20230075670A1

Publication date: 2023-03-09

Application No.: US18055810

Filing date: 2022-11-15

Applicant: ALIBABA GROUP HOLDING LIMITED

Inventors: Shaofei XUE, Biao TIAN

IPC classification: G10L15/20, G10L25/84, G10L15/22, G10L21/0232, H04S7/00, G10L21/00

Abstract: Embodiments of the disclosure provide methods and apparatuses for processing audio data. The method can include: acquiring audio data by an audio capturing device, determining feature information of an enclosure in which the audio capturing device is located, and reverberating the feature information into the audio data.

5.

Granted invention patent
Method and system for providing a seamless handoff from a voice channel to a call agent (in force)

Publication No.: US11538074B1

Publication date: 2022-12-27

Application No.: US17141644

Filing date: 2021-01-05

Applicant: WALGREEN CO.

Inventors: Lindsey Kanefsky, Kartik Subramanian, Garima Pokharel

IPC classification: H04M3/00, H04M5/00, H04L12/66, G06Q30/02, H04M3/42, G06Q50/22, G10L15/22, H04M3/493, G10L15/26, H04M3/51, G10L21/00, G10L25/00

Abstract: The method and system may provide a seamless handoff of user information from a drugstore to a call agent. When a customer communicates with a drugstore device regarding a drugstore-related inquiry, the drugstore device attempts to identify an answer to the drugstore-related inquiry. When the drugstore device does not identify an answer to the drugstore-related inquiry, the drugstore device initiates communication between the customer and a contact center. A transcribed version of the communication may be stored in a database accessible by the contact center along with additional user information for the customer related to the customer's experiences with the drugstore. The user information may be provided to a call agent's contact center device for display and in this manner, the call agent may be made aware of the communication to avoid asking repeat questions and to quickly and efficiently answer the customer's drugstore-related inquiry.

6.

Granted invention patent
Deep multi-channel acoustic modeling using frequency aligned network (in force)

Publication No.: US11495215B1

Publication date: 2022-11-08

Application No.: US16710811

Filing date: 2019-12-11

Applicant: Amazon Technologies, Inc.

Inventors: Minhua Wu, Shiva Sundaram, Tae Jin Park, Kenichi Kumatani

IPC classification: G10L21/00, G10L15/16, G10L15/06, G06N3/04, G10L21/0216, G06N3/08

Abstract: Techniques for speech processing using a deep neural network (DNN) based acoustic model front-end are described. A new modeling approach directly models multi-channel audio data received from a microphone array using a first model (e.g., multi-geometry/multi-channel DNN) that includes a frequency aligned network (FAN) architecture. Thus, the first model may perform spatial filtering to generate a first feature vector by processing individual frequency bins separately, such that multiple frequency bins are not combined. The first feature vector may be used similarly to beamformed features generated by an acoustic beamformer. A second model (e.g., feature extraction DNN) processes the first feature vector and transforms it to a second feature vector having a lower dimensional representation. A third model (e.g., classification DNN) processes the second feature vector to perform acoustic unit classification and generate text data. The DNN front-end enables improved performance despite a reduction in microphones.

7.

Granted invention patent
Systems and methods for providing real time assistance to voice over internet protocol (VOIP) users (in force)

Publication No.: US11470201B2

Publication date: 2022-10-11

Application No.: US16943994

Filing date: 2020-07-30

Applicant: DELL PRODUCTS L.P.

Inventors: Chia Hung Shih, Chien Yu Huang, Su Hsuan Chu, Vivek Viswanathan Iyer

IPC classification: G10L15/00, G10L25/90, G10L15/06, G10L21/00, H04M1/253, G10L13/00, H04M7/00, H04L65/1063, H04L65/1069

Abstract: Systems and methods are provided that may be implemented in a real time manner by an information handling system (the “client system”) to monitor one or more characteristics of a voice over internet protocol (VOIP) discussion, to use these monitored VOIP characteristics to identify one or more condition/s in real time as they are identified to occur during the current VOIP discussion, and to determine to take one or more automatic actions based on the identified VOIP condition/s so as to inform and/or alert a current human user of the client system to the occurrence of the identified VOIP condition/s as they occur.

8.

Granted invention patent
Method and system for mixing multiple sound sources (in force)

Publication No.: US11468906B2

Publication date: 2022-10-11

Application No.: US17075325

Filing date: 2020-10-20

Applicant: NAVER CORPORATION

Inventors: Woo-sik Byun, Sang Don Kim

IPC classification: G10L21/00, G06F3/16, G11B27/031

Abstract: A multiple sound source mixing method includes dividing a plurality of sound source data into segments each with a desired length; sequentially inputting sound source data of a corresponding segment for each segment through a desired number of nodes with respect to the plurality of sound source data and mixing the input sound source data into a single piece of sound source data; and concatenating the sound source data mixed for the respective segments.
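
The divide, mix, and concatenate flow in this abstract can be sketched roughly as below. The sketch assumes each source is a plain list of samples and that mixing means averaging aligned samples, with a source that has ended treated as silence. The "desired number of nodes" from the abstract is not modeled, and the function name `mix_sources` is hypothetical.

```python
def mix_sources(sources, segment_len):
    """Mix several sample lists segment by segment, then concatenate.

    Assumption: mixing is a per-sample average over all sources, and a
    source that has already ended contributes silence (0).
    """
    total = max(len(s) for s in sources)
    mixed = []
    for start in range(0, total, segment_len):
        # Cut the same segment out of every source.
        segs = [s[start:start + segment_len] for s in sources]
        seg_n = max(len(seg) for seg in segs)
        # Mix this segment into a single piece of sound source data.
        mixed_seg = [
            sum(seg[i] for seg in segs if i < len(seg)) / len(sources)
            for i in range(seg_n)
        ]
        # Concatenate the mixed segments in order.
        mixed.extend(mixed_seg)
    return mixed


result = mix_sources([[1, 1, 1, 1], [3, 3]], segment_len=2)
```

Working per segment keeps the amount of data handled at once bounded by the segment length rather than by the full length of the sources.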

9.

Granted invention patent
Voice application platform (in force)

Publication No.: US11450321B2

Publication date: 2022-09-20

Application No.: US17023511

Filing date: 2020-09-17

Applicant: Voicify, LLC

Inventors: Robert T. Naughton, Nicholas G. Laidlaw, Alexander M. Dunn, Jeffrey K. McMahon

IPC classification: G10L21/00, G10L25/00, G10L15/22, G10L15/30

Abstract: Among other things, requests are received from voice assistant devices expressed in accordance with different corresponding protocols of one or more voice assistant frameworks. Each of the requests represents a voiced input by a user to the corresponding voice assistant device. The received requests are re-expressed in accordance with a common request protocol. Based on the received requests, responses to the requests are expressed in accordance with a common response protocol. Each of the responses is re-expressed according to a protocol of the framework with respect to which the corresponding request was expressed. The responses are sent to the voice assistant devices for presentation to the users.
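
The request and response re-expression described in this abstract resembles an adapter layer around a common protocol. The sketch below is an illustration only: the framework names `framework_a` and `framework_b`, their message shapes, and all function names are invented for the example and do not correspond to any real voice assistant framework or to the patent's implementation.

```python
def to_common(framework, raw):
    """Re-express a framework-specific request in a common request shape."""
    if framework == "framework_a":          # hypothetical framework
        return {"utterance": raw["query"]}
    if framework == "framework_b":          # hypothetical framework
        return {"utterance": raw["input"]["text"]}
    raise ValueError(f"unknown framework: {framework}")


def handle(common_request):
    """Produce a response in a common response shape (placeholder logic)."""
    return {"speech": f"You said: {common_request['utterance']}"}


def from_common(framework, common_response):
    """Re-express a common response in the originating framework's shape."""
    if framework == "framework_a":
        return {"reply": common_response["speech"]}
    if framework == "framework_b":
        return {"output": {"text": common_response["speech"]}}
    raise ValueError(f"unknown framework: {framework}")


def serve(framework, raw):
    # The response goes back in the protocol the request arrived in.
    return from_common(framework, handle(to_common(framework, raw)))


reply_a = serve("framework_a", {"query": "hello"})
reply_b = serve("framework_b", {"input": {"text": "hello"}})
```

With this shape, supporting another framework would only require two more conversion branches; `handle` stays unchanged.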

10.

Granted invention patent
Systems and methods for generating bookmark video fingerprint (in force)

Publication No.: US11450109B2

Publication date: 2022-09-20

Application No.: US16653904

Filing date: 2019-10-15

Applicant: Synergy Sports Technology, LLC

Inventors: Nils B. Lahr, Garrick C. Barr

IPC classification: G06K9/00, G06V20/40, G06F16/78, G06F16/182, G06F16/958, G06F16/583, G06F16/783, G06F16/955, H04N7/173, H04N21/432, H04N21/433, H04N21/442, H04N21/472, H04N21/81, H04N21/8358, H04N21/84, G06T7/215, G06F3/04842, H04L67/02, G06T1/00, G06F3/0482, H04L67/06, H04L67/10, G06K9/62, G10L21/00, H04N21/222, H04N21/239, H04N21/431, H04N21/4627, H04N21/4782

Abstract: Systems and methods for replacing original media bookmarks of at least a portion of a digital media file with replacement bookmarks are described. A media fingerprint engine detects the location of the original fingerprints associated with the portion of the digital media file and a region analysis algorithm characterizes regions of media file spanning the location of the original bookmarks by data class types. The replacement bookmarks are associated with the data class types and are overwritten or otherwise are substituted for the original bookmarks. The replacement bookmarks then are subjected to a fingerprint matching algorithm that incorporates media timeline and media related metadata.
