Patent search: ipc:G10L25/48 (page 2)

11.

Granted invention patent
Automatic audio ducking with real time feedback based on fast integration of signal levels [In force]

Publication No.: US11327710B2

Publication date: 2022-05-10

Application No.: US16832883

Filing date: 2020-03-27

Applicant: Adobe Inc.

Inventors: Nico Becherer, Sven Duwenhorst

IPC classification: G06F3/16, G10L25/27, G10L25/21, G10L25/48

Abstract: A computer-implemented method for audio signal processing includes analyzing a foreground audio signal to determine metrics corresponding to audio slices of the foreground audio signal. Each such metric indicates a value for an audio property of a respective audio slice. The method further includes computing a total metric for an audio slice as a function of a set of the metrics corresponding to a set of the audio slices including the audio slice. The method further includes adding a key frame to a track based on the total metric. The track includes the foreground audio signal and a background audio signal, and a location of the key frame corresponds to a location of the audio slice on the track. The key frame indicates a change to the audio property of the background audio signal at the location on the track, and the key frame is utilizable for audio ducking.
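The slice-metric and key-frame steps in this abstract can be illustrated with a minimal sketch. It is not the patented method: the choice of RMS level as the audio property, the trailing-window mean as the "total metric", the threshold rule, and all function names (`slice_rms`, `total_metrics`, `ducking_keyframes`) are assumptions made for illustration.

```python
import math

def slice_rms(samples, slice_len):
    """Split samples into fixed-length slices and return the RMS level of each."""
    levels = []
    for start in range(0, len(samples), slice_len):
        chunk = samples[start:start + slice_len]
        levels.append(math.sqrt(sum(x * x for x in chunk) / len(chunk)))
    return levels

def total_metrics(levels, window):
    """Total metric per slice: mean level over a trailing window ending at that slice."""
    totals = []
    for i in range(len(levels)):
        lo = max(0, i - window + 1)
        totals.append(sum(levels[lo:i + 1]) / (i + 1 - lo))
    return totals

def ducking_keyframes(totals, slice_len, threshold, ducked_gain_db=-12.0):
    """Emit (sample_position, background_gain_db) key frames at threshold crossings."""
    keyframes = []
    ducked = False
    for i, total in enumerate(totals):
        active = total >= threshold  # foreground is loud enough to duck the background
        if active != ducked:
            keyframes.append((i * slice_len, ducked_gain_db if active else 0.0))
            ducked = active
    return keyframes

# Hypothetical foreground: silence, a short burst, silence.
foreground = [0.0] * 8 + [0.5, -0.5] * 4 + [0.0] * 8
levels = slice_rms(foreground, slice_len=4)
totals = total_metrics(levels, window=2)
print(ducking_keyframes(totals, slice_len=4, threshold=0.2))
# prints [(8, -12.0), (20, 0.0)]
```

Each key frame is placed at the sample position of the slice whose total metric triggered it, which mirrors the abstract's statement that the key frame location corresponds to the slice location on the track.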

12.

Granted invention patent
Computer-generated feedback of user speech traits meeting subjective criteria [In force]

Publication No.: US11322172B2

Publication date: 2022-05-03

Application No.: US15611754

Filing date: 2017-06-01

Applicant: Microsoft Technology Licensing, LLC

Inventors: Oscar Roberto Morales Garrido, Paul Thackray, Kristen Kennedy

IPC classification: G10L25/51, G06Q10/10, H04M3/51, G06Q30/00, G09B19/04, G10L25/90, G10L25/48, G10L15/26

Abstract: Computer-generated feedback directed to whether user speech input meets subjective criteria is provided through the evaluation of multiple speaking traits. Initially, discrete instances of various multiple speaking traits are detected within the user speech input provided. Such multiple speaking traits include vocal fry, tag questions, uptalk, filler sounds and hedge words. Audio constructs indicative of individual instances of speaking traits are isolated and identified from appropriate samples. Speaking trait detectors then utilize such audio constructs to identify individual instances of speaking traits within the spoken input. The resulting quantities are scored based on reference to predetermined threshold quantities. The individual speaking trait scores are then amalgamated utilizing a weighting that is derived based on empirical relationships between those speaking traits and the criteria for which the user's speech input is being evaluated. Further adjustments thereof can be made by separately, manually weighting the previously determined quantities.
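The threshold-based scoring and weighted amalgamation described here can be sketched as follows. The linear scoring rule, the example counts, thresholds and weights, and the names `trait_score` and `overall_score` are all hypothetical; the abstract does not specify any of them.

```python
def trait_score(count, threshold):
    """Score one trait in [0, 1]: 1.0 for zero instances, 0.0 at or above the threshold."""
    return max(0.0, 1.0 - count / threshold)

def overall_score(counts, thresholds, weights):
    """Weighted average of per-trait scores (weights stand in for empirically derived ones)."""
    total_weight = sum(weights[t] for t in counts)
    weighted = sum(weights[t] * trait_score(counts[t], thresholds[t]) for t in counts)
    return weighted / total_weight

# Hypothetical detector output for one speech sample.
counts = {"vocal_fry": 2, "uptalk": 5, "filler_sounds": 10}
thresholds = {"vocal_fry": 4, "uptalk": 10, "filler_sounds": 10}
weights = {"vocal_fry": 1.0, "uptalk": 1.0, "filler_sounds": 2.0}

print(overall_score(counts, thresholds, weights))
# prints 0.25
```

A manual adjustment of the kind mentioned in the last sentence of the abstract would correspond to overriding entries in `weights` before calling `overall_score`.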

13.

Granted invention patent
Howl detection in conference systems [In force]

Publication No.: US11277518B2

Publication date: 2022-03-15

Application No.: US16640169

Filing date: 2018-09-27

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Kai Li, David Gunawan, Feng Deng, Qianqian Fang

IPC classification: H04M3/56, G10L25/48

Abstract: The disclosed teleconferencing methods involve detecting a howl state during a teleconference which involves two or more teleconference client locations and a teleconference server. The teleconference server is configured for providing full-duplex audio connectivity between the teleconference client locations. The howl state is a state of acoustic feedback involving two or more teleconference devices in a teleconference client location. Detecting the howl state involves an analysis of both spectral and temporal characteristics of teleconference audio data. The disclosed teleconferencing methods involve determining which client location is causing the howl state and involve mitigating the howl state or sending a howl state detection message.
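One common way to combine spectral and temporal cues for feedback detection is to look for a narrow spectral peak that stays in the same frequency bin over several consecutive frames. The sketch below illustrates that general idea only; the abstract does not say which features the patent uses, and the peak-to-average power ratio test, the thresholds, and the function names are assumptions.

```python
import cmath
import math

def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short illustrative frames)."""
    n = len(frame)
    return [abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
            for k in range(n // 2 + 1)]

def frame_peak(frame, papr_threshold):
    """Spectral cue: return the dominant bin if its peak-to-average power ratio is high."""
    power = [m * m for m in magnitude_spectrum(frame)]
    mean_power = sum(power) / len(power)
    if mean_power == 0.0:
        return None
    peak_bin = max(range(len(power)), key=power.__getitem__)
    return peak_bin if power[peak_bin] / mean_power >= papr_threshold else None

def detect_howl(frames, papr_threshold=8.0, min_frames=3):
    """Temporal cue: the same dominant bin must persist for min_frames consecutive frames."""
    run_bin, run_len = None, 0
    for frame in frames:
        peak = frame_peak(frame, papr_threshold)
        if peak is not None and peak == run_bin:
            run_len += 1
        else:
            run_bin, run_len = peak, (1 if peak is not None else 0)
        if run_len >= min_frames:
            return True
    return False

# Hypothetical test frames: a sustained pure tone versus a broadband click.
tone = [math.sin(2 * math.pi * 4 * t / 32) for t in range(32)]
click = [1.0] + [0.0] * 31

print(detect_howl([tone] * 4), detect_howl([click] * 4))
# prints True False
```

The click has a flat spectrum, so it never passes the spectral test, while the tone passes it and also persists in one bin long enough to satisfy the temporal test.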

14.

Invention application
SYSTEMS AND METHODS OF LIVE STREAMING EMERGENCY DISPATCH DATA TO FIRST RESPONDERS [In force]

Publication No.: US20220070290A1

Publication date: 2022-03-03

Application No.: US17522230

Filing date: 2021-11-09

Applicant: HigherGround, Inc.

Inventors: William F. Reber, Rajesh ChandraMohan Garg, Samuel Hood Smith, Thomas W. Goodwin, III

IPC classification: H04M1/72421, G10L25/48, H04W4/021, H04M3/436, H04M3/523, H04M3/51, H04W4/90

Abstract: Systems, devices, and methods including: capturing, by a capture device, an audio and corresponding location metadata associated with an emergency call; refining the location metadata to provide a refined location metadata; correlating, by the capture device, the refined location metadata of the emergency call with a geofenced location of the computing devices of one or more first responders (FRs); screening, by the capture device or the computing device, the emergency call data; transmitting, by the capture device, a first signal to the one or more computing devices based on the correlation, the transmitted signal including a portion of the captured audio and corresponding location metadata; receiving, by the capture device, an accept signal from the one or more computing devices of one or more FRs; transmitting, by the capture device, a second signal to the one or more computing devices based on the received accept signal.
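The correlation step, matching a call location against responder geofences, could look roughly like the sketch below. It assumes circular geofences and great-circle (haversine) distance; the abstract does not specify the geofence shape, and the function names and coordinates are hypothetical.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two (latitude, longitude) points in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))

def responders_in_range(call_location, geofences):
    """Return ids of responders whose circular geofence contains the call location.

    geofences maps responder id -> (centre_lat, centre_lon, radius_m).
    """
    lat, lon = call_location
    return sorted(rid for rid, (glat, glon, radius) in geofences.items()
                  if haversine_m(lat, lon, glat, glon) <= radius)

# Hypothetical refined call location and responder geofences.
call = (40.0, -75.0)
geofences = {
    "unit_1": (40.0, -75.0, 500.0),    # same point as the call
    "unit_2": (40.01, -75.0, 500.0),   # about 1.1 km north, outside its 500 m fence
    "unit_3": (40.003, -75.0, 500.0),  # about 330 m north, inside
}
print(responders_in_range(call, geofences))
# prints ['unit_1', 'unit_3']
```

In the flow the abstract describes, the first signal would then be sent only to the devices returned by this check.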

15.

Invention application
LOCALIZATION BASED ON TIME-REVERSED EVENT SOUNDS [In force]

Publication No.: US20220036913A1

Publication date: 2022-02-03

Application No.: US16943250

Filing date: 2020-07-30

Applicant: Lawrence Livermore National Security, LLC

Inventors: Jim Candy, Karl A. Fisher, Christopher Roland Candy

IPC classification: G10L25/48, G10L21/04, H04L5/00, H04L29/06, H04W4/02

Abstract: A system determines an event location of an event within an indoor environment based on an event sound generated by the event. The system employs time-reversal techniques based on a received event sound to identify the event location as being in the vicinity of one of a plurality of locator devices at locator locations in the environment. The system includes a base array located within the environment that receives an indication that an event has been detected. Upon receiving the event sound, the system generates a time-reversed event sound for each transceiver and transmits via each transceiver the time-reversed event sound for that transceiver. When a locator device receives a time-reversed event sound, the locator device determines whether the event is in the vicinity of that locator location of the locator device and, if so, outputs an indication that the event occurred at that locator location.
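The physical idea behind time-reversal localization is that a signal recorded through a channel, reversed in time and sent back through the same channel, refocuses into a strong peak, while a different channel spreads it out. The toy sketch below shows this with a single transceiver and two made-up impulse responses; the channels, the event waveform, and the peak comparison are illustrative assumptions, not details from the patent.

```python
def convolve(a, b):
    """Full linear convolution of two sequences."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def peak(signal):
    """Largest absolute sample value."""
    return max(abs(x) for x in signal)

# Hypothetical impulse responses between the transceiver and two locator locations.
channel_a = [1.0, 0.0, 0.5, 0.0, 0.25]
channel_b = [0.25, 0.5, 0.0, 1.0, 0.0]
event = [1.0, -1.0]  # short event sound emitted near locator A

received = convolve(event, channel_a)  # what the transceiver records
time_reversed = received[::-1]         # what it transmits back

at_a = peak(convolve(time_reversed, channel_a))  # refocuses through the matching channel
at_b = peak(convolve(time_reversed, channel_b))  # mismatched channel, weaker peak
print(at_a, at_b)
# prints 1.3125 1.0
```

A locator device could compare its received peak against a threshold to decide whether the event occurred in its vicinity; the threshold itself would have to be chosen for the environment.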

16.

Invention application
METHOD AND SYSTEM FOR CORRECTING INFANT CRYING IDENTIFICATION [In force]

Publication No.: US20220028409A1

Publication date: 2022-01-27

Application No.: US17004015

Filing date: 2020-08-27

Applicant: NATIONAL YUNLIN UNIVERSITY OF SCIENCE AND TECHNOLOGY

Inventors: Chuan-Yu CHANG, Jun-Ying LI

IPC classification: G10L25/48, G10L25/18, G10L25/30, G10L25/24, G06N3/08, G06N3/04

Abstract: A method for correcting infant crying identification includes the following steps: a detecting step provides an audio unit to detect a sound around an infant to generate a plurality of audio samples. A converting step provides a processing unit to convert the audio samples to generate a plurality of audio spectrograms. An extracting step provides a common model to extract the audio spectrograms to generate a plurality of infant crying features. An incremental training step provides an incremental model to train the infant crying features to generate an identification result. A judging step provides the processing unit to judge whether the identification result is correct according to a real result of the infant. When the identification result is different from the real result, an incorrect result is generated. A correcting step provides the processing unit to correct the incremental model according to the incorrect result.

17.

Granted invention patent
Device for monitoring the use of blister packaged contents at a distance [In force]

Publication No.: US11227681B2

Publication date: 2022-01-18

Application No.: US16359339

Filing date: 2019-03-20

Applicant: Intelligent Devices SEZC Inc.

Inventors: Allan Wilson, Michael Petersen, Dean Brotzel

IPC classification: G16H20/13, G08B5/22, G08B21/24, G10L25/48, A61J1/03, G08B3/02

Abstract: There is provided a device for monitoring the use of a blister package, strip package, vial or bottle contents at a distance. A processor is connected to a compact random or quasi-random n-microphone array and is programmed to detect the sound of the content being expelled from a blister cavity, strip package, or a cap being removed from a vial or bottle. A content use data memory associated with the processor stores information relating to the expulsion or removal events. The processor is equipped with statistical means for differentiating the sound of the content being expelled, from the background noise, generating an electrical signal that is analyzed for relevance to content use events by the processor, and storing the resulting use data in memory. The processor may have an adaptive beam focussing algorithm to determine the direction of the source of the sound.

18.

Granted invention patent
Method of generating estimated value of local inverse speaking rate (ISR) and device and method of generating predicted value of local ISR accordingly [In force]

Publication No.: US11200909B2

Publication date: 2021-12-14

Application No.: US16557159

Filing date: 2019-08-30

Applicant: National Chiao Tung University

Inventors: Chen-Yu Chiang, Guan-Ting Liou, Yih-Ru Wang, Sin-Horng Chen

IPC classification: G10L25/00, G10L25/48, G10L15/14, G06N20/00, G06N7/00, G10L15/18

Abstract: A method is disclosed. The proposed method includes: providing an initial speech corpus including plural utterances; based on a condition of maximum a posteriori (MAP), according to respective sequences of syllable duration, syllable duration prosodic state, syllable tone, base-syllable type, and break type of the $k$-th utterance, using a probability of an ISR of the $k$-th utterance $x_k$ to estimate an estimated value $\hat{x}_k$ of $x_k$; and through the MAP condition, according to respective sequences of syllable duration, syllable duration prosodic state, syllable tone, base-syllable type, and break type of the given $l$-th breath group/prosodic phrase group (BG/PG) of the $k$-th utterance, using a probability of an ISR of the $l$-th BG/PG of the $k$-th utterance $x_{k,l}$ to estimate an estimated value $\hat{x}_{k,l}$ of $x_{k,l}$, wherein $\hat{x}_{k,l}$ is the estimated value of local ISR, and a mean of a prior probability model of $\hat{x}_{k,l}$ is $\hat{x}_k$.

19.

Granted invention patent
Control device, medical observation system, control method, and computer readable recording medium [In force]

Publication No.: US11178340B2

Publication date: 2021-11-16

Application No.: US16904586

Filing date: 2020-06-18

Applicant: Sony Olympus Medical Solutions Inc.

Inventors: Narumi Sato, Ryohei Kagawa

IPC classification: H04N5/235, H04N5/225, H04N7/18, G10L25/03, A61B1/045, A61B1/06, A61B1/267, G10L25/48

Abstract: A control device includes: a hardware processor; and a memory, wherein the hardware processor is configured to: control an image sensor configured to generate an image signal by performing imaging sequentially according to predetermined frames; detect a frequency of vocal cord vibration of a subject based on a voice signal; set a pulse width and a light emission cycle for when a light source emits light, based on the frequency and a preset duty cycle; control the light source to emit the pulse light using the pulse width and the light emission cycle in one field period or one frame period of the image sensor in synchronization with the frequency; calculate, based on the light emission cycle or the frequency, a gain amount by which the image signal is to be multiplied; and multiply the image signal by the gain amount.
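The relationship between vocal cord frequency, duty cycle, pulse width and light emission cycle can be worked through with simple arithmetic. The sketch below assumes the emission cycle equals one vibration period and the pulse width is the duty cycle times that period; the abstract does not give the exact formulas, and the gain calculation is left out because the abstract does not define it. The function name and example values are hypothetical.

```python
import math

def strobe_timing(vocal_freq_hz, duty_cycle, frame_period_s):
    """Return (emission_cycle_s, pulse_width_s, pulses_per_frame).

    Assumes one light pulse per vocal cord vibration period.
    """
    emission_cycle = 1.0 / vocal_freq_hz
    pulse_width = duty_cycle * emission_cycle
    # Small epsilon guards against floating-point results such as 3.9999999.
    pulses_per_frame = math.floor(frame_period_s * vocal_freq_hz + 1e-9)
    return emission_cycle, pulse_width, pulses_per_frame

# Hypothetical values: 200 Hz vibration, 10% duty cycle, 50 frames per second.
cycle, width, pulses = strobe_timing(200.0, 0.10, 1.0 / 50.0)
print(cycle, width, pulses)
```

With these example values the emission cycle is 5 ms, the pulse width is 0.5 ms, and four pulses fall within each 20 ms frame period.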

20.

Granted invention patent
Audio content production, audio sequencing, and audio blending system and method [In force]

Publication No.: US11169765B2

Publication date: 2021-11-09

Application No.: US16566805

Filing date: 2019-09-10

Applicant: SUPER HI FI, LLC

Inventors: Zack J. Zalon, Brendon Patrick Cassidy

IPC classification: G06F3/16, G11B27/036, G11B27/28, G10L25/48, G05B15/02

Abstract: Embodiments of the invention provide an audio blending system with a computing device that processes operations including receiving a transition request from a user including an out element and/or an in element of at least one transition between at least one content item of at least one recipe. The recipe includes a sequence of a plurality of elements of content of a break, where at least one content item includes audio content and/or video content. The operations include causing a track server to couple to a metadata file of the audio file using a wired or wireless link. The metadata file includes audio content parameters measured or calculated from the audio file. The operations include calculating a transition between the out element and the in element, selecting, assembling and scheduling the sequence of plurality of elements for the transition, and adding the out element to the at least one recipe.
