-
公开(公告)号:WO2023288265A1
公开(公告)日:2023-01-19
申请号:PCT/US2022/073721
申请日:2022-07-14
Applicant: SRI INTERNATIONAL
Inventor: LUBIN, Jeffrey , SPENCE, Clay
IPC: G10L21/007 , G10L13/02 , G10L17/14 , G10L25/03
Abstract: A computing system that receives an audio waveform representing speech from an individual and produces as output a. modified version of the audio waveform that maintains the speaker's speech characteristics as well as prosody for specific utterances (e.g., voice timbre, intonation, timing, intensity). The sy stem uses a bottleneck-based autoencoder with speech spectrograms as input and output. To produce the output audio waveform, the system includes a. reconstruction error-based loss function with two additional loss functions. The second loss function is speaker "real vs fake" discriminator that penalizes for the output not sounding like the speaker. The third loss function is a. speech intelligibility scorer that penalizes the output for speech that is difficult for the target population to understand. The produced modified audio waveform is an enhanced speech output that delivers speech m a target accent without sacrificing the personality of the speaker.
-
2.
公开(公告)号:WO2018183546A1
公开(公告)日:2018-10-04
申请号:PCT/US2018/024909
申请日:2018-03-28
Applicant: SRI INTERNATIONAL , ACHARYA, Girish , BERCOW, Douglas , BURNS, John, Brian , CLYMER, Bradley, J. , HELLER, Aaron, J. , LUBIN, Jeffrey , MAHADEVAN, Sridhar , RAMAMURTHY, Bhaskar , WATTERS, David , SUNDARESAN, Aravind
Inventor: ACHARYA, Girish , BERCOW, Douglas , BURNS, John, Brian , CLYMER, Bradley, J. , HELLER, Aaron, J. , LUBIN, Jeffrey , MAHADEVAN, Sridhar , RAMAMURTHY, Bhaskar , WATTERS, David , SUNDARESAN, Aravind
IPC: G01S13/00
CPC classification number: G01S7/415 , G01S7/412 , G01S7/417 , G01S13/04 , G01S13/26 , G01S13/582 , G01S13/723
Abstract: An identification system includes a radar sensor configured to generate a time- domain or frequency-domain signal representative of electromagnetic waves reflected from one or more objects within a three-dimensional space over a period of time and a computation engine executing on one or more processors. The computation engine is configured to process the time-domain or frequency-domain signal to generate range and velocity data indicating motion by a living subject within the three-dimensional space. The computation engine is further configured to identify, based at least on the range and velocity data indicating the motion by the living subject, the living subject and output an indication of an identity of the living subject.
-
公开(公告)号:WO2018175616A1
公开(公告)日:2018-09-27
申请号:PCT/US2018/023606
申请日:2018-03-21
Applicant: SRI INTERNATIONAL
Inventor: LUBIN, Jeffrey , BERGEN, James R.
IPC: G06K9/00 , A61B5/024 , A61B5/11 , A61B5/117 , A61B5/1171
CPC classification number: A61B5/117 , A61B5/024 , G06K9/00 , G06K9/00892 , G06K9/00906 , G06K2009/00939
Abstract: A biometric access control system for controlling access to an environment based on an authorization status of a living subject is disclosed. In one example, a data source generates image data of a tissue region of the subject. A liveness measurement unit processes the image data to detect changes over at least one of time or spatial volume in one or more structural features of the tissue region and generates, based on the detected changes, a spoofing attack detection status indicating that the image data is from living biological tissue or that a spoofing attack is detected. A biometric identification unit processes at least a portion of the same image data generated by the data source to generate biometric information indicative of an identity of the subject. Responsive to the spoofing attack detection status and the biometric information, an authorization unit outputs an authorization status for the subject.
-
-