-
Publication No.: US20210325940A1
Publication Date: 2021-10-21
Application No.: US17359009
Application Date: 2021-06-25
Applicant: Intel Corporation
Inventor: Sean J.W. Lawrence
IPC: G06F1/16 , G01S15/08 , G01B17/00 , G06F1/3218
Abstract: Techniques for ultrasound-based hinge estimation are disclosed. In the illustrative embodiment, a speaker in a base portion of a computing device generates an ultrasonic sound pulse. The pulse is sensed by a microphone in a lid portion of the computing device. The time-of-flight of the sound pulse from the speaker to the microphone is used to determine the distance between the speaker and the microphone. The angle of the lid portion relative to the base portion can then be determined based on the position of the speaker, the position of the microphone, and the distance between the speaker and the microphone.
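Illustrative note: the geometry described in this abstract can be sketched with the law of cosines. This is a minimal sketch under stated assumptions, not the method claimed in the application. It assumes the speaker and microphone lie in the same plane perpendicular to the hinge axis, that their distances from the hinge are known, and that sound travels at about 343 m/s (dry air near 20 °C). The function name and parameters are hypothetical.

```python
import math

# Assumption: speed of sound in dry air at roughly 20 °C.
SPEED_OF_SOUND_M_S = 343.0


def hinge_angle_deg(time_of_flight_s, speaker_to_hinge_m, mic_to_hinge_m):
    """Estimate the lid angle in degrees from an ultrasonic time-of-flight.

    Hypothetical helper: assumes the speaker (base) and microphone (lid)
    sit in one plane perpendicular to the hinge axis.
    """
    # Time-of-flight gives the straight-line speaker-to-microphone distance.
    distance = SPEED_OF_SOUND_M_S * time_of_flight_s
    a, b = speaker_to_hinge_m, mic_to_hinge_m
    # Law of cosines: d^2 = a^2 + b^2 - 2ab*cos(theta).
    cos_theta = (a * a + b * b - distance * distance) / (2.0 * a * b)
    # Clamp to [-1, 1] so measurement noise cannot push acos out of range.
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))
```

For example, with both transducers 0.2 m from the hinge, a measured distance of 0.2·√2 m corresponds to a lid opened to a right angle.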
-
Publication No.: US20220230643A1
Publication Date: 2022-07-21
Application No.: US17711142
Application Date: 2022-04-01
Applicant: Intel Corporation
Inventor: Arun A. Pai , Mahendran Rajendran , Suganthi Balasubramanian , Sajal Kumar Das , Sean J.W. Lawrence
IPC: G10L19/005 , G10L15/22 , G10L25/60 , G10L25/57 , G10L15/183
Abstract: Techniques for teleconferencing with enhanced audio during low-quality connection conditions are disclosed. In the illustrative embodiment, a user of a compute device is teleconferencing with users of one or more remote compute devices. The compute device monitors a connection quality with a remote compute device. If the connection quality drops below a threshold, risking gaps in the audio data, the compute device generates speech code data that can be used to fill in the gaps in the audio data. The remote compute device can use the speech code data to augment the audio data by using a voice model to create additional audio data based on the speech code data.
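Illustrative note: the threshold logic described in this abstract can be sketched as follows. This is a rough sketch under stated assumptions, not the implementation in the application. The `Frame` type, the `encode` and `synthesize` callables (standing in for the speech coder and the voice model), and the idea that speech code data can arrive even when the matching audio is lost are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Frame:
    # None models audio that was lost in transit (hypothetical representation).
    audio: Optional[bytes]
    # Compact speech code data, produced only while the connection is poor.
    speech_code: Optional[str]


def build_frame(audio: bytes, quality: float, threshold: float,
                encode: Callable[[bytes], str]) -> Frame:
    """Sender side: attach speech code data only below the quality threshold."""
    code = encode(audio) if quality < threshold else None
    return Frame(audio=audio, speech_code=code)


def reconstruct(frames: List[Frame],
                synthesize: Callable[[str], bytes]) -> List[bytes]:
    """Receiver side: fill audio gaps from speech code data when available."""
    out = []
    for frame in frames:
        if frame.audio is not None:
            out.append(frame.audio)
        elif frame.speech_code is not None:
            # The voice model (here just a callable) regenerates the audio.
            out.append(synthesize(frame.speech_code))
        # With neither audio nor speech code, the gap is left unfilled.
    return out
```

In this sketch the threshold is the single point of decision for whether the sender does the extra encoding work, which mirrors the condition the abstract describes.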
-
Publication No.: US20180053043A1
Publication Date: 2018-02-22
Application No.: US15239033
Application Date: 2016-08-17
Applicant: Intel Corporation
Inventor: Sean J.W. Lawrence
CPC classification number: G06K9/00355 , G06K9/00335 , H04L65/602 , H04N7/147 , H04N7/15
Abstract: Various systems and methods for processing video data, including gesture masking in a video feed, are provided herein. The system can include a camera system interface to receive video data from a camera system; a gesture detection unit to determine a gesture within the video data, the gesture being performed by a user; a permission module to determine a masking permission associated with the gesture; and a video processor. The video processor can modify a portion of the video data associated with the gesture in accordance with the masking permission, and cause the portion of the video data to be displayed. The camera system includes a depth camera or an infrared camera. The gesture detection unit determines the gesture based on machine learning.
-
Publication No.: US20190103098A1
Publication Date: 2019-04-04
Application No.: US15720446
Application Date: 2017-09-29
Applicant: INTEL CORPORATION
Inventor: Sean J.W. Lawrence , Devon Worrell , Michael E. Deisher
IPC: G10L15/22 , G10L15/20 , G10K11/175
CPC classification number: G10L15/22 , G06F3/167 , G10K11/175 , G10L15/08 , G10L15/20 , G10L21/003 , G10L2015/088 , G10L2015/223 , G10L2021/02087 , H04K1/00
Abstract: Techniques are disclosed for reliably masking speech commands directed to one or more computing devices to prevent the speech commands from being rendered. In some embodiments, each of the one or more computing devices includes components configured to generate acoustic data from ambient sound waves, process the acoustic data to identify a speech command sequence, and mask the speech command sequence from being rendered. At least some of the systems and methods disclosed herein monitor inbound audio at a fine grain level of detail. Working at this level of granularity enables the system and methods described herein to detect potential speech commands early within the user's utterance thereof and to discriminate quickly between true speech commands and other user utterances. These early detection and discrimination features, in turn, enable some embodiments to manage potential communication disruptions (e.g., jitter and/or latency) by modifying rates of audio prior to rendering.
-