-
公开(公告)号:US20200302159A1
公开(公告)日:2020-09-24
申请号:US16898721
申请日:2020-06-11
Applicant: Analog Devices, Inc.
Inventor: Atulya YELLEPEDDI , Kaushal SANGHAI , John Robert McCARTY , Brian C. DONNELLY , Nicolas Le DORTZ , Johannes TRAA
Abstract: Far field devices typically rely on audio only for enabling user interaction and involve only audio processing. Adding a vision-based modality can greatly improve the user interface of far field devices to make them more natural to the user. For instance, users can look at the device to interact with it rather than having to repeatedly utter a wakeword. Vision can also be used to assist audio processing, such as to improve the beamformer. For instance, vision can be used for direction of arrival estimation. Combining vision and audio can greatly enhance the user interface and performance of far field devices.