-
公开(公告)号:US20200258526A1
公开(公告)日:2020-08-13
申请号:US16795849
申请日:2020-02-20
发明人: Philip Alan Bunker , Mayank Saxena
摘要: An audio firewall system has a microphone that generates audio data. A speech-to-text engine converts the audio data to text data. The text data is parsed for a service wake word and corresponding content data. The service wake word identifies one of a local security system and a remote assistant server. A text-to-speech engine converts the service wake word and the corresponding content data to converted audio data. The converted audio data is provided to the remote assistant server. The content data is provided to the local security system. The audio firewall system receives a response from the remote assistant server or the local security system and outputs an audio signal corresponding to the response.
-
公开(公告)号:US20200258422A1
公开(公告)日:2020-08-13
申请号:US16787256
申请日:2020-02-11
申请人: Can-U-C Ltd.
发明人: Leeroy SOLOMON , Doron SOLOMON
摘要: A method and a wearable system which includes distance sensors, cameras and headsets, which all gather data about a blind or visually impaired person's surroundings and are all connected to a portable personal communication device, the device being configured to use scenario-based algorithms and an A.I to process the data and transmit sound instructions to the blind or visually impaired person to enable him/her to independently navigate and deal with his/her environment by provision of identification of objects and reading of local texts.
-
公开(公告)号:US20200234705A1
公开(公告)日:2020-07-23
申请号:US16823264
申请日:2020-03-18
发明人: Guolai MA , Tian CHEN , Liang ZHANG , Zheng YUAN
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing voice data are provided. One of methods, implemented by an IoT device, includes: receiving voice data from a server, wherein the voice data is obtained through converting text data to voice data by the server; determining a content attribute associated with the voice data; determining a content attribute type of the content attribute associated with the voice data; determining a first play rule matching the content attribute type based on a matching relationship between content attribute types and respective first play rules, wherein the first play rule including a play starting time and a play mode; and automatically playing the voice data according to the play starting time and the play mode.
-
公开(公告)号:US20200231382A1
公开(公告)日:2020-07-23
申请号:US16254154
申请日:2019-01-22
申请人: Everseen Limited
IPC分类号: B65G1/137 , G10L13/04 , G10L15/22 , G10L15/18 , G06K9/00 , G06F21/32 , G06Q50/28 , G06F16/583
摘要: A warehouse management system is configured to include a biometric processing engine to authenticate a user though facial recognition of the user, a natural language processing engine to enables touchless interaction of the user with the system, a voice synthesizer to execute synthesized voice interrogative prompts and respond to responses received from the user, and a quantity detection engine for capturing images of transaction articles and feeding to an artificial image quantization engine (AIQE) for extracting quantities from the captured images and to an artificial image self-learning engine (AISLE) for self-learning of the AIQE.
-
公开(公告)号:US10714074B2
公开(公告)日:2020-07-14
申请号:US15921336
申请日:2018-03-14
发明人: Jie Liang , Weiyong Wu
IPC分类号: G10L13/04 , H04L29/08 , G06F40/205 , G10L13/047 , G10L13/08 , H04L29/06
摘要: The present disclosure provides a method, a browser client, and a server for reading web page information by speech. The browser client is installed with a text to speech (TTS) engine. The method includes: sending, by a browser client, a page access request to a server, where the page access request includes a page address and TTS identity information; receiving, by the browser client, response data returned by the server, where the response data includes a TTS standard version number determined by the server according to the TTS identity information, and TTS page data corresponding to the page address; and reading, by the browser client, the TTS page data by speech according to the TTS standard version number by using a TTS engine. In the present disclosure, page information is read by speech by using the TTS engine installed on the browser client. When it is inconvenient for a user to browse a page with eyes, and for users whose eyes have physical problems, the read page information can be listened by using a sense of hearing. Therefore, a convenient hearing-based manner is provided to users to browse a page.
-
公开(公告)号:US20200219506A1
公开(公告)日:2020-07-09
申请号:US16732821
申请日:2020-01-02
发明人: Achintya Kumar Bhowmik , David Alan Fabry , Amit Shahar , Justin R. Burwinkel , Jeffrey Paul Solum , Thomas Howard Burns
摘要: Embodiments herein relate to a local assistant system responding to voice input using an ear-wearable device. The system detects a wake-up signal and receives a first voice input communicating a first query content. The system includes speech recognition circuitry to determine the first query content, speech generation circuitry, and an input database of locally-handled user inputs. If the first audio input matches one of the locally-handled user inputs, then the system takes a local responsive action. If the first audio input does not match one of the locally-handled user inputs, then the system transmits at least a portion of the first query content over a wireless network to a network resource.
-
公开(公告)号:US20200219484A1
公开(公告)日:2020-07-09
申请号:US16241703
申请日:2019-01-07
发明人: Shikhar Kwatra , Paul Krystek , Sarbajit Rakshit
摘要: Embodiments for managing a chatbot by one or more processors are described. A communication is received from a first individual. The presence of a second individual within a proximity of a speaker is detected. A response to the communication is determined based on the communication and the detected presence of the second individual. The determined response is caused to be executed.
-
公开(公告)号:US10708725B2
公开(公告)日:2020-07-07
申请号:US15424731
申请日:2017-02-03
申请人: T-Mobile USA, Inc.
发明人: Niraj Nayak
IPC分类号: G10L13/00 , H04W4/12 , G10L13/04 , H04M1/725 , G10L13/047
摘要: Various embodiments generally relate to systems and methods for creation of voice memos while an electronic device is in a driving mode. In some embodiments, a triggering event can be used to indicate that the electronic device is within a car or about to be within a car and that text communications should be translated (e.g., via an application or a conversion platform) into a voice memo that can be played via a speaker. These triggering events can include a manual selection or an automatic selection based on a set of transition criteria (e.g., electronic device moving above a certain speed, following a roadway, approaching a location in a map of a marked car, etc.).
-
公开(公告)号:US20200213655A1
公开(公告)日:2020-07-02
申请号:US15736359
申请日:2017-09-18
发明人: Haiting Feng , Biao Liu , Xuewei Zhao , Qiang Huang
IPC分类号: H04N21/422 , H04N21/4788 , H04N21/431 , H04N21/61 , G10L15/30 , G10L15/22 , G10L15/26 , G10L13/04
摘要: The example system and method enable a user to post comments to appear as textual content in a streaming graphical content presentation. The textual content and the graphical content may be provided to the user's set top box for presentation on an associated display device and may be provided to another user's set top box for presentation on the other user's display device. Each of the users utilize a wireless remote control enabled to accept text and/or voice inputs of a comment for inclusion in the streaming graphical content. The set top box generates a comment based on the inputted text and/or voice received from the remote control. The set top box provides the generated comment to a media server which distributes textual content based on the generated comment to other set top boxes for presentation with the streaming graphical content.
-
公开(公告)号:US20200211553A1
公开(公告)日:2020-07-02
申请号:US16726216
申请日:2019-12-23
发明人: Gregory BOHL , Mengling HETTINGER , Prithvi KAMBHAMPATI , Behrouz SAGHAFI KHADEM , Nikhil PATEL
摘要: One or more embodiments include a virtual personal assistant module executing on a virtual personal assistant system. The virtual personal assistant module obtains first sensor data from a first sensor included in a plurality of sensors. The virtual personal assistant module analyzes the first sensor data to generate a first result. The virtual personal assistant module obtains second sensor data from a second sensor included in the plurality of sensors. The virtual personal assistant module analyzes the second sensor data and the first result to generate a second result. The virtual personal assistant module outputs a natural language audio output to the user based on the second result.
-
-
-
-
-
-
-
-
-