-
公开(公告)号:US20240134492A1
公开(公告)日:2024-04-25
申请号:US18278372
申请日:2022-02-22
Applicant: Apple Inc.
Inventor: Jessica J. PECK , James N. JONES , Ieyuki KAWASHIMA , Lynn I. STREJA
IPC: G06F3/04815 , G06F3/01 , G10L15/22
CPC classification number: G06F3/04815 , G06F3/013 , G06F3/017 , G10L15/22 , G10L2015/223
Abstract: An example process includes while displaying, on a display, an extended reality (XR) environment: receiving a user input; sampling, with a microphone, a user speech input; in accordance with a determination that the user input satisfies a criterion for initiating a digital assistant, initiating the digital assistant, including: displaying, within the XR environment, a digital assistant indicator at a first location of the XR environment; and while displaying the digital assistant indicator at the first location, providing, by the digital assistant, a response to the user speech input; after providing the response, ceasing to display the digital assistant indicator at the first location; and in accordance with ceasing to display the digital assistant indicator at the first location, displaying the digital assistant indicator at a second location of the XR environment, the second location corresponding to a physical location of an external electronic device implementing a second digital assistant.
-
公开(公告)号:US11967248B2
公开(公告)日:2024-04-23
申请号:US17425947
申请日:2019-12-12
Applicant: Jangho Lee
Inventor: Jangho Lee
CPC classification number: G09B19/06 , G06F40/263 , G09B5/06 , G10L13/08 , G10L15/005 , G10L15/22 , G10L15/26 , G10L2015/223
Abstract: A method for foreign language learning between a learner and a terminal, based on video or audio containing foreign language, particularly, to a conversation-based foreign language learning method using a speech recognition function and a TTS function of a terminal, a learner learns a foreign language in a way that: the terminal reads a current learning target sentence to the learner to allow the learner to speak the current learning target sentence after the terminal, when speech input by the learner in a speech waiting state of the terminal is the same as the current learning target sentence or belongs to the same category as the current learning target sentence; and the terminal and the learner alternately speak sentences one-by-one when the speech input by the learner is the same as the next sentence of the current learning target sentence or belongs to the same category as the next sentence.
-
公开(公告)号:US20240127813A1
公开(公告)日:2024-04-18
申请号:US18278665
申请日:2021-12-17
Applicant: Huawei Technologies Co., Ltd.
Inventor: Wenmei Gao , Henghui Lu , Yuewan Lu
CPC classification number: G10L15/22 , G06F3/167 , H04W4/021 , H04W4/06 , G10L2015/223
Abstract: This application provides a voice interaction method. For a plurality of electronic devices equipped with a same voice assistant, a voice assistant of only one electronic device is in a working mode, and voice assistants of other electronic devices are all in a silent mode. When a user gives a voice for the voice assistant, the other electronic devices whose voice assistants are in the silent mode do not respond to a first instruction triggered by the voice of the user, only the electronic device whose voice assistant is in the working mode sends, according to the first instruction triggered by the voice of the user, a second instruction to an electronic device that is in the plurality of electronic devices and that has a capability of executing a task requested by the user, and the electronic device that receives the second instruction executes the task requested by the user.
-
公开(公告)号:US20240127811A1
公开(公告)日:2024-04-18
申请号:US18239089
申请日:2023-08-28
Applicant: Health Scholars Inc.
Inventor: Brian Philip Gillett , Akmal Hisyam Idris , James Oliver Lussier , Dustin Richard Parham , Kit Lee Burgess
CPC classification number: G10L15/22 , G06T19/006 , G10L15/1822 , G10L2015/223
Abstract: A method for processing multiple intents from an audio stream in an extended reality application may include multiple steps, including: receiving a stream of words as a first utterance; processing the first utterance before the stream of words is fully received; based on the processing, determining a first intent from the first utterance before the stream of words is fully received; determining occurrence of a pause after the first utterance; and receiving a second stream of words as a second utterance, the second stream being received after the determined pause.
-
公开(公告)号:US20240126502A1
公开(公告)日:2024-04-18
申请号:US18397786
申请日:2023-12-27
Applicant: Snap Inc.
Inventor: Sharon Moll , Piotr Gurgul
IPC: G06F3/16 , G06F3/04817 , G06F3/0484 , G10L15/16 , G10L15/22
CPC classification number: G06F3/167 , G06F3/04817 , G06F3/0484 , G10L15/16 , G10L15/22 , G10L2015/223
Abstract: Systems, methods, and computer readable media for voice-controlled user interfaces (UIs) for augmented reality (AR) wearable devices are disclosed. Embodiments are disclosed that enable a user to interact with the AR wearable device without using physical user interface devices. An application has a non-voice-controlled UI mode and a voice-controlled UI mode. The user selects the mode of the UI. The application running on the AR wearable device displays UI elements on a display of the AR wearable device. The UI elements have types. Predetermined actions are associated with each of the UI element types. The predetermined actions are displayed with other information and used by the user to invoke the corresponding UI element.
-
公开(公告)号:US11961518B2
公开(公告)日:2024-04-16
申请号:US17289113
申请日:2019-11-12
Applicant: KONICA MINOLTA PLANETARIUM CO., LTD.
Inventor: Kenichi Komaba
IPC: G10L15/22 , G06F40/157 , G09B27/04 , G10L15/26
CPC classification number: G10L15/22 , G06F40/157 , G09B27/04 , G10L15/26 , G10L2015/223
Abstract: Provided is a quick-responsive voice control technique even in use in a planetarium. A control device of a projector of a planetarium includes: a storage unit that stores a plurality of commands for controlling the projector, flags indicating whether or not the respective commands can be executed, and keywords associated with the respective commands; a voice acquisition unit that acquires voice data; a control unit that controls the control device; and a communication unit that communicates with the projector. The control unit determines whether or not each of the commands for the projector can be executed on the basis of state information of the projector, the state information being acquired through the communication unit, updates the flags on the basis of the determination result, generates character string information from voice data acquired by the voice acquisition unit, acquires a command in which an executable flag is set from the storage unit using the character string information as a search key, and transmits the acquired command to the projector.
-
公开(公告)号:US11961506B2
公开(公告)日:2024-04-16
申请号:US18113284
申请日:2023-02-23
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Chansik Bok , Jihun Park
CPC classification number: G10L15/005 , G10L13/086 , G10L15/04 , G10L15/22 , G10L15/26 , G10L2015/223
Abstract: An electronic apparatus including a memory configured to store first voice recognition information related to a first language and second voice recognition information related to a second language, and a processor to obtain a first text corresponding to a user voice that is received on the basis of first voice recognition information, based on an entity name being included in the user voice according to the obtained first text, identify a segment in the user voice in which the entity name is included. The processor is to obtain a second text corresponding to the identified segment of the user voice on the basis of the second voice recognition information, and obtain control information corresponding to the user voice on the basis of the first text and the second text.
-
公开(公告)号:US20240121133A1
公开(公告)日:2024-04-11
申请号:US18133474
申请日:2023-04-11
Applicant: Tri Cascade, Inc.
Inventor: Max Chin Li
CPC classification number: H04L12/2834 , G06N20/00 , G10L15/22 , G10L15/30 , H04L12/2814 , G10L2015/223 , H04L2012/2841
Abstract: Systems and methods for an Internet of Things (IoT), smart home climate control and communication system are provided. The IoT, smart home climate control and communication system includes a first smart home device that receives signal sources from a wide area network, transmits signals, data and commands to one or more smart home devices in a home or building in an IoT LAN. The first smart home device also receives signals, data and commands from the one or more smart home devices in the home or building on the IoT LAN, and transmits signals, data and/or commands to the wide area network. The IoT LAN is distinct from a residential wireless LAN.
-
公开(公告)号:US20240119936A1
公开(公告)日:2024-04-11
申请号:US18542904
申请日:2023-12-18
Applicant: Google LLC
Inventor: Gudmundur HAFSTEINSSON , Michael J. Lebeau , Natalia Marmasse , Sumit Agarwal , Dipochand Nishar
CPC classification number: G10L15/22 , G06F3/167 , G10L15/26 , G10L15/30 , H04M1/72403 , H04W4/14 , G10L2015/221 , G10L2015/223 , H04M2201/40 , H04M2242/15
Abstract: A method for receiving processed information at a remote device is described. The method includes transmitting from the remote device a verbal request to a first information provider and receiving a digital message from the first information provider in response to the transmitted verbal request. The digital message includes a symbolic representation indicator associated with a symbolic representation of the verbal request and data used to control an application. The method also includes transmitting, using the application, the symbolic representation indicator to a second information provider for generating results to be displayed on the remote device.
-
公开(公告)号:US20240119083A1
公开(公告)日:2024-04-11
申请号:US18538773
申请日:2023-12-13
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F16/432 , G06F3/16 , G06F40/30 , G10L15/22
CPC classification number: G06F16/433 , G06F3/167 , G06F40/30 , G10L15/22 , G10L2015/223
Abstract: Implementations described herein relate to receiving user input directed to an automated assistant, processing the user input to determine whether data from a server and/or third-party application is needed to perform certain fulfillment of an assistant command included in the user input, and generating a prompt that requests a user consent to transmitting of a request to the server and/or the third-party application to obtain the data needed to perform the certain fulfillment. In implementations where the user consents, the data can be obtained and utilized to perform the certain fulfillment. In implementations where the user does not consent, client data can be generated locally at a client device and utilized to perform alternate fulfillment of the assistant command. In various implementations, the request transmitted to the server and/or third-party application can be modified based on ambient noise captured when the user input is received.
-
-
-
-
-
-
-
-
-