-
Publication No.: US20240203411A1
Publication date: 2024-06-20
Application No.: US18081580
Filing date: 2022-12-14
Applicant: GOOGLE LLC
Inventor: Dongeek Shin
CPC classification number: G10L15/22 , G10L15/08 , G10L2015/088 , G10L2015/223 , G10L2015/225
Abstract: Techniques are described herein for arbitration between automated assistant devices based on interaction cues. A method includes: receiving, via one or more microphones of a first computing device, first audio data that captures a spoken utterance of a user; determining that each of one or more additional computing devices has detected the spoken utterance of the user; determining that hotword arbitration is to be initiated between the first computing device and the one or more additional computing devices; for each of the first computing device and the one or more additional computing devices, identifying a similarity score for the computing device; selecting a target computing device, from the first computing device and the one or more additional computing devices, based on the similarity scores; and causing the target computing device to respond to a query that is included in the spoken utterance of the user.
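Illustrative note: the selection step in the abstract above can be sketched as follows. This is a minimal sketch and not the claimed method. The abstract does not say how similarity scores are computed or compared, so the names `DeviceScore` and `select_target_device`, and the rule "the highest score wins", are assumptions made only for illustration.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class DeviceScore:
    """Hypothetical record pairing a device with its similarity score."""
    device_id: str
    similarity: float


def select_target_device(scores: Sequence[DeviceScore]) -> str:
    """Return the id of the device with the highest similarity score.

    Assumption: a higher score means a better candidate. On ties, the
    first device in the sequence is returned.
    """
    if not scores:
        raise ValueError("at least one device score is required")
    return max(scores, key=lambda s: s.similarity).device_id


candidates = [
    DeviceScore("kitchen_speaker", 0.42),
    DeviceScore("living_room_display", 0.87),
]
target = select_target_device(candidates)
```

Here `target` names the device that would answer the query; the other devices would stay silent.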
-
Publication No.: US20240203013A1
Publication date: 2024-06-20
Application No.: US18199030
Filing date: 2023-05-18
Applicant: LG ELECTRONICS INC.
Inventor: Taeju HWANG , Jongjin PARK , Yubin YOON , Taehwan HWANG , Hyoyoung KIM
CPC classification number: G06T13/205 , G06T7/70 , G06T13/40 , G06T19/20 , G06V30/19093 , G10L15/22 , G06T2219/2016 , G10L2015/223
Abstract: An artificial intelligence device is configured to: when a spoken sentence of a three-dimensional agency is generated, extract a keyword of the spoken sentence; acquire related content associated with the keyword of the spoken sentence to detect positions of an object and text in the related content; when an object and text corresponding to the keyword of the spoken sentence exist in the related content, map the positions of the object and the text corresponding to the keyword of the spoken sentence to three-dimensional coordinates; output the related content to a surrounding space of the three-dimensional agency; and control the operation of the three-dimensional agency so that the three-dimensional agency performs an utterance operation corresponding to the spoken sentence and an indication operation of indicating three-dimensional coordinates at which the object and the text of the related content are located.
-
Publication No.: US12014737B2
Publication date: 2024-06-18
Application No.: US17530227
Filing date: 2021-11-18
Applicant: Microsoft Technology Licensing, LLC
Inventor: Heiko Rahmel , Li-Juan Qin , Xuedong Huang , Wei Xiong
IPC: G10L15/22 , G06F3/01 , G06F3/0484 , G06F3/04842 , G06N20/00 , G10L15/08 , G10L15/30 , G10L25/48 , G10L25/90
CPC classification number: G10L15/22 , G06F3/017 , G06F3/04842 , G06N20/00 , G10L15/08 , G10L25/48 , G10L2015/088 , G10L2015/223 , G10L15/30 , G10L25/90
Abstract: Systems, methods, and computer-readable storage devices are disclosed for generating smart notes for a meeting based on participant actions and machine learning. One method including: receiving meeting data from a plurality of participant devices participating in an online meeting; continuously generating text data based on the received audio data from each participant device of the plurality of participant devices; iteratively performing the following steps until receiving meeting data for the meeting has ended, the steps including: receiving an indication that a predefined action has occurred on the first participating device; generating a participant segment of the meeting data for at least the first participant device from a first predetermined time before when the predefined action occurred to when the predefined action occurred; determining whether the receiving meeting data of the meeting has ended; and generating a summary of the meeting.
-
Publication No.: US12014735B2
Publication date: 2024-06-18
Application No.: US17390119
Filing date: 2021-07-30
Applicant: Hyundai Motor Company , Kia Corporation
Inventor: Ki Chang Kim , Dong Chul Park , Tae Kun Yun , Jin Sung Lee
CPC classification number: G10L15/22 , G10L25/63 , G10L2015/223
Abstract: An emotion adjustment system for determining a user's emotions based on a user's voice includes: a microphone configured to receive the user's voice; a controller configured to extract a plurality of sound quality factors in response to processing the user's voice, calculate a depression index of the user based on at least one sound quality factor among the plurality of sound quality factors, identify an emotional state of the user as a depressive state when the depression index is a preset value or more, determine the depressive state as a first state or a second state based on a correlation between at least two sound quality factors among the plurality of sound quality factors, and transmit a control command corresponding to the emotional state of the user identified as the first state or the second state; and a feedback device configured to perform an operation corresponding to the control command.
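Illustrative note: the two-stage decision in the abstract above (a threshold on a depression index, then a split into a first or second state based on a correlation between two sound quality factors) might look roughly like this. The abstract gives no formulas, so everything concrete here is an assumption: the index is taken as the mean of one factor, the correlation is Pearson's r, and the function names and both threshold values are hypothetical.

```python
from math import sqrt
from typing import Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation of two equal-length series (neither may be constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def classify_emotion(
    factor_a: Sequence[float],
    factor_b: Sequence[float],
    index_threshold: float = 0.5,  # assumed preset value
    corr_threshold: float = 0.0,   # assumed split point
) -> str:
    """Return 'not_depressive', 'first_state', or 'second_state'."""
    # Assumption: the depression index is the mean of one sound quality factor.
    depression_index = sum(factor_a) / len(factor_a)
    if depression_index < index_threshold:
        return "not_depressive"
    # Depressive state: split on the correlation between the two factors.
    r = pearson(factor_a, factor_b)
    return "first_state" if r >= corr_threshold else "second_state"


state = classify_emotion([0.6, 0.7, 0.8], [1.0, 2.0, 3.0])
```

The returned label would then be mapped to a control command for the feedback device, a step the sketch leaves out.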
-
Publication No.: US12014117B2
Publication date: 2024-06-18
Application No.: US17301703
Filing date: 2021-04-12
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , He Lu , Willy Lew Yuk Vong , Michael Dale Whiteley , Fred Torok , Shikher Sitoke , David Ross Bronaugh , Bo Li
CPC classification number: G06F3/167 , G10L15/02 , G10L15/18 , G10L15/22 , G10L15/26 , G10L17/22 , H04L12/2816 , G10L2015/223
Abstract: Techniques for creating groups of devices for controlling these groups with voice commands are described herein. For instance, an environment may include an array of secondary devices (or “smart appliances”, or simply “devices”) that are configured to perform an array of operations. Users may request to create different groups of these devices, such that the users may control entire groups at a single time with individual voice commands.
-
Publication No.: US20240195652A1
Publication date: 2024-06-13
Application No.: US18444212
Filing date: 2024-02-16
Applicant: Kohler Co.
Inventor: Rafael Rexach , Jessica Schroeder , Miguel Arciniega , Ashley Springer , Shi Chao Zhang , Marwan Estiban , Anne Krauter , Perry Erickson , Anil Pendyala , Erin Geiger
IPC: H04L12/28 , A47K5/12 , A47K10/32 , A47K10/36 , A47K13/30 , E03C1/05 , E03D5/10 , E03D9/00 , G05B15/02 , G06F3/16 , G10L15/18 , G10L15/22 , G10L21/06
CPC classification number: H04L12/282 , A47K5/1217 , A47K13/305 , E03C1/055 , E03C1/057 , E03D5/105 , E03D9/002 , G05B15/02 , G10L15/1822 , G10L15/22 , G10L21/06 , H04L12/2816 , A47K2010/3226 , A47K2010/3668 , A47K2201/00 , G05B2219/2642 , G06F3/167 , G10L2015/223
Abstract: A voice controlled device comprises a housing, a dock, a coupling mechanism, and a microphone. The dock is configured to connect the housing to a plurality of host appliances. The coupling mechanism is configured to receive an identification value indicative of docking between the voice controlled device and a currently connected host appliance of the plurality of host appliances. The microphone is configured to receive one or more voice inputs for the currently connected host appliance. A command is provided based on the one or more voice inputs and the identification value.
-
Publication No.: US12010487B2
Publication date: 2024-06-11
Application No.: US17750598
Filing date: 2022-05-23
Applicant: Thomas Stachura
Inventor: Thomas Stachura
IPC: G10L15/18 , G06F3/01 , G10L15/08 , G10L15/22 , G10L15/30 , G10L17/24 , G10L25/51 , G10L25/78 , H04R3/00 , H04R5/04 , H04R29/00
CPC classification number: H04R29/004 , G06F3/011 , G06F3/017 , G10L15/08 , G10L15/18 , G10L15/22 , G10L15/30 , G10L17/24 , G10L25/51 , G10L25/78 , H04R3/005 , H04R5/04 , G10L2015/088 , G10L2015/223 , G10L2025/783 , H04R2420/00 , H04R2420/01 , H04R2499/11
Abstract: Systems, apparatuses, and methods are described for a privacy blocking device configured to prevent receipt, by a listening device, of video and/or audio data until a trigger occurs. A blocker may be configured to prevent receipt of video and/or audio data by one or more microphones and/or one or more cameras of a listening device. The blocker may use the one or more microphones, the one or more cameras, and/or one or more second microphones and/or one or more second cameras to monitor for a trigger. The blocker may process the data. Upon detecting the trigger, the blocker may transmit data to the listening device. For example, the blocker may transmit all or a part of a spoken phrase to the listening device.
-
Publication No.: US20240185854A1
Publication date: 2024-06-06
Application No.: US18442565
Filing date: 2024-02-15
Applicant: JVCKENWOOD Corporation
Inventor: Takuji TERUUCHI
CPC classification number: G10L15/22 , G10L15/25 , G10L25/63 , G10L2015/223
Abstract: A vehicular recording control device includes: an imaging data acquiring unit configured to acquire first imaging data from a first camera imaging surroundings of a host vehicle and second imaging data from a second camera imaging a cabin of the vehicle; a sightline determining unit configured to determine the direction of a sightline of an occupant of the vehicle from the second imaging data; a voice recognizing unit configured to recognize a voice command for instructing event recording; an output control unit configured to output information indicating the voice command to the occupant when the sightline determining unit determines that the sightline of the occupant is directed to a display unit; and a recording control unit configured to store the first imaging data as event data when the voice recognizing unit recognizes the voice command.
-
99.
Publication No.: US20240185849A1
Publication date: 2024-06-06
Application No.: US18075214
Filing date: 2022-12-05
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Matthew Sharifi
CPC classification number: G10L15/22 , G06F3/011 , G06T19/006 , G10L2015/223
Abstract: Implementations set forth herein relate to an automated assistant that can be accessible via a virtual environment for controlling features of the virtual environment and/or devices in a physical environment of the user. When the automated assistant is invoked, the automated assistant can materialize in the virtual environment according to any request that the automated assistant has been invoked to fulfill. For example, depending on the request from the user, the automated assistant can cause rendering of a virtual object for fulfilling the request and/or controlling an ongoing operation of the automated assistant. When the virtual object is rendered to control an operation of the automated assistant, or another application, the virtual object can include a virtual feature that the user can interact with to control the virtual environment and/or devices in a physical environment of the user.
-
100.
Publication No.: US20240185848A1
Publication date: 2024-06-06
Application No.: US18075155
Filing date: 2022-12-05
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Matthew Sharifi
IPC: G10L15/22
CPC classification number: G10L15/22 , G10L2015/223 , G10L2015/225 , G10L2015/228
Abstract: Systems and methods for creating a group automated assistant session and processing requests that are intended for the users that are included in the group. A plurality of users can indicate intentions to create a group session that includes selecting an automated assistant, from the automated assistants executing on the devices of the user and providing the selected automated assistant with audio data that is captured by microphones of the user devices. In response, the selected automated assistant processes the audio data and generates a response that is provided, via one or more speakers of the device that is executing the selected automated assistant. Further, fulfillment data is provided to the automated assistants executing on other devices and, in response to being provided the fulfillment data, each automated assistant causes audio data to be rendered, via one or more speakers of each respective device, that is responsive to the request.
-