-
公开(公告)号:US20230419972A1
公开(公告)日:2023-12-28
申请号:US18461457
申请日:2023-09-05
申请人: Sonos, Inc.
摘要: Systems and methods for audio processing include capturing first sound data via at least one microphone of a network microphone device (NMD) and determining, via a voice activity detection process, that the first sound data does not include voice activity. The first sound data is stored in a buffer, and the NMD forgoes spatial processing of the first sound data. The NMD can capture second sound data and determine, via the voice activity process, that the second sound data includes voice activity. The NMD spatially processes the second sound data to produce filtered sound data. The NMD detects a wake word based on data in the buffer. After detecting the wake word, the NMD may determine an action to be performed based on the data in the buffer.
-
公开(公告)号:US11854548B1
公开(公告)日:2023-12-26
申请号:US18057891
申请日:2022-11-22
发明人: Vincent Le Chevalier
CPC分类号: G10L15/22 , G06F16/635 , G10L15/1815 , G10L17/22 , H04R1/403 , G10L2015/088 , G10L2015/223
摘要: Systems and techniques for adaptive conversation support bot are described herein. An audio stream may be obtained including a conversation of a first user. An event may be identified in the conversation using the audio stream. A first keyword phrase may be extracted from the audio stream in response to identification of the event. The audio stream may be searched for a second keyword phrase based on the first keyword phrase. An action may be performed based on the first keyword phrase and the second keyword phrase. Results of the action may be out via a context appropriate output channel. The context appropriate output channel may be determined based on a context of the conversation and a privacy setting of the first user.
-
公开(公告)号:US11854353B2
公开(公告)日:2023-12-26
申请号:US17991023
申请日:2022-11-21
申请人: HANASIS CO., LTD.
发明人: Jung Yong Lee
CPC分类号: G07G1/0018 , G06V40/165 , G06V40/171 , G10L17/22
摘要: A height-adjustable kiosk apparatus includes: a kiosk unit configured to generate processed signals by recognizing and processing an image and a voice, to receive basic signals from a user, to perform output, and to process payment; and a height adjustment unit disposed under the kiosk unit, and equipped with a driving unit configured to be driven to move the kiosk unit up and down based on the processed signals; wherein the kiosk unit includes an image recognition unit, a voice recognition unit, an image processing unit, and a voice processing unit; and wherein the height adjustment unit includes a driving signal generation unit and a priority determination unit.
-
54.
公开(公告)号:US20230402042A1
公开(公告)日:2023-12-14
申请号:US18239619
申请日:2023-08-29
申请人: Rovi Guides, Inc.
摘要: Systems and methods are disclosed herein for identifying users based on voice data and media consumption data. A media guidance application may generate a voice signature from the user's input and based on that signature identify some demographic characteristics of the user (e.g., age, gender, and other suitable characteristics). The media guidance application may retrieve user data for users that are associated with a household of the user and attempt to identify which of the users spoke the command. If multiple users are identified, based on the demographic characteristics, the media guidance application may use the content of the voice command (e.g., a type of media requested) to identify the user.
-
公开(公告)号:US20230391592A1
公开(公告)日:2023-12-07
申请号:US18144759
申请日:2023-05-08
申请人: Walmart Apollo, LLC
发明人: Donald R. High , Michael D. Atchley , Shuvro Chakrobartty , Karl Kay , Brian G. McHale , Robert C. Taylor , John P. Thompson , Eric E. Welch , David C. Winkle
IPC分类号: B66F9/06 , G01C21/20 , G06Q50/30 , G06Q30/0601 , G06Q10/087 , G06Q10/02 , E01H5/12 , G01S1/72 , G06Q10/1093 , G05D1/02 , G06Q10/0631 , E01H5/06 , G06Q10/083 , G06Q50/28 , B62B5/00 , G06Q30/02 , G01S1/02 , H04N5/77 , G06Q10/30 , H04N7/18 , G06Q30/016 , H04W4/80 , G01S1/70 , H02J7/00 , G10L13/00 , G06V20/20 , G06V20/40 , G06V20/52 , G06V20/56 , G06V20/58 , G06V20/64 , G06V30/224 , G06F18/214 , G06T7/73 , G06T7/593 , H04W4/33 , H04W4/30 , H04W4/40 , H04N13/282 , B60L53/36 , B60L53/63 , A47F3/08 , A47F10/04 , A47F13/00 , A47L11/40 , B07C5/28 , B07C5/342 , B60P3/06 , B65F3/00 , G05B19/048 , G05D1/00 , G05D1/04 , G06F3/01 , G08G1/00 , G10L15/22 , G10L17/22 , H04B10/116 , H04L67/12 , H04L67/141 , H04L67/143 , H04W4/02
CPC分类号: B66F9/063 , G01C21/206 , G06Q50/30 , G06Q30/0633 , G06Q30/0605 , G06Q10/087 , G06Q30/0613 , G06Q10/02 , E01H5/12 , G01S1/72 , G06Q10/1095 , G05D1/028 , G06Q10/06311 , E01H5/061 , G06Q10/083 , G06Q50/28 , G06Q30/0601 , G06Q30/0619 , B62B5/0076 , G06Q30/0639 , G06Q30/0281 , G06Q30/0631 , G01S1/02 , G06Q30/0641 , H04N5/77 , G06Q10/30 , G06Q30/0617 , H04N7/183 , G06Q30/016 , G06Q30/0635 , H04W4/80 , G05D1/0297 , G01S1/7034 , H02J7/0013 , G10L13/00 , G01S1/70 , G06V20/20 , G06V20/40 , G06V20/52 , G06V20/56 , G06V20/58 , G06V20/647 , G06V30/224 , G06F18/214 , G06T7/74 , G06T7/593 , H04W4/33 , H04W4/30 , H04W4/40 , H04N13/282 , B60L53/36 , B60L53/63 , A47F3/08 , A47F10/04 , A47F13/00 , A47L11/4011 , B07C5/28 , B07C5/3422 , B60P3/06 , B62B5/0026 , B62B5/0069 , B65F3/00 , G05B19/048 , G05D1/0011 , G05D1/0016 , G05D1/0022 , G05D1/0027 , G05D1/0061 , G05D1/0088 , G05D1/021 , G05D1/0214 , G05D1/0219 , G05D1/0234 , G05D1/0246 , G05D1/0255 , G05D1/0276 , G05D1/0289 , G05D1/0291 , G05D1/0293 , G05D1/04 , G06F3/017 , G06Q10/0631 , G08G1/20 , G10L15/22 , G10L17/22 , H04B10/116 , H04L67/12 , H04L67/141 , H04L67/143 , H04N7/18 , H04N7/185 , H04W4/02 , Y10S901/01 , G06Q10/06315 , G05D2201/0216 , Y02W30/82 , H04W4/021
摘要: Apparatuses, components and methods are provided herein useful to provide assistance to customers and/or workers in a shopping facility. In some embodiments, a shopping facility personal assistance system comprises: a plurality of motorized transport units located in and configured to move through a shopping facility space; a plurality of user interface units, each corresponding to a respective motorized transport unit during use of the respective motorized transport unit; and a central computer system having a network interface such that the central computer system wirelessly communicates with one or both of the plurality of motorized transport units and the plurality of user interface units, wherein the central computer system is configured to control movement of the plurality of motorized transport units through the shopping facility space based at least on inputs from the plurality of user interface units.
-
公开(公告)号:US20230374746A1
公开(公告)日:2023-11-23
申请号:US18226132
申请日:2023-07-25
申请人: Walmart Apollo, LLC
IPC分类号: E01H5/06 , G06Q30/0601 , B66F9/06 , A47F10/04 , G05D1/02 , H04N7/18 , G06Q50/28 , G06T7/73 , G06T7/593 , H04W4/029 , H04W4/30 , H04W4/80 , H04W4/40 , H04N13/282 , B60L53/36 , B60L53/63 , A47F13/00 , A47L11/40 , B07C5/28 , B07C5/342 , B65F3/00 , E01H5/12 , G01S1/02 , G01S1/72 , G05B19/048 , G05D1/00 , G05D1/04 , G06F3/01 , G06Q10/02 , G06Q10/0631 , G06Q10/083 , G06Q10/1093 , G06Q10/30 , G06Q30/016 , G06Q30/02 , G06Q50/30 , G08G1/00 , G10L15/22 , G10L17/22 , H04B10/116 , H04L67/12 , H04L67/141 , H04L67/143 , H04N5/77 , H04W4/021 , H02J7/00 , G01S1/70 , G10L13/00 , G06V20/20 , G06V20/40 , G06V20/52 , G06V20/56 , G06V20/58 , G06V20/64 , G06V30/224 , G06F18/214 , G06Q10/087 , B62B5/00 , B60P3/06 , H04W4/33 , H04W4/02 , A47F3/08 , G01C21/20
CPC分类号: E01H5/061 , G06Q30/0601 , B66F9/063 , A47F10/04 , G05D1/028 , G06Q30/0633 , H04N7/183 , G06Q50/28 , G06T7/74 , G06T7/593 , H04W4/029 , H04W4/30 , H04W4/80 , H04W4/40 , H04N13/282 , B60L53/36 , B60L53/63 , A47F13/00 , A47L11/4011 , B07C5/28 , B07C5/3422 , B65F3/00 , E01H5/12 , G01S1/02 , G01S1/72 , G05B19/048 , G05D1/0016 , G05D1/0022 , G05D1/0027 , G05D1/0219 , G05D1/0234 , G05D1/0246 , G05D1/0255 , G05D1/0289 , G05D1/0291 , G05D1/0293 , G05D1/0297 , G05D1/04 , G06F3/017 , G06Q10/02 , G06Q10/0631 , G06Q10/06311 , G06Q10/083 , G06Q10/1095 , G06Q10/30 , G06Q30/016 , G06Q30/0281 , G06Q30/0605 , G06Q30/0613 , G06Q30/0617 , G06Q30/0619 , G06Q30/0631 , G06Q30/0635 , G06Q30/0639 , G06Q50/30 , G08G1/20 , G10L15/22 , G10L17/22 , H04B10/116 , H04L67/12 , H04L67/141 , H04L67/143 , H04N5/77 , H04N7/18 , H04N7/185 , H04W4/021 , H02J7/0013 , H02J7/0071 , G01S1/7034 , G01S1/7038 , G10L13/00 , G01S1/70 , G06V20/20 , G06V20/40 , G06V20/52 , G06V20/56 , G06V20/58 , G06V20/647 , G06V30/224 , G06F18/214 , G06Q10/087 , B62B5/0026 , B62B5/0069 , B60P3/06 , G06Q30/0641 , B62B5/0076 , G05D1/0011 , H04W4/33 , H04W4/02 , A47F3/08 , G01C21/206 , G05D1/0088 , G05D1/0276 , Y02W30/82 , G06F16/90335
摘要: Methods and apparatuses are provided for use in monitoring product placement within a shopping facility. Some embodiments provide an apparatus configured to determine product placement conditions within a shopping facility, comprising: a transceiver configured to wirelessly receive communications; a product monitoring control circuit coupled with the transceiver; a memory coupled with the control circuit and storing computer instructions that when executed by the control circuit cause the control circuit to: obtain a composite three-dimensional (3D) scan mapping corresponding to at least a select area of the shopping facility and based on a series of 3D scan data; evaluate the 3D scan mapping to identify multiple product depth distances; and identify, from the evaluation of the 3D scan mapping, when one or more of the multiple product depth distances is greater than a predefined depth distance threshold from the reference offset distance of the product support structure.
-
公开(公告)号:US11798559B2
公开(公告)日:2023-10-24
申请号:US17358461
申请日:2021-06-25
CPC分类号: G10L15/30 , G10L17/22 , G10L2015/223 , G10L2015/225 , H04M3/00
摘要: Systems and methods for establishing communication connections using speech, such as establishing calls between speech-controlled devices, are described. A first speech-controlled device receives a communication request in the form of audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient, a subject for the call, and a device associated with the recipient. The server then sends a message indicating the communication request and audio data corresponding to the communication topic to the recipient's speech-controlled device. The recipient device outputs audio to the recipient requesting whether the recipient accepts the communication request. The recipient audibly refuses or accepts the communication request, and the recipient's speech-controlled device sends an indication of the recipient's audible decision to the server. If the recipient accepted the communication request, the server causes a communication connection be established between the two speech-controlled devices.
-
58.
公开(公告)号:US20230335277A1
公开(公告)日:2023-10-19
申请号:US17720536
申请日:2022-04-14
CPC分类号: G16H50/20 , A61B5/1128 , G06V20/44 , G06V20/70 , A61B5/1117 , A61B5/1113 , G06T1/0014 , G10L17/22 , G16H80/00
摘要: An AI-based system and method for automatically monitoring the health of one or more users is disclosed. The method includes capturing one or more videos of one or more users and extracting a plurality of frames from each of the one or more videos. The method includes extracting a set of skeletal positions from each of the plurality of frames and performing one or more operations on the plurality of frames to normalize the set of skeletal positions. Furthermore, the method includes detecting a set of poses of the one or more users in the plurality of frames and determining an action performed by the one or more users in the plurality of frames by using an action determination-based AI model. The method includes determining a level of severity of the action and performing one or more responsive actions to provide medical assistance to the one or more users.
-
59.
公开(公告)号:US20230335138A1
公开(公告)日:2023-10-19
申请号:US17659199
申请日:2022-04-14
发明人: Christopher Nusbaum , Brian Toler
摘要: A system onboard an aircraft includes: a proximity sensor to detect users within an area; a display monitor in the area; a processor; and a processor-readable medium storing executable instructions to perform a method that involves: detecting presence of a user within the area, based on output of the sensor; causing display of an animated digital representation of an assistant on the monitor; controlling the animated digital representation of the assistant to react to speech input; processing speech input of the user to identify at least one action to be carried out onboard the aircraft; controlling the animated digital representation of the assistant to respond to the identified at least one action; and issuing at least one command, instruction, or control signal to the component onboard the aircraft, to initiate the identified at least one action.
-
公开(公告)号:US20230306970A1
公开(公告)日:2023-09-28
申请号:US17656294
申请日:2022-03-24
摘要: In some implementations, a front-end device may receive a physical identifier associated with the user. Accordingly, the front-end device may select a plurality of images, where each image corresponds to a unique integer of integers zero through nine. The front-end device may show, on a display, the plurality of images and receive audio that includes a sequence of words that describe a subset of the plurality of images. Accordingly, the front-end device may map the sequence of words to the subset of the plurality of images and determine a first sequence of numbers corresponding to the subset of the plurality of images. Therefore, the front-end device may authenticate the user based on the first sequence of numbers matching a second sequence of numbers associated with the user.
-
-
-
-
-
-
-
-
-