-
公开(公告)号:US09864431B2
公开(公告)日:2018-01-09
申请号:US15152401
申请日:2016-05-11
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
Inventor: Cem Keskin , David Kim , Bill Chau , Jaeyoun Kim , Kazuhito Koishida , Khuram Shahid
CPC classification number: G06F3/015 , G06F3/017 , G06F2203/0381 , G06T19/006
Abstract: Computer systems, methods, and storage media for changing the state of an application by detecting neurological user intent data associated with a particular operation of a particular application state, and changing the application state so as to enable execution of the particular operation as intended by the user. The application state is automatically changed to align with the intended operation, as determined by received neurological user intent data, so that the intended operation is performed. Some embodiments relate to a computer system creating or updating a state machine, through a training process, to change the state of an application according to detected neurological data.
-
公开(公告)号:US10564713B2
公开(公告)日:2020-02-18
申请号:US16243897
申请日:2019-01-09
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cem Keskin , Khuram Shahid , Bill Chau , Jaeyoun Kim , Kazuhito Koishida
IPC: G09G5/00 , G06F3/01 , G06F3/0484 , G06F3/16 , G06T19/00
Abstract: Computer systems, methods, and storage media for generating a continuous motion control using neurological data and for associating the continuous motion control with a continuous user interface control to enable analog control of the user interface control. The user interface control is modulated through a user's physical movements within a continuous range of motion associated with the continuous motion control. The continuous motion control enables fine-tuned and continuous control of the corresponding user interface control as opposed to control limited to a small number of discrete settings.
-
公开(公告)号:US09741354B2
公开(公告)日:2017-08-22
申请号:US15143387
申请日:2016-04-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Kazuhito Koishida , Sanjeev Mehrotra , Chao He , Wei-Ge Chen
IPC: G10L19/00 , G10L19/16 , G10L19/002 , G10L19/038 , G10L19/04 , G10L19/008 , G10L19/022 , G10L19/03 , G10L19/24
CPC classification number: G10L19/167 , G10L19/002 , G10L19/008 , G10L19/022 , G10L19/03 , G10L19/038 , G10L19/04 , G10L19/24
Abstract: An audio decoder provides a combination of decoding components including components implementing base band decoding, spectral peak decoding, frequency extension decoding and channel extension decoding techniques. The audio decoder decodes a compressed bitstream structured by a bitstream syntax scheme to permit the various decoding components to extract the appropriate parameters for their respective decoding technique.
-
公开(公告)号:US11244696B2
公开(公告)日:2022-02-08
申请号:US16783021
申请日:2020-02-05
Applicant: Microsoft Technology Licensing, LLC
Inventor: Kazuhito Koishida , Michael Iuzzolino
IPC: G10L21/0364 , G06K9/00 , G10L15/22 , G10L15/25 , G10L25/18
Abstract: Example speech enhancement systems include a spatio-temporal residual network configured to receive video data containing a target speaker and extract visual features from the video data, an autoencoder configured to receive input of an audio spectrogram and extract audio features from the audio spectrogram, and a squeeze-excitation fusion block configured to receive input of visual features from a layer of the spatio-temporal residual network and input of audio features from a layer of the autoencoder, and to provide an output to the decoder of the autoencoder. The decoder is configured to output a mask configured based upon the fusion of audio features and visual features by the squeeze-excitation fusion block, and the instructions are executable to apply the mask to the audio spectrogram to generate an enhanced magnitude spectrogram, and to reconstruct an enhanced waveform from the enhanced magnitude spectrogram.
-
公开(公告)号:US10721594B2
公开(公告)日:2020-07-21
申请号:US14316667
申请日:2014-06-26
Applicant: Microsoft Technology Licensing, LLC
Inventor: Raja Bose , Hiroshi Horii , Jonathan Lester , Ruchita Bhargava , Kazuhito Koishida , Michelle L. Holtmann , Christina Chen
IPC: H04W24/00 , H04W4/029 , H04L12/58 , G06Q30/02 , H04M1/725 , H04M3/42 , H04L29/08 , H04M3/487 , H04W4/12 , H04W4/021
Abstract: Mobile devices provide a variety of techniques for presenting messages from sources to a user. However, when the message pertains to the presence of the user at a location, the available communications techniques may exhibit deficiencies, e.g., reliance on the memory of the source and/or user of the existence and content of a message between its initiation and the user's visit to the location, or reliance on the communication accessibility of the user, the device, and/or the source during the user's location visit. Presented herein are techniques for enabling a mobile device, at a first time, to receive a request to present an audio message during the presence of the user at a location; and, at a second time, detecting the presence of the user at the location, and presenting the audio message to the user, optionally without awaiting a request from the user to present the message.
-
公开(公告)号:US20190212810A1
公开(公告)日:2019-07-11
申请号:US16243897
申请日:2019-01-09
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cem Keskin , Khuram Shahid , Bill Chau , Jaeyoun Kim , Kazuhito Koishida
IPC: G06F3/01 , G06F3/0484 , G06T19/00 , G06F3/16
CPC classification number: G06F3/011 , G06F3/015 , G06F3/017 , G06F3/04847 , G06F3/165 , G06T19/006 , G06T2200/24
Abstract: Computer systems, methods, and storage media for generating a continuous motion control using neurological data and for associating the continuous motion control with a continuous user interface control to enable analog control of the user interface control. The user interface control is modulated through a user's physical movements within a continuous range of motion associated with the continuous motion control. The continuous motion control enables fine-tuned and continuous control of the corresponding user interface control as opposed to control limited to a small number of discrete settings.
-
公开(公告)号:US09349376B2
公开(公告)日:2016-05-24
申请号:US14683074
申请日:2015-04-09
Applicant: Microsoft Technology Licensing, LLC
Inventor: Kazuhito Koishida , Sanjeev Mehrotra , Chao He , Wei-Ge Chen
IPC: G10L21/04 , G10L19/002 , G10L19/16 , G10L19/038 , G10L19/04 , G10L19/24
CPC classification number: G10L19/167 , G10L19/002 , G10L19/008 , G10L19/022 , G10L19/03 , G10L19/038 , G10L19/04 , G10L19/24
Abstract: An audio decoder provides a combination of decoding components including components implementing base band decoding, spectral peak decoding, frequency extension decoding and channel extension decoding techniques. The audio decoder decodes a compressed bitstream structured by a bitstream syntax scheme to permit the various decoding components to extract the appropriate parameters for their respective decoding technique.
-
公开(公告)号:US10203751B2
公开(公告)日:2019-02-12
申请号:US15152403
申请日:2016-05-11
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cem Keskin , Khuram Shahid , Bill Chau , Jaeyoun Kim , Kazuhito Koishida
IPC: G09G5/00 , G06F3/01 , G06F3/16 , G06T19/00 , G06F3/0484
Abstract: Computer systems, methods, and storage media for generating a continuous motion control using neurological data and for associating the continuous motion control with a continuous user interface control to enable analog control of the user interface control. The user interface control is modulated through a user's physical movements within a continuous range of motion associated with the continuous motion control. The continuous motion control enables fine-tuned and continuous control of the corresponding user interface control as opposed to control limited to a small number of discrete settings.
-
公开(公告)号:US20150213804A1
公开(公告)日:2015-07-30
申请号:US14683074
申请日:2015-04-09
Applicant: Microsoft Technology Licensing, LLC
Inventor: Kazuhito Koishida , Sanjeev Mehrotra , Chao He , Wei-Ge Chen
IPC: G10L19/002 , G10L19/038 , G10L19/04
CPC classification number: G10L19/167 , G10L19/002 , G10L19/008 , G10L19/022 , G10L19/03 , G10L19/038 , G10L19/04 , G10L19/24
Abstract: An audio decoder provides a combination of decoding components including components implementing base band decoding, spectral peak decoding, frequency extension decoding and channel extension decoding techniques. The audio decoder decodes a compressed bitstream structured by a bitstream syntax scheme to permit the various decoding components to extract the appropriate parameters for their respective decoding technique.
-
公开(公告)号:US12253620B2
公开(公告)日:2025-03-18
申请号:US17449054
申请日:2021-09-27
Applicant: Microsoft Technology Licensing, LLC
Inventor: Kazuhito Koishida , Alexander A. Popov , Uros Batricevic , Steven Nabil Bathiche
IPC: G10L15/22 , A61B5/00 , A61B5/0205 , A61B5/0507 , A61B5/11 , A61B5/117 , G01S5/18 , G01S5/28 , G01S13/72 , G06F1/3206 , G06F1/3231 , G06F1/324 , G06F3/01 , G06F3/03 , G06F3/0482 , G06F3/04842 , G06F3/16 , G06F18/20 , G06F18/25 , G06F18/28 , G06F18/40 , G06F21/32 , G06F21/35 , G06F40/211 , G06F40/35 , G06N5/025 , G06N5/047 , G06N20/00 , G06T7/246 , G06T7/292 , G06T7/60 , G06T7/70 , G06T7/73 , G06V10/94 , G06V20/40 , G06V20/52 , G06V20/64 , G06V30/262 , G06V40/10 , G06V40/16 , G06V40/20 , G07C9/28 , G08B13/14 , G10L15/02 , G10L15/06 , G10L15/08 , G10L15/18 , G10L15/19 , G10L15/24 , G10L15/26 , G10L15/28 , G10L15/32 , G10L17/04 , G10L17/08 , G10L25/51 , H04L9/40 , H04L51/02 , H04L67/12 , H04L67/50 , H04N7/18 , H04N21/231 , H04N21/422 , H04N21/442 , H04N23/11 , H04N23/611 , H04R1/40 , H04R3/00 , H04W4/029 , H04W4/33 , A61B5/05 , G01S5/16 , G01S11/14 , G01S13/38 , G01S13/86 , G01S13/88 , G06F3/0488 , G06F16/70 , G06N3/044 , G07C9/32 , G08B29/18 , G10L17/00 , H04N23/90
Abstract: An intelligent assistant records speech spoken by a first user and determines a self-selection score for the first user. The intelligent assistant sends the self-selection score to another intelligent assistant, and receives a remote-selection score for the first user from the other intelligent assistant. The intelligent assistant compares the self-selection score to the remote-selection score. If the self-selection score is greater than the remote-selection score, the intelligent assistant responds to the first user and blocks subsequent responses to all other users until a disengagement metric of the first user exceeds a blocking threshold. If the self-selection score is less than the remote-selection score, the intelligent assistant does not respond to the first user.
-
-
-
-
-
-
-
-
-