Patent search ap:("Microsoft Technology Licensing Page LLC") AND inv:"Alexandros NEOFYTOU"

1.

发明申请
CLASSIFYING AUDIO SCENE USING SYNTHETIC IMAGE FEATURES 有权

公开(公告)号：US20210216817A1

公开(公告)日：2021-07-15

申请号：US16844930

申请日：2020-04-09

Applicant: Microsoft Technology Licensing, LLC

Inventor： Eric Chris Wolfgang SOMMERLADE , Yang LIU , Alexandros NEOFYTOU , Sunando SENGUPTA

IPC: G06K9/62 , H04N7/14 , H04N5/272

Abstract: A computing system includes an encoder that receives an input image and encodes the input image into real image features, a decoder that decodes the real image features into a reconstructed image, a generator that receives first audio data corresponding to the input image and generates first synthetic image features from the first audio data, and receives second audio data and generates second synthetic image features from the second audio data, a discriminator that receives both the real and synthetic image features and determines whether a target feature is real or synthetic, and a classifier that classifies a scene of the second audio data based on the second synthetic image features.

2.

发明申请
Estimating Illumination in an Environment Based on an Image of a Reference Object 有权

公开(公告)号：US20220116549A1

公开(公告)日：2022-04-14

申请号：US17067781

申请日：2020-10-12

Applicant: Microsoft Technology Licensing, LLC

Inventor： Alexandros NEOFYTOU , Eric Chris Wolfgang SOMMERLADE , Alejandro SZTRAJMAN , Sunando SENGUPTA

IPC: H04N5/262 , G06K9/46 , G06K9/62 , H04N5/235 , G06T7/194 , H04N5/232 , G06N3/08

Abstract: Technology is described herein that uses an object-encoding system to convert an object image into a combined encoding. The object image depicts a reference object, while the combined encoding represents an environment image. The environment image, in turn, depicts an estimate of an environment that has produced the illumination effects exhibited by the reference object. The combined encoding includes: a first part that represents image content in the environment image within a high range of intensities values; and a second part that represents image content within a low range of intensity values. Also described herein is a training system that trains the object-encoding system based on combined encodings produced by a separately-trained environment-encoding system. Also described herein are various applications of the object-encoding system and environment-encoding system.

3.

发明公开
CONTROLLING A FUNCTION VIA GAZE DETECTION 审中-公开

公开(公告)号：US20240256035A1

公开(公告)日：2024-08-01

申请号：US18588168

申请日：2024-02-27

Applicant: Microsoft Technology Licensing, LLC

Inventor： Steven N. BATHICHE , Eric Chris Wolfgang Sommerlade , Vivek PRADEEP , Alexandros NEOFYTOU

IPC: G06F3/01 , G06F21/32 , G06N3/04 , G06T7/73 , G06V40/16 , H04N23/00

CPC classification number: G06F3/013 , G06F21/32 , G06N3/04 , G06T7/73 , G06V40/16 , H04N23/00 , G06T2207/20084

Abstract: Aspects of the present disclosure relate to systems and methods for controlling a function of a computing system using gaze detection. In examples, one or more images of a user are received and gaze information may be determined from the received one or more images. Non-gaze information may be received when the gaze information is determined to satisfy a condition. Accordingly, a function may be enabled based on the received non-gaze information. In examples, the gaze information may be determined by extracting a plurality of features from the received one or more images, providing the plurality of features to a neural network, and determining, utilizing the neural network, a location at a display device at which a gaze of the user is directed.

4.

发明公开
ENHANCED USER EXPERIENCE THROUGH BI-DIRECTIONAL AUDIO AND VISUAL SIGNAL GENERATION 审中-公开

公开(公告)号：US20240054683A1

公开(公告)日：2024-02-15

申请号：US18383956

申请日：2023-10-26

Applicant: Microsoft Technology Licensing, LLC

Inventor： Sunando SENGUPTA , Alexandros NEOFYTOU , Eric Chris Wolfgang SOMMERLADE , Yang LIU

IPC: G06T9/00 , G06T3/60 , G10L19/012 , G10L25/51 , G06F18/21

CPC classification number: G06T9/00 , G06T3/60 , G10L19/012 , G10L25/51 , G06F18/21 , G10L2019/0002

Abstract: In various embodiments, a computer-implemented method of training a neural network for creating an output signal of different modality from an input signal is described. In embodiments, the first modality may be a sound signal or a visual image and where the output signal would be a visual image or a sound signal, respectively. In embodiments a model is trained using a first pair of visual and audio networks to train a set of codebooks using known visual signals and the audio signals and using a second pair of visual and audio networks to further train the set of codebooks using the augmented visual signals and the augmented audio signals. Further, the first and the second visual networks are equally weighted and where the first and the second audio networks are equally weighted.

5.

发明公开
RELIGHTING SYSTEM FOR SINGLE IMAGES 审中-公开

公开(公告)号：US20230206406A1

公开(公告)日：2023-06-29

申请号：US18116052

申请日：2023-03-01

Applicant: Microsoft Technology Licensing, LLC

Inventor： Alexandros NEOFYTOU , Eric Chris Wolfgang SOMMERLADE , Sunando SENGUPTA , Yang LIU

IPC: G06T5/00 , G06N3/08 , G06F18/214

CPC classification number: G06T5/005 , G06N3/08 , G06F18/214 , G06T2207/20081 , G06T2207/20084

Abstract: In various embodiments, a computer-implemented method of training a neural network for relighting an image is described. A first training set that includes source images and a target illumination embedding is generated, the source images having respective illuminated subjects. A second training set that includes augmented images and the target illumination embedding is generated, where the augmented images corresponding to the source images. A first autoencoder is trained using the first training set to generate a first output set that includes estimated source illumination embeddings and first reconstructed images that correspond to the source images, the reconstructed images having respective subjects that are i) from the corresponding source image, and ii) illuminated based on the target illumination embedding. A second autoencoder is trained using the second training set to generate a second output set that includes estimated augmented illumination embeddings and second reconstructed images that correspond to the augmented images.

6.

发明申请
GAZE ADJUSTMENT AND ENHANCEMENT FOR EYE IMAGES 有权

公开(公告)号：US20210097644A1

公开(公告)日：2021-04-01

申请号：US16696639

申请日：2019-11-26

Applicant: Microsoft Technology Licensing, LLC

Inventor： Eric Chris Wolfgang SOMMERLADE , Alexandros NEOFYTOU , Sunando SENGUPTA

IPC: G06T3/20 , G06T5/00 , G06T7/00 , G06K9/00

Abstract: A method for image enhancement on a computing device includes receiving a digital input image depicting a human eye. From the digital input image, the computing device generates a gaze-adjusted image via a gaze adjustment machine learning model by changing an apparent gaze direction of the human eye. From the gaze-adjusted image and potentially in conjunction with the digital input image, the computing device generates a detail-enhanced image via a detail enhancement machine learning model by adding or modifying details. The computing device outputs the detail-enhanced image.

7.

发明公开
Removing Artifacts in Images Caused by Light Emitted by Electronic Screens 审中-公开

公开(公告)号：US20240071042A1

公开(公告)日：2024-02-29

申请号：US17899325

申请日：2022-08-30

Applicant: Microsoft Technology Licensing, LLC

Inventor： Sunando SENGUPTA , Ebey Paulose ABRAHAM , Alexandros NEOFYTOU , Eric Chris Wolfgang SOMMERLADE

IPC: G06V10/60 , G06T5/50 , G06V10/141 , G06V10/25 , G06V10/77 , G06V40/16 , H04N7/15

CPC classification number: G06V10/60 , G06T5/50 , G06V10/141 , G06V10/25 , G06V10/7715 , G06V40/169 , H04N7/15 , G06T2207/10016 , G06T2207/10024 , G06T2207/10152 , G06T2207/20081 , G06T2207/20221

Abstract: An image-processing technique is described herein for removing a visual effect in a face region of an image caused, at least in part, by screen illumination provided by an electronic screen. The technique can perform this removal without advance knowledge of the nature of the screen illumination provided by the electronic screen. The technique improves the quality of the image and also protects the privacy of a user by removing the visual effect in the face region that may reveal the characteristics of display information presented on the electronic screen. In some implementations, the technique first adjusts a face region of the image, and then adjusts other regions in the image for consistency with the face region. In some implementations, the technique is applied by a videoconferencing application, and is performed by a local computing device.

8.

发明申请
ENHANCED USER EXPERIENCE THROUGH BI-DIRECTIONAL AUDIO AND VISUAL SIGNAL GENERATION 有权

公开(公告)号：US20220343543A1

公开(公告)日：2022-10-27

申请号：US17240510

申请日：2021-04-26

Applicant: Microsoft Technology Licensing, LLC

Inventor： Sunando SENGUPTA , Alexandros NEOFYTOU , Eric Chris Wolfgang SOMMERLADE , Yang LIU

IPC: G06T9/00 , G06T3/60 , G10L19/012 , G06K9/62 , G10L25/51

Abstract: In various embodiments, a computer-implemented method of training a neural network for creating an output signal of different modality from an input signal is described. In embodiments, the first modality may be a sound signal or a visual image and where the output signal would be a visual image or a sound signal, respectively. In embodiments a model is trained using a first pair of visual and audio networks to train a set of codebooks using known visual signals and the audio signals and using a second pair of visual and audio networks to further train the set of codebooks using the augmented visual signals and the augmented audio signals. Further, the first and the second visual networks are equally weighted and where the first and the second audio networks are equally weighted. In aspects of the present disclosure, the set of codebooks comprise a visual codebook, an audio codebook and a correlation codebook. These codebooks are then used to create an visual image from a sound signal and/or a sound signal from a visual image.

9.

发明申请
CONTROLLING A FUNCTION VIA GAZE DETECTION 有权

公开(公告)号：US20220221932A1

公开(公告)日：2022-07-14

申请号：US17146719

申请日：2021-01-12

Applicant: Microsoft Technology Licensing, LLC

Inventor： Steven N. BATHICHE , Eric Chris Wolfgang Sommerlade , Vivek PRADEEP , Alexandros NEOFYTOU

IPC: G06F3/01 , G06K9/00 , H04N5/225 , G06T7/73 , G06F21/32 , G06N3/04

Abstract: Aspects of the present disclosure relate to systems and methods for controlling a function of a computing system using gaze detection. In examples, one or more images of a user are received and gaze information may be determined from the received one or more images. Non-gaze information may be received when the gaze information is determined to satisfy a condition. Accordingly, a function may be enabled based on the received non-gaze information. In examples, the gaze information may be determined by extracting a plurality of features from the received one or more images, providing the plurality of features to a neural network, and determining, utilizing the neural network, a location at a display device at which a gaze of the user is directed.

10.

发明申请
EYE GAZE ADJUSTMENT 有权

公开(公告)号：US20220141422A1

公开(公告)日：2022-05-05

申请号：US17084937

申请日：2020-10-30

Applicant: Microsoft Technology Licensing, LLC

Inventor： Steven N. BATHICHE , Eric SOMMERLADE , Alexandros NEOFYTOU , Panos C. PANAY

IPC: H04N7/15 , G06K9/00 , G06T7/246 , G06K9/20 , G06T11/00 , G06N3/08

Abstract: A computing system, a method, and a computer-readable storage medium for adjusting eye gaze are described. The method includes capturing a video stream including images of a user, detecting the user's face region within the images, and detecting the user's facial feature regions within the images based on the detected face region. The method includes determining whether the user is completely disengaged from the computing system and, if the user is not completely disengaged, detecting the user's eye region within the images based on the detected facial feature regions. The method also includes computing the user's desired eye gaze direction based on the detected eye region, generating gaze-adjusted images based on the desired eye gaze direction, wherein the gaze-adjusted images include a saccadic eye movement, a micro-saccadic eye movement, and/or a vergence eye movement, and replacing the images within the video stream with the gaze-adjusted images.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification