Invention Publication
- Patent Title: TRAINING MACHINE LEARNING MODELS WITH SPARSE INPUT
-
Application No.: US18456792Application Date: 2023-08-28
-
Publication No.: US20240070459A1Publication Date: 2024-02-29
- Inventor: Artem Goncharuk , Robert Clapp , Kevin Forsythe Smith , Shiang Yong Looi , Ananya Gupta , Joses Bolutife Omojola , Min Jun Park
- Applicant: X Development LLC
- Applicant Address: US CA Mountain View
- Assignee: X Development LLC
- Current Assignee: X Development LLC
- Current Assignee Address: US CA Mountain View
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N5/04

Abstract:
This disclosure describes a system and method for effectively training a machine learning model to identify features in DAS and/or seismic imaging data with limited or no human labels. This is accomplished using a masked autoencoder (MAE) network that is trained in multiple stages. The first stage is a self-supervised learning (SSL) stage where the model is generically trained to predict data that has been removed (masked) from an original dataset. The second stage involves performing additional predictive training on a second dataset that is specific to a particular geographic region, or specific to a certain set of desired features. The model is fine-tuned using labeled data in order to develop feature extraction capabilities.
Information query