-
公开(公告)号:US11923044B1
公开(公告)日:2024-03-05
申请号:US16896907
申请日:2020-06-09
Applicant: Amazon Technologies, Inc.
Inventor: Alexander Sewall Ford , Vanessa Nguyen , Layne Christopher Price , Franziska Seeger , Yen Ling Adelene Sim
Abstract: Techniques for predicting a protein sequence are described. An exemplary method includes receiving a request to predict a missing area of a protein's primary sequence and a corresponding three-dimensional position of the missing area; applying a machine learning model to backbone Cartesian coordinates of the protein's primary sequence and a protein vector of a representation of the protein's primary sequence including the missing area to predict a missing area of the protein primary sequence and a corresponding three-dimensional position for the missing area, wherein the machine learning model is selected from the group consisting of: an attention-based machine learning model, a bidirectional long short term memory-based model, and a convolutional neural network-based model; and outputting a result of the machine learning model.