摘要:
Disclosed are computational methods of clustering a set of protein structures based on local and pair-wise global similarity values. Pair-wise local and global similarity values are generated based on pair-wise structural alignments for each protein in the set of protein structures. Initially, the protein structures are clustered based on pair-wise local similarity values. The protein structures are then clustered based on pair-wise global similarity values. For each given cluster both a representative structure and spans of conserved residues are identified. The representative protein structure is used to assign newly-solved protein structures to a group. The spans are used to characterize conservation and assign a “structural footprint” to the cluster.
摘要:
Disclosed are computational methods, and associated hardware and software products for scoring conservation in a protein structure based on a computationally identified family or cluster of protein structures. A method of computationally identifying a family or cluster of protein structures in also disclosed herein.
摘要:
Disclosed are computational methods, and associated hardware and software products for scoring conservation in a protein structure based on a computationally identified family or cluster of protein structures. A method of computationally identifying a family or cluster of protein structures in also disclosed herein.
摘要:
A strip for placement across a sheet of a stack of banded sheet material and having end segments positionable against the sides of the stack. Flexible webs join the strip segments to the major part of the strip and permit bending of the end segments into place against the stack. Wall surfaces of the strip define an open area into which a clamp and clamped segments of a band may be displaced without damaging contact with the surface of the adjacent sheet of material.
摘要:
A set of known protein sequences associated with an organism is identified, wherein each known protein sequence comprises a plurality of ordered residues. A set of scores associated with a set of residues of the plurality of ordered residues is identified, wherein each score indicates a frequency of a residue in sequence context. A set of unique sub-sequences of the set of known protein sequences is identified. A plurality of protein signature residues is determined based on the set of scores associated with the set of residues and the set of unique sub-sequences.