Download PDFOpen PDF in browserGeometric Algebra Models of Proteins for Three-Dimensional Structure PredictionEasyChair Preprint 868812 pages•Date: August 17, 2022AbstractA protein can be regarded as a chain of amino acids with unique folding in the three-dimensional (3D) space. Knowing the folding of a protein is highly desirable since the folding controls the protein properties. However, determining it experimentally is expensive and time consuming: estimating the 3D structure of a protein computationally - known as protein structure prediction (PSP) - can overcome these issues. In this paper, we explore the advantage of using Geometric Algebra (GA) to model proteins for PSP applications. In particular, we employ GA to define a metric of the orientation of the amino acids in the chain. We then encode this metric in matrix form and show how patterns in these images mirror folding patterns of proteins. Lastly, we prove that this metric is predictable through a standard deep learning (DL) architecture for the inference of pairwise amino acids distances. We demonstrate that GA is a powerful tool to obtain a compact representation of the protein geometry with potential to improve the prediction accuracy of standard PSP pipelines. Keyphrases: Geometric Algebra, deep learning, protein structure prediction
|