Download PDFOpen PDF in browser

Geometric Algebra Models of Proteins for Three-Dimensional Structure Prediction

EasyChair Preprint 8688

12 pagesDate: August 17, 2022

Abstract

A protein can be regarded as a chain of amino acids with unique folding in the three-dimensional (3D) space. Knowing the folding of a protein is highly desirable since the folding controls the protein properties. However, determining it experimentally is expensive and time consuming: estimating the 3D structure of a protein computationally - known as protein structure prediction (PSP) - can overcome these issues. In this paper, we explore the advantage of using Geometric Algebra (GA) to model proteins for PSP applications. In particular, we employ GA to define a metric of the orientation of the amino acids in the chain. We then encode this metric in matrix form and show how patterns in these images mirror folding patterns of proteins. Lastly, we prove that this metric is predictable through a standard deep learning (DL) architecture for the inference of pairwise amino acids distances. We demonstrate that GA is a powerful tool to obtain a compact representation of the protein geometry with potential to improve the prediction accuracy of standard PSP pipelines.

Keyphrases: Geometric Algebra, deep learning, protein structure prediction

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@booklet{EasyChair:8688,
  author    = {Alberto Pepe and Joan Lasenby and Pablo Chacon},
  title     = {Geometric Algebra Models of Proteins for Three-Dimensional Structure Prediction},
  howpublished = {EasyChair Preprint 8688},
  year      = {EasyChair, 2022}}
Download PDFOpen PDF in browser