Title: Protein Function Prediction Using Homologues
Speaker: Mr Joseph Bonello
Date: Mon 17th July 2017, 4pm
Location: ICT Faculty Building, CS seminar room 38, Block B, 1st floor (Wait for me by the trees at the entrance if you do not have key card access to the building).
Abstract: Homology refers to the existence of a common origin between a pair of proteins in different organisms. Proteins consist of multiple domains – conserved regions of a sequence and structure that can function independently from the rest of the protein chain. Protein function prediction methods based on homology, take advantage of the many pairwise homology relationships between individual domain sequences.
This project attempts to create a set of scores that can be used to predict the possible domain functions that a protein can possess. The study uses CATH Superfamilies and CATH Functional Families (FunFams) to generate the scores. CATH is a database that provides a hierarchical protein-domain classification for proteins obtained from PDB. The Superfamilies and FunFams provide a natural grouping for proteins that share the same evolutionary origin (homologous superfamilies). This grouping can be exploited to generate similarity scores between the domains and the families. Two methods have been developed for the purpose of function prediction based on these principles. The first method uses Set Theory, where the proteins belonging to a Superfamily or a FunFam are used to determine which GO Terms are more likely to occur in the group. The second method uses a statistical calculation to represent the presence of GO Terms in a family.