Operationalizing the PageRank Algorithm: Protein-Protein Interaction Analysis

By Sean Robinson, MS / Director of Data Science

July 1, 2022


Watch our video below for a quick PageRank algorithm demonstration using a protein-protein interaction graph. Graphable Lead Data Scientist Sean Robinson explores how to operationalize centrality measures such as PageRank from the Neo4j Graph Data Science (GDS) library. The GraphAware Hume knowledge graph platform puts the value of graph-based analytics into the hands of everyday analysts and decision-makers through easy-to-interpret visualizations.


Video Transcript: PageRank Algorithm (Protein-Protein Interaction Graph Example)

Today we’re going to operationalize the PageRank Algorithm with a protein-protein interaction graph example. We’ll be using the Clinical Knowledge Graph (CKG) to surface which proteins are most important within a given protein complex. Then we’ll see which biological processes are impacted by those most influential proteins. 

Using PageRank algorithm on protein-protein interaction graph
Adding a Style With Graph Data Science Algorithms 

I’m going to start by looking up my complex in question. Let’s expand the proteins for this complex to see what our interaction network looks like. Here we can see a number of proteins, but there’s quite a bit of chaos to our visualization. Let’s put some order to that chaos using some graph data science algorithms.

I’m going to add a style to my proteins. Let’s make it a size style so that the most influential proteins stand out. I’m going to select my PageRank attribute, which I’ve previously calculated and written to my database so that Hume can access it.
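For readers who want to see what sits behind that PageRank attribute, here is a minimal sketch of the textbook power-iteration form of PageRank on an undirected interaction graph. It is not the Neo4j GDS implementation, and the absolute scores a graph library reports may be on a different scale. The protein names (`P1` to `P5`), the edge list, and the `pagerank` function are hypothetical, made up for illustration.

```python
from collections import defaultdict


def pagerank(edges, damping=0.85, iterations=50):
    """Textbook power-iteration PageRank on an undirected graph.

    `edges` is a list of (protein_a, protein_b) interaction pairs.
    Returns a dict mapping each protein to a score; scores sum to 1.
    """
    # Build an undirected adjacency map: each interaction counts both ways.
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)

    nodes = sorted(neighbors)
    n = len(nodes)
    # Start from a uniform distribution over all proteins.
    scores = {node: 1.0 / n for node in nodes}

    for _ in range(iterations):
        new_scores = {}
        for node in nodes:
            # Each neighbor shares its score equally among its own neighbors.
            incoming = sum(scores[nb] / len(neighbors[nb]) for nb in neighbors[node])
            new_scores[node] = (1.0 - damping) / n + damping * incoming
        scores = new_scores
    return scores


# Hypothetical toy complex: P1 interacts with three partners, P4 bridges to P5.
toy_edges = [("P1", "P2"), ("P1", "P3"), ("P1", "P4"), ("P4", "P5")]
toy_scores = pagerank(toy_edges)

for protein, score in sorted(toy_scores.items(), key=lambda kv: -kv[1]):
    print(f"{protein}: {score:.3f}")
```

In this toy graph the best-connected protein, `P1`, ends up with the highest score, which is the property the size style relies on.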

I’m going to scale it relative to my visualization (rather than globally across my entire database) to keep things in context in terms of what I’m seeing on the screen. I’m even going to increase my size multiplier a bit so that those most influential proteins really stand out.
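The difference between relative and global scaling can be sketched with a simple min-max normalization. This is an assumption about how such a style might work, not Hume's actual sizing formula; the `node_sizes` function, its `base_size` and `multiplier` parameters, and the scores below are all hypothetical.

```python
def node_sizes(scores, visible, base_size=1.0, multiplier=3.0, relative=True):
    """Map scores to display sizes for the nodes currently on screen.

    relative=True normalizes against the visible nodes only;
    relative=False normalizes against every score in `scores`.
    """
    pool = [scores[n] for n in visible] if relative else list(scores.values())
    lo, hi = min(pool), max(pool)
    span = hi - lo

    sizes = {}
    for node in visible:
        # Fraction of the way from the smallest to the largest score in the pool.
        fraction = (scores[node] - lo) / span if span > 0 else 0.0
        sizes[node] = base_size * (1.0 + multiplier * fraction)
    return sizes


# Hypothetical scores: HUB is somewhere else in the database, not on screen.
db_scores = {"P1": 0.2, "P2": 0.4, "P3": 0.8, "HUB": 5.0}
on_screen = ["P1", "P2", "P3"]

relative_sizes = node_sizes(db_scores, on_screen, relative=True)
global_sizes = node_sizes(db_scores, on_screen, relative=False)

print("relative:", relative_sizes)
print("global:  ", global_sizes)
```

With relative scaling the visible proteins span the full size range, while global scaling compresses them because an off-screen hub dominates the maximum. Raising `multiplier` widens the gap between the smallest and largest nodes.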

Now let’s add our rule: Our most influential proteins now stand out like a sore thumb, which is incredibly useful when we have a large, chaotic visualization.

Protein-protein interaction graph with increased size multiplier
Using Processes Action

Let’s select those proteins and take a look at what processes they might be impacting. I’ll clear out the rest of my visualization. Now we have a much simpler picture of our most influential proteins. I’ll select all of those proteins and use my Processes action to see which biological processes they’re affecting.
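Outside of the visualization, the same "keep the top proteins, then look up their processes" step could be sketched as follows. This is only an illustration under stated assumptions: the `top_proteins` and `processes_for` helpers, the scores, and the protein-to-process annotations are hypothetical and are not taken from the CKG or from Hume's action.

```python
from collections import defaultdict


def top_proteins(scores, k):
    """Return the k highest-scoring proteins (ties broken alphabetically)."""
    return sorted(scores, key=lambda p: (-scores[p], p))[:k]


def processes_for(proteins, annotations):
    """Group the selected proteins by the biological processes they annotate."""
    hits = defaultdict(list)
    for protein in proteins:
        for process in annotations.get(protein, []):
            hits[process].append(protein)
    return dict(hits)


# Hypothetical PageRank scores for the proteins in one complex.
complex_scores = {"P1": 0.35, "P2": 0.12, "P3": 0.12, "P4": 0.28, "P5": 0.13}

# Hypothetical protein-to-process annotations, invented for this example.
example_annotations = {
    "P1": ["rRNA processing", "translation"],
    "P2": ["rRNA processing"],
    "P4": ["translation"],
}

selected = top_proteins(complex_scores, k=2)
impacted = processes_for(selected, example_annotations)

print(selected)
print(impacted)
```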

Using Bio Processes action to analyze most influential proteins

So here we can see they’re impacting rRNA processing, translation, and a number of other biological processes. This can be incredibly useful when we’re trying to do something like targeted drug discovery for one of these processes.

So I hope this has provided useful insight into how to use your graph data science algorithms to filter and clean up your protein-protein interaction graph visualizations. I encourage you to think about what other visualizations and algorithms could be enhanced with the combination of the two. I look forward to seeing what you do!



Graphable helps you make sense of your data by delivering expert data analytics consulting, data engineering, custom dev and applied data science services.

We are known for operating ethically, communicating well, and delivering on-time. With hundreds of successful projects across most industries, we have deep expertise in Financial Services, Life Sciences, Security/Intelligence, Transportation/Logistics, HighTech, and many others.

Thriving in the most challenging data integration and data science contexts, Graphable drives your analytics, data engineering, custom dev and applied data science success. Contact us to learn more about how we can help, or book a demo today.


