CONTACT US
Operationalizing the PageRank Algorithm: Protein-Protein Interaction Graph Analysis [Video]
Watch our video below for a quick PageRank algorithm demonstration using a protein-protein interaction graph. Graphable Lead Data Scientist Sean Robinson explores how to operationalize centrality measures such as PageRank from the Neo4j graph data science (GDS). The GraphAware Hume knowledge graph platform delivers the value of graph-based analytics into the hands of everyday analysts and decision-makers in an easy-to-interpret visualization.
Learn more:
- Check out the code: coming soon
- Read our blogs
- About Graphable
- About Neo4j Graph Data Science
- Explore more protein-protein interaction graphs
Video Transcript: PageRank Algorithm (Protein-Protein Interaction Graph Example)
Today weāre going to operationalize the PageRank Algorithm with a protein-protein interaction graph example. Weāll be using the Clinical Knowledge Graph (CKG) to surface which proteins are most important within a given protein complex. Then weāll see which biological processes are impacted by those most influential proteins.Ā

Adding a Style With Graph Data Science Algorithms
Iām going to start by looking up my complex in question. Letās expand the proteins for this complex to see what our interaction network looks like. Here we can see a number of proteins, but thereās quite a bit of chaos to our visualization. Letās put some order to that chaos using some graph data science algorithms.
Iām going to add a style to my proteins. Letās base it on the size of the protein to help those most influential proteins stand out. Iām going to select my PageRank attribute, which Iāve previously calculated and written in my database so Hume will be able to access to it.
Iām going to have it relative to my visualization (rather than globally across my entire database) to keep things in context in terms of what Iām seeing on the screen. Iām even going to increase my size multiplier a bit so that those most influential proteins really stand out.
Now letās add our rule: Our most influential proteins now stand out like a sore thumb, which is incredibly useful when we have a large, chaotic visualization.

Using Processes Action
Letās select those proteins and take a look at what processes they might be impacting. Iāll clear out the rest of my visualization. Now we have a much simpler picture of our most influential proteins. Iāll select all of those proteins and use my Processes action to see which biological processes theyāre affecting.

So here we can see theyāre impacting rRNA processing, translation, and a number of other biological processes. This can be incredibly useful when weāre trying to do something like targeted drug discovery for one of these processes.
So I hope this has provided useful insight into how to use your graph data science algorithms to filter and clean up your protein-protein interaction graph visualizations. I encourage you to think about what other visualizations and algorithms could be enhanced with the combination of the two. I look forward to seeing what you do!
Graphable delivers insightful graph database (e.g. Neo4j consulting) / machine learning (ml) / natural language processing (nlp) projects as well as graph and Domo consulting for BI/analytics, with measurable impact. We are known for operating ethically, communicating well, and delivering on-time. With hundreds of successful projects across most industries, we thrive in the most challenging data integration and data science contexts, driving analytics success.
Want to find out more about our Hume consulting on the Hume knowledge graph / insights platform? As the Americas principal reseller, we are happy to connect and tell you more. Book a demo by contacting us here.
Check out our article, What is a Graph Database? for more info.