INDRA Interactive Map

Describe your mechanism.

Growth-factor proteins activate EGFR, ERBB2, FGFR, PDGFR, MET, ROS1, and ALK.
EGFR, ERBB2, PDGFR, MET, ROS1, ALK and FGFR activate GRB2 and SHC.
GRB2 and SHC activate RASGRF and SOS.
GRB2 binds SHC.
SOS and RASGRF activate HRAS, NRAS and KRAS.
RASGRP activates HRAS, KRAS and NRAS.
SPRY deactivates HRAS, KRAS and NRAS.
RASA/ARHGAP35 deactivates HRAS, NRAS and KRAS.
RASAL deactivates HRAS, NRAS and KRAS.
NF1/SPRED deactivates HRAS, NRAS and KRAS.
RASA/ARHGAP35 deactivates RHOA, RHOB and RHOC.
RASAL deactivates RHOA, RHOB and RHOC.
NF1/SPRED deactivates RHOA, RHOB and RHOC.
HRAS, NRAS and KRAS activate RALGDS.
RALGDS activates RALA and RALB.
HRAS, NRAS and KRAS activate ARAF, BRAF and RAF1.
ARAF, BRAF and RAF1 activate MAP2K1 and MAP2K2.
MAP2K1 and MAP2K2 activate MAPK1 and MAPK3.
MAPK1 and MAPK3 activate ETS, JUN and FOS.
KSR binds ARAF, BRAF and RAF1.
KSR binds MAP2K1 and MAP2K2.
KSR binds MAPK1 and MAPK3.
ETS, FOS and JUN activate MDM2, CCND1 and DUSP.
MDM2 deactivates TP53.
CCND1 activates CDK4 and CDK6, which deactivate RB1.
DUSP deactivates MAPK1 and MAPK3.
SOS and RASGRF activate RHOA, RHOB and RHOC.
RHOA, RHOB and RHOC activate ROCK1 and ROCK2.
HRAS, NRAS and KRAS activate PIK3C and PIK3R.
PIK3C and PIK3R activate PIP3.
PTEN deactivates PIP3.
PIP3 activates PDPK1, AKT and TIAM.
PDPK1 activates AKT.
AKT deactivates TSC1/TSC2.
TSC1/TSC2 deactivates RHEB.
RHEB activates mTORC2.
STK11 activates PRKAA, PRKAB and PRKAG.
PRKAA, PRKAB and PRKAG deactivate mTORC2.
mTORC2 deactivates EIF4EBP1.
mTORC2 activates RPS6KB1 and RPS6KB2.
TIAM activates RAC, which activates PAK.

NLP system to read with

Choose a cell line.

Quick Start

Welcome to the INDRA Interactive Pathway Map (IPM) built and maintained by the INDRA team at the Harvard Program in Therapeutic Science (HiTS).

Reading and using the pathway map:

Nodes represent biological entities mentioned in text.
Nodes representing families are subdivided into pie charts.
Wild-type genes are colored green.
Mutated genes are colored orange.
The intensity of each color corresponds to the expression level of a gene in CCLE.
Clicking any node provides additional context by linking out to external resources.
Clicking any edge will allow a user to query INDRA DB for evidence.
The force-directed active layout can be disabled by toggling the “Forces” button.

Build your own model from text

Open the Build tab.
Enter a list of sentences describing biological mechanisms.
Click load and exit the menu.
Your model should be ready momentarily.

Model context

Network models can be contextualized using the Context tab.
Users can load gene expression and mutation data from each of the 900+ cell lines available in the CCLE.

Contact

If you encounter any technical problems or would like to get in touch with the team please emails us at indra.sysbio@gmail.com.

Introduction

This website implements an interactive pathway map (IPM) built using INDRA, an automated model assembly system for molecular biology. The goal of INDRA-IPM is to allow users to build, contextualize, and share biological pathway models by describing them in natural language.

The visualization aims to display pathways in a visual style similar to that used by biologists in textbooks and presentations. In addition we offer a layer of contextualization and an interactive user interface:

Nodes represent biological entities mentioned in text.
Nodes representing families are subdivided into pie charts.
Wild-type genes are colored green.
Mutated genes are colored orange.
The intensity of each color corresponds to the expression level of a gene in CCLE.
Clicking any node provides additional context by linking out to CiteAb, HGNC, and UniProt.
Clicking any edge will allow a user to filter the sources and targets of that edge and make a request for evidence found in literature and curated database that is stored in INDRA DB.

We start off displaying a pre-built model which demonstrates all of these features. The RAS Pathway Map model was drawn by Dr. Frank McCormick in collaboration with the NCI RAS Initiative community.

Building models from text

Users have the ability to define their own biological models in text under the “Build” tab. Here, we start off with the text necessary to build The NCI RAS Pathway Map as an example. A full list of the mechanistic relationships that can be represented by INDRA (and therefore INDRA-IPM) can be found in the software documentation, and examples of models described in natural language (processed via the TRIPS system and assembled by INDRA) can be found in Gyori, Bachman, et. al. (2017).

Users should note that the natural language processing systems are fairly robust but not without limitations. Proper grammar and punctuation should be used. The reading systems do not consider newlines to be sentence separators and may return erroneous output for sentences which are not terminated with a period.

The recognition and grounding of named entities (proteins, etc.) to database identifiers is done automatically. Nevertheless, using standardized names such as HGNC symbols (as opposed to informal synonyms) is preferred to avoid ambiguity. To normalize node names in the pathway map, the IPM performs name standardization, in which entities mentioned by their synonyms are normalized to standard names such as HGNC symbols (for instance, MEK1, Map2k1 and Mek1 are all normalized to the standard symbol MAP2K1). Note that by clicking on a node, a tooltip opens that allows linking out to databases (HGNC, UniProt, CiteAb), and checking the original text that the standardized node was created from.

INDRA-IPM also recognizes protein families and complexes and grounds them in the FamPlex ontology. In some cases, there is ambiguity in the name of a specific gene and a family it is part of. An example of this is the grounding of “JUN” from text to the JUN family, which also includes the JUN gene. In this case the user can use a synonym such as “c-JUN” that refers to the singular entity in order to reference only the gene and not the family.

We have exposed two reading systems to users. The REACH reader developed by the CLU Lab at the University of Arizona is an information extraction system for the biomedical domain, which aims to read scientific literature and extract cancer signaling pathways. We recommend users try REACH first due to its speed. The TRIPS/DRUM system developed by IHMC may offer greater mechanistic detail in some use cases (for instance, it supports recognizing complex molecular conditions such as “BRAF-V600E not bound to Vemurafenib”), but it requires significantly longer to run.

Contextualizing models

Users are able to project data from the Cancer Cell Line Encyclopedia (CCLE) onto their pathway maps. This is done automatically when the IPM is loaded initially (using the LOXIMVI skin cancer cell line) and can be changed to any other CCLE cell line in the Model Options dialogue panel. Wild type genes are colored green, while mutated genes are colored orange. Color intensity indicates the relative level of expression. Context is unavailable for gray nodes because they were not measured in CCLE.

Sharing models

Users can share models using the NDEx network sharing website . To upload the current model, click the “NDEX” button at the bottom of the interface, then click “Upload”. A link to NDEx will appear one the upload is complete.

One can load a model by entering the unique key at the end of this link (e.g., 9b901d8f-2e2d-11e9-9f06-0ac135e8bacf) into the Load field. Alternatively, one can share the link in the address bar (e.g., pathwaymap.indra.bio/?uuid=9b901d8f-2e2d-11e9-9f06-0ac135e8bacf) which will send a user to the IPM website and immediately load the shared model. Shared models preserve their text description, INDRA statements, graph layout, cell line context, and any evidence retrieved from INDRA DB.

Exporting models

Users can export models in a variety of formats.

INDRA JSON will export the model statements as a in the JSON format. These can be imported into INDRA or processed separately. The INDRA JSON format is specified in the software documentation.
PySB, SBML, BNGL, Kappa will export executable models in these formats. These modeling formalisms allow parameterizing and simulating models, and evaluating them against time-course data. Additionally, the Kappa IM option downloads an image of the rule-based model's influence structure. More information about these formats and tools supporting them is available at the following places:

PySB

http://pysb.org/

SBML

http://sbml.org/

BNGL:

Kappa

https://kappalanguage.org/

SBGN will export a model in the Systems Biology Graphical Notation format. Documentation and tools supporting SBGN are available. Note that layout information is not included in exported SBGN models, however tools such as Newt have built-in layout algorithms.
CX will export a model in the .cx format which can be opened in Cytoscape and also uploaded to NDEx. Cytoscape enables network visualization and provides access to a large ecosystem of analysis plugins; NDEx is a network sharing and versioning website with a programmatic API for accessing networks.
PNG will export a high-resolution .png image of the current graph. This feature is useful for taking snapshots of a pathway map for inclusion into documents or presentations.

In order to simplify the user interface, only PNG export is available on mobile devices with limited screen width.

Funding

This work was funded by ARO Grants W911NF‐14‐1‐0397 and W911NF‐15‐1‐0544 under the DARPA Big Mechanism and Communicating with Computers programs, and by NIGMS Grant P50GM107618.

Privacy

Our API backend receives user-generated requests such as those for reading, contextualization, and NDEx sharing.
Our server logs the IP addresses which make requests to the API.
The data from some user requests is forwarded to external APIs such as TRIPS (reading), cBioPortal (contextualization), NDEx in order to implement these functions.