These exercises are about using IGV are relate to this training
session. The exercises walk you through some commonly used IGV
workflows to visualize data, so you can get familiar with the console
and tools available.
We will be using externally hosted data available from Encode
including ChIPseq and RNAseq data. Please feel free to try other data
sets once you have run through the tasks but remember IGV is hosted on
your machine so less powerful computers will feel the memory load of
large numbers of data sets.
ChIPseq data
This section looks at how to load and manipulate some ChIPseq data.
The data is from a lymphoblastoma cell line and contains ChIPseq samples
for several histone types. Loading IGV and moving around the gene of
interest.
- First, load IGV and select the hg19 genome.
- Navigate to chr12:6,641,585-6,649,537.
- Navigate to Gapdh.
- Zoom out to see the surrounding genes.
- Select the “RefSeq Genes” track and expand to see all transcript
isoforms.
- Scroll to the left to discover the Vamp1 gene.
- Click on gene to bring up a window with further information.
- Review gene information from NCBI link.
- Double click on Gapdh to recenter window on the gene.
- Add a region of interest at the Gapdh locus using the IGV menu *
Regions -> Region Navigator -> add
- Right click the “Region of interest” and edit description to show
“Active gene” Loading data
- Go to the IGV menu -> File -> load from server
- From the menu follow drop-down
- Tutorials -> UI Basics (Encode):
- GM12878 H3K27ac
- GM12878 H3K27me3
- GM12878 H3K36me3
- GM12878 H3K4me1
Controlling IGV display
- Select all tracks and then select “autoscale.”
- At the Gapdh locus, investigate the enrichment of signal over gene
body.
- Set all the tracks to maintain current data ranges (deselect
autoscale) Once set, navigate to PIANP.
- Note the difference in enrichment of ChIPseq.
- Add as “Region of interest” and edit description to show “Inactive
gene”
- Autoscale again and zoom out to compare enrichment across
neighboring genes (Gapdh and Pianp in same view).
- Go to Regions -> Gene Lists.. -> “proneural dev genes”
- Inspect the signal across genes to determine their expression
state.
- Click on individual genes and compare to neighboring genes, click
back to return to gene list view
- Return to main view by selecting all from chromosome dropdown.
- Color all track by a unique color. Good idea to make K27me3 the most
distinct color to rest. Autoscale all tracks.
- Select all tracks and create an overlay,
- select tracks -> Create Overlay Track
- Change track height to 100
- Select tracks -> Change Track Height
- Revisit Gapdh, PIANP and gene list. Scan across genome to identify
silent and active gene expression.
RNAseq data
- Start a new session
- Clear data by going to IGV menu -> File -> New Session…
Loading the data
- Go to the IGV menu -> File -> load from server
- From the menu follow drop-down
- Tutorials -> RNAseq (Body Map)
- Go to IGV menu -> Views -> Preferences -> Alignments or
RNA
- Tick “show junction track”
- Set “visibility range threshold’ to 15kb
- Go to SLC25A3 gene.
- Collapse reads track (named Heart/Liver)
Inspect RNAseq data
- Select tracks -> Color Alignments by -> Read Strand to
identify strand of transcript.
- Expand “Features track” to identify alternative
exons/transcripts.
- Inspect coverage tracks to discover areas of coverage unique to
Heart or Liver sample.
- Inspect junction tracks to evaluate alternative splicing of
transcripts between tissue
- Select junction tracks -> Expand
- Compare major (first) transcript variant in Heart and Tissue
- Click on junctions to identify start and end of spans. Another way
to inspect splicing
- Select tracks -> Shashimi plot
- Select Alignment Tracks –> Heart + Liver
- In shashimi plot window set min junction coverage to 500 for both
tracks.
- Save image and compare junctions across tissue.