GMOD in the Cloud
Genome Informatics
November 3, 2011
Scott Cain
GMOD Project Coordinator
Ontario Institute for Cancer Research
scott@scottcain.net
Click to edit the title text format
Introduction: GMOD is …
•
A set of interoperable open
-
source
software
components for visualizing, annotating, and
managing biological data.
•
An active
community
of developers and
users asking diverse questions, and facing
common challenges, with their biological data.
Click to edit the title text format
Who uses GMOD?
Plus hundreds of others
Click to edit the title text format
GMOD in the Cloud
What GMOD in the cloud isn't:
Clouds
Guy getting
blown up
Garry's MOD (aka gmod.com)
Click to edit the title text format
Several GMOD Cloud Projects
Galaxy
-
Web
-
based platform for data intensive
biomedical research
CloVR
-
Automated and portable sequence
analysis
GBrowse2
-
Web
-
based, scalable genome
browser
cloud.gmod.org
-
Several integrated GMOD
tools
http://gmod.org/wiki/Cloud
Click to edit the title text format
Galaxy Cloudman
Get Galaxy without the data or usage
limitations.
Combine with Cloud BioLinux to have access
to MANY tools.
Create an analysis cluster in minutes.
Use autoscaling to get good performance at
low cost.
http://wiki.g2.bx.psu.edu/Admin/Cloud
Click to edit the title text format
Deploying Galaxy cluster on AWS
1.
2.
3.
4.
Click to edit the title text format
Exercising elasticity with autoscaling
Computation time: 9 hrs
Fixed cluster size
5
nodes
Computation cost: $20
20
nodes
Computation cost: $50
Computation time: 6 hrs
1 to 16
nodes
Computation time: 6 hrs
Dynamic cluster size
Computation cost: $20
Click to edit the title text format
CloVR
Cloud Virtual Resource.
Automated pipeline for sequence analysis.
Uses 2 GMOD tools: Workflow and Ergatis.
Use a virtual machine locally to interact with
resources in the cloud.
http://clovr.org/
Click to edit the title text format
CloVR Architecture
Click to edit the title text format
Why the virtual machine?
Running the pipeline happens on the local machine,
while the heavy lifting is done on the cloud/cluster
Click to edit the title text format
GBrowse2
Installed and configured recent release of
GBrowse2.
Tools to allow automatically adding rendering
servers.
Ability to add standard data sets.
http://gmod.org/wiki/GBrowse
Click to edit the title text format
GBrowse2
Yeast
Fly
Worm
Human
Amazon Snapshots
Render
Slaves
Master
GBrowse2 in the Cloud
Click to edit the title text format
Click to edit the title text format
cloud.gmod.org
Tripal
Drupal
-
based web frontend
Chado
Generic organism DB schema
GBrowse
Venerable genome browser
JBrowse
Fast, AJAX genome browser
Sample data
Saccharomyces cerevisiae
GMOD tools preinstalled:
Can be run as a micro machine (albeit slowly)
Click to edit the title text format
A little more on Tripal
Based on the popular CMS Drupal.
Several modules written to serve as an
interface for Chado:
Controlled Vocabularies
Features
Analyses
Libraries
Stocks
Integrated job management
Click to edit the title text format
Click to edit the title text format
Click to edit the title text format
Click to edit the title text format
Potential use case for Cloud GMOD
Community annotation:
Just add a web
-
start Apollo and set the security
group to allow it to connect to the database.
When WebApollo is ready, it's even easier: WA is
an addon to JBrowse but allows collaborative
editing.
Tripal and Drupal allow editing of most data types in
Chado, and commenting on pages similar to a
blog.
Click to edit the title text format
Why use the cloud?
Avoid installation related issues (saves you time
and frustration!)
Save money (how much, of course, depends)
Availability of common genomic data sets
(several projects already make these
available at AWS)
Click to edit the title text format
Future work
Get GBrowse2 AMI public (very soon)
Add Apollo to gmod.cloud.org (relatively soon)
Add WebApollo to gmod.cloud.org (as soon
as it's released)
Click to edit the title text format
Conclusion
http://gmod.org/wiki/Cloud for more information
on GMOD work in the cloud.
http://cloud.gmod.org/ for a running example of
cloud.gmod.org.
http://clovr.org/ for more info on CloVR and to
download the client VM.
http://getgalaxy.org/ for more information on
getting Cloudman.
Click to edit the title text format
Acknowlegements
•
Funding agencies: NIH, USDA ARS, NSF,
Ontario Ministry of Economic Development and
Innovation
•
Lincoln Stein, Chris Vandevelde
•
Enis Afgan and the Galaxy Team
•
Sam Angiuoli
et al
at UofM SOM
•
Stephen Ficklin and the Tripal group
•
Mitch Skinner and JBrowse developers
•
The rest of the GMOD community
Enter the password to open this PDF file:
File name:
-
File size:
-
Title:
-
Author:
-
Subject:
-
Keywords:
-
Creation Date:
-
Modification Date:
-
Creator:
-
PDF Producer:
-
PDF Version:
-
Page Count:
-
Preparing document for printing…
0%
Comments 0
Log in to post a comment