Design For Failure Is The Path To Success In Cloud

lovingbangInternet και Εφαρμογές Web

3 Νοε 2013 (πριν από 4 χρόνια και 5 μέρες)

120 εμφανίσεις

Design For Failure Is The
Path To Success In Cloud

Ashay Chaudhary

REQUIREMENTS

Journey thru the computing models

Evolution of Requirements


Mainframe


Desktop


Client
-
Server





Internet



Cloud Computing


Reliability


Availability


Serviceability


Performance

+


Security

+


Agility

AVAILABILITY

Non
-
Cloud Model

Guiding Principles


Design for Non
-
Failure


Deploy with Redundancy


Manage Effectively


Design for
Non
-
Failure


Quality
Hardware


Deploy with
Redundancy


Specialty
Hardware


Manage
Effectively


Expert Staff


Processes

AVAILABILITY

Cloud
Model

Guiding Principles


Design for Failure


Design for Redundancy


Monitor Extensively


Track Dependencies

Design For
Failure


Assume nothing


Expect failures


Anywhere and everywhere


If it is available now, doesn’t mean it is there later



Failures cascade


Unhandled failures propagate


Poorly handled failures adds complexity


Difficulty increases exponentially with complexity




Embrace failure, make it a first class citizen

Handle All Failures


Unhandled failures is a very bad idea


Poorly handled trivial failure in one part
becomes a critical one somewhere else


Two types of failures: Transient and
Resource


Transient failures are difficult, treat them like
Resource failures and fail fast


Delays are transient failures, define response
time guarantees


Failure injection is a lifestyle


Design For Redundancy


Eliminate single points of failure


Architect distributed applications


Minimize duration of
statefulness


Monitor Extensively


Self assess and report health


Complementary external monitoring


Load and latency monitoring


Proactively restart components


Track Dependencies


Identify
all

dependencies


Hardware, 3
rd

Party Libraries, Other servers, Network


Infrastructure/Platform
services, External services


Your own components


Track their health and availability



Key Takeaways


If there’s only
one

thing you could do


Design for Failure



It is a paradigm shift


It is a cultural change


It is
not easy




It is the key to success in the cloud


Ashay Chaudhary

Cloud Consultant

Corporate Education

Private Cloud Solutions

Highly
S
calable
SaaS

Applications

SaaS

Business Intelligence & Analytics



ashay@kloudpros.com

@
ashay_c