Theia Big Data Presentation - FIT Solutions

sillysepiaElectronics - Devices

Nov 27, 2013 (3 years and 7 months ago)

70 views

Presented: January 2013

BIG Data

Together we build the right solution

Agenda

Introductions

What is Big Data

Tools for Big Data

Why it is important

Next steps

Introductions

Dennis J Perlot: Founder & CTO, Theia Solutions


Over 25 years experience providing award winning, innovative IT solutions


Smithsonian Innovators Award



Global Innovation Award


Artificial Intelligence/Machine Learning


Technology Community Advocate


Speaker/ Technology Evangelist

Megan Cocuzzo: Director, Business Intelligence


Over 15 years experience leveraging “BIG DATA” to deliver innovative financial and resource optimization
strategies and tools


Financial Planning & Analysis


Capacity & Resource Planning


Opportunity and Risk Assessment


Capital Funding


Six Sigma Black Belt Professional


ISO 9000 Quality System Auditor



Theia Solutions LLC

“Together we build the right solution”


Socially responsible technology services


Application Development


Data Optimization


Cloud Hosting


Data Analytics


Why Theia?


We put people first


Partnerships

not just contracts


Innovative solutions


Solutions

People

Data
Analytics

Data
Agility

Technology

Cloud Hosting

Application
Development

What is “Big Data”?



Data sets that can not be processed with traditional tools such as
relational databases, requiring “massively parallel” approaches.


What is considered "big data" varies depending on the organization and
the applications that are used to process and analyze the data set in its
domain.


Traditional tools can not handle the 3 V’s:

Volume

Variety

Velocity

A visualization created by IBM of Wikipedia edits. At
multiple
terabytes

in size, the text and images of
Wikipedia are a classic example of big data.

The data explosion!!!



Volume


1000
Megabytes

= 1 Gigabyte


1000
Gigabytes

= 1 Terabyte


1000
Terabytes

= 1 Petabyte
[where most corporations are]


1000
Petabytes

= 1 Exabyte


1000
Exabytes

= 1 Zettabyte
[where Facebook and Google are]


1000
Zettabytes

= 1 Yottabyte


1000
Yottabytes

= 1 Brontobyte

Volume

Just how BIG…


Web logs and blogs


eCommerce


Mobile
-

4.5 billion phones


Sensors


temp, vibration, etc.


Smartphones


400 million worldwide


Over 50% of US cell users


Variety

Where does it come from?


eCommerce


56 million plus transactions in Q3 2012


RFID


location reporting


Large Hadron Collider: 700MB to 1 TB
per second


Cell phone location tracking


Must consider data in motion vs. data at rest

Velocity

How fast is it generated?

Big Data Trends and Tools

The Cloud

Hadoop

Data Scientist


Consider the electricity model


Do you build a power plant?


Do you run wires to your home?


Do you buy transformers, etc.


Let someone else worry about all that and just pay
for what you use.


This is cloud computing


Pay for what you use


Rapid elasticity


Location transparent resources



The Cloud


Infrastructure as a Service (
IaaS
)

“… servers, servers, get your servers here”



Platform as a Service (
PaaS
)

“… just give me a place for my
application and data”



Software as a Service (
SaaS
)

“… like Salesforce.com



The Cloud

Cloud Offerings

On
-
Premises

Storage

Servers

Networking

O/S

Middleware

Virtualization

Data

Applications

Runtime

You manage

Infrastructure

(as a Service)

Storage

Servers

Networking

O/S

Middleware

Virtualization

Data

Applications

Runtime

Other Manages

You manage

Platform

(as a Service)

Other Manages

You manage

Storage

Servers

Networking

O/S

Middleware

Virtualization

Applications

Runtime

Data

Software

(as a Service)

Other Manages

Storage

Servers

Networking

O/S

Middleware

Virtualization

Applications

Runtime

Data

IaaS

PaaS

SaaS

The Cloud

Separation of Responsibilities


Microsoft Azure Platform


SAS 70 Type 1 and Type 2 (now
SSAE 16)


ISO 27001


Safe Harbor


HIPPA


SOX


PCI DSS


Over 250 internal controls


More guards than engineers at most facilities

The Cloud

Is it Secure?

The Cloud

Who is using the cloud today?

Who is NOT using the cloud today……..

For your information….



1 billion:

Windows Live ID authentications each day



3 to 4 billion:

junk emails filtered daily



2 billion:

queries each month on Bing



100 million plus:

Windows Update users



6 Regional Data Centers
: 2 each in US, Europe, Asia



400,000 plus:

square footage in each datacenter


Azure Data Centers


Breaks problem down into smaller “chunks”


Why is it called Hadoop?


Doug Cutting was trying to think of a name for his
“map reduce” system


His son said “Why don’t you name it after my toy
elephant?


Hadoop

What the scoop?

Hadoop

Traditional Data Center

Hadoop Cluster

Comparison


Amazon/A9


Facebook


Fox interactive media


Google


IBM’s Watson


New York Times


J.P. Morgan


Rackspace


eBay


Yahoo!


More at http://wiki.apache.org/hadoop/PoweredBy


Hadoop

Who is using?


Monitor Hadoop in marketplace


Revise thinking on problems

“Why not record every mouse click?”

“If we capture it, we can process it”


Think about “recommender” apps


More is better!


Hadoop

Next Steps & Recommendations


Computer skills


Understands Relational Databases


Write SQL queries


Linking internal and external data


Statistics skills


Design “experiments”


Create analytical models


Top Job on LinkedIn

Data Scientist

Who Are They?

Why BIG Data Matters and

Human Patterns

Real
-
Time Data Integration

Competitive Intelligence

the importance of data agility

The next frontier for innovation, competition, and productivity

How is BIG data creating value?

Unlocking significant value by making information
transparent and usable at much higher frequency and
velocity

Expose variability and boost performance by using
data collection and analysis to conduct controlled
experiments to make better management decisions

Allows ever
-
narrower segmentation of customers and
therefore much more precisely tailored products or
services

How is BIG data creating value?

Allows ever
-
narrower segmentation of
customers and therefore much more precisely
tailored products or services

Basic low
-
frequency forecasting to high
-
frequency now casting to adjust their
business levers just in time

Used to improve the development of the next
generation of products and services


Our process begins with an end to end assessment
and documentation of your current capabilities and
data structure


No two organizations are alike


No two data sets are alike


We partner with you to develop a data strategy to
exceed your goals in the form of a strategic roadmap


The key drivers to operational health vary as do the
regulatory and compliance needs of each organization in each
market/sector


Theia Solutions LLC

Data Analytics Offerings

Theia Solutions LLC

Data Analytics Offerings


Phase




Description


Goal


Duration

Project Initiation, Getting
Started



Initial assessment of
analytic capabilities, meet
with biz and IT, identify
pain points

Produce tech roadmap and
project list with ROI
identified for each

2 days to 2 weeks

Establish Base
Capabilities



Projects from “Getting
Started” roadmap

Establish infrastructure
and data flows, data
warehouse, data marts,
key performance
indicators

4 weeks to 6 months

Extended Capabilities



Review efforts to date and
identify improvements

Examine advanced
capabilities like predictive
analytics

6 weeks to 1 year

Theia Solutions LLC


So, no matter what your need, Theia Solutions can
help you get there


Experienced, agile, specialized teams


Innovative Ideas, Old School Values


Long Terms Partnership with Clients

Questions

www.TheiaSolutionsLLC.com


Dennis.Perlot@theiasolutionsllc.com


Megan.Cocuzzo@theiasolutionsllc.com