AIE an Big Data

sweetleafapartInternet και Εφαρμογές Web

7 Αυγ 2012 (πριν από 5 χρόνια και 1 μήνα)

667 εμφανίσεις

Partner Briefing

Completing the Big Data Picture

INSIGHT THAT MATTERS

Proprietary & Confidential

Agenda


Intro
to Big Data/Extreme Information and Attivio's Value
Proposition



AIE
Big Data Use Cases



AIE
Big Data Reference Architecture and
Features



Big
Data Competitive Landscape and
Alliances



GTM
Assets
Overview



Q
and
A



Next Steps

Proprietary & Confidential

The Big Data Movement


Rapid increases in
data volumes, velocity
, and
variety
and
complexity of processing
cannot be handled by
traditional RDBMS technologies, or are cost
-
prohibitive


Volumes

of data: 10s to 100s of TBs to petabytes (PB) of data


Velocity

can be as high as billions of records a day


Variety

includes multiple sources of unstructured, semi
-
structured and
structured data



Tremendous business advantage in unlocking the
information contained in Big Data through traditional,
discovery
-
oriented, and real
-
time analytics

Proprietary & Confidential

Drivers of Big Data


Exponential growth of
machine generated data


Prohibitive legacy
infrastructure costs:


Proprietary hardware


Non
-
linear scaling


Cluster administration


Inability of traditional
DW/BI/analytics for ad
hoc, exploratory analytics
on large, unprocessed
volumes of data

Proprietary & Confidential

“Unstructured” Data in the Big Data Market

Structured Data


Sourced from relational
databases

Semi
-
Structured
Data


Also known as
Unstructured Data
or

Non
-
Relational Data


Contains tags or other
markers to readily parse
data fields, etc.


Clickstream data, web
logs, etc.

Unstructured
Content


Any type of free
-
form
text information


Documents (500+
formats); scanned
documents; email; web
content; SharePoint;
knowledge bases; etc.

Proprietary & Confidential

Big Data: Just One Part of
Extreme Information


Big Data only addresses
the challenge of
volume
,
but not:


Variety


Velocity


Complexity


This broader enterprise
picture is referred to as
Extreme Information



Source:
'Big Data' Is Only the Beginning of Extreme Information Management,

April 7, 2011, Gartner Group


Proprietary & Confidential

Attivio’s Extreme Information
Value Proposition

Extreme Information

Attivio completes the Big
Data picture:


Add “why” insights from
unstructured content


Access

all

data types for
BI and decision making
with one query


Turn
Big Data into
real
-
time
Active
Information
that
initiates action


Eliminate latency of
information that causes
“blind spots”


Proprietary & Confidential

Unlocking the Value of Extreme Information

$300 Billion
: Potential annual value to US healthcare industry



250 Billion
: Potential annual value to Europe's Public Sector
administration


more than GDP of Greece


$600 Billion
: Potential annual consumer surplus from using personal
location data globally


60% Potential increase
in retailer’s operating margins possible with
big data


Source: Big Data: The Next Frontier Innovation, Competition, and Productivity

McKinsey Global Institute

May 2011

Proprietary & Confidential

Partner Opportunity for AIE


Prospect Engagement
:


Big data and analytics are top of mind in enterprise accounts
(business and IT)


Many customers have invested in DWH and BI and are looking
to extend capabilities with unstructured data


Solution Architecture Services:


Projects are complex and will drive Architecture and design
services for Attivio Partners


Implementation Services:


Deploying and integrating AIE will drive services for Attivio
Partners


Multiple application opportunities within accounts

AIE XT Use Cases

INSIGHT THAT MATTERS

Proprietary & Confidential

Problem


System outages costing millions per year in slower cash collections, lost productivity.


Service interruptions taking too long to resolve.

Very high:
Critical, diverse information sources
include application log data; documents
(SharePoint, Documentum, etc.); HP Service
Center data; People Central data

Very high:
Troubleshooting
content
scattered across
60+
internal
sources.
Also, must identify specific log data in
real time that indicate a system problem

Extreme Information in Action: Fidelity

High:
Heavy velocity and massive volume of
log data from over
90

internal applications

Proprietary & Confidential

Solution: AIE


AIE integrates relevant system log data with all internal support
content (knowledge bases, wikis, documents, etc.) in one portal


AIE identifies log data indicating a system failure in real time;
sends alert to IT team with all relevant troubleshooting content


Role
-
based dashboards
track activities and performance



Outcomes


Mean time to resolution (MTTR) cut from 90 minutes of
manual processes to mere seconds


Millions of dollars in reduced costs, restored productivity


Over 90 applications now supported and served by AIE solution

Problem


System outages costing millions per year in slower cash
collections, lost productivity


Service interruptions taking too long to resolve

Application Issue Resolution System

Proprietary & Confidential

Use Case #1 : 360
-
degree Point of Resolution for System Failures

Industry:
Cross
-

vertical


Situation
: Disruption of systems infrastructure components and / or applications which impacts
availability of the systems for the end
-
user


Critical Issues
: System and or application failure can


depending on the industry


lead to a loss of
revenue and penalties for violation of SLAs both internal to the
LoB

and external to partners
(ex. financial trading partners); high support costs; productivity loss


Key Challenges
: Access and real
-
time integration of multiple data sources both structured


system
event logs


and unstructured


knowledge bases, wiki site content, external website content


Key Capabilities Required
: Ability to integrate multiple diverse content and data sources and ability
to respond to inbound issues to provide complete context from those sources; scaling;
reporting and analysis of service metrics with all relevant data


pushed to users via role
-
based
active dashboards and alerts


Impact
: Reduced mean time to resolution (MTTR) and mean time between failures (MTBF), cutting
costs and increasing productivity







Proprietary & Confidential

Use Case #2 : Claims Validation and Processing

Industry:

Insurance ( Auto; Home; Healthcare)


Situation
: Vast amounts of historical and real
-
time data concerning insurance policies and claims
reside within insurance data centers and are not easily accessible


Critical Issues:

Inability to effectively mine and analyze data can lead to customer satisfaction
issues, inability to develop predictive models and resultant savings losses for insurers, as well
as inability to identify potential claims fraud in a timely manner


Key Challenges
: Simplified access to multiple sources (claims descriptions in claims files, emails,
underwriters written evaluations, responses to open
-
ended surveys on customer sat surveys
of data) and the ability to classify the data through text analytics and mining to derive
predictive models; correlation of unstructured and structured data pertaining to a claim and
associated policy; identification of discrepancies in data patterns based on historical and
current analysis to identify potential fraudulent claims


Key Capabilities Required
: Ability to integrate multiple diverse content and data sources; scaling;
robust text analytics and text mining; workflow processes that can be linked into existing
claims processing operations


Impact
: Increased customer satisfaction; improved predictive capabilities; reduced fraudulent payouts







Proprietary & Confidential

Use Case #3 : Financial Transaction Compliance

Industry:

Financial Services


Situation
: Increasing regulations are causing financial firms to not only increase the amount of
staff to monitor global regulatory agencies, and align with financial transactions, but also
tying up budget to mitigate risk associated with non
-
compliance resulting in monetary
penalties


Critical Issues:
Ability real
-
time monitor and align regulations with financial activities


Key Challenges
: Ability to streamline the collection/reporting of metrics and policy activities for
monitoring and audit purposes


Key Capabilities Required
: Capture and processing of diverse sources and information types in
real
-
time; alerting of changes to regulations; risk assessment reporting; automation of
compliance processes


Impact
: Resource reduction in monitoring staff, fines and amount of budget tied up in risk pool




Proprietary & Confidential

Use Case #4 : Customer Experience and Loyalty

Industry:

Cross Industry (
Telcom
, Financial Service, Retail…)


Situation
: Reduction in customer churn and the ability to establish a long
-
term customer


vendor
relationship is critical in revenue stabilization and revenue growth in terms of cross
-
sell. Massive
amounts of data are available to provide leading indicators and early alerts as to changes in
behavior


Critical Issues:

Especially in consumer electronics, information service industries, and on
-
line retail
there are increasing sources of data that provide contextual information


the “why” not just
the “what”


that should be linked to the buyer or the product associated with the buyer


Key Challenges
: Alignment of data sources and the integration of contextual information and
sentiment analysis with web log, CRM, and other sources of structured and semi
-
structured data


Key Capabilities Required:
Ability to access and align external data sources and internal call logs,
customer care files, and blogs with transactional information providing usage information;
ability to integrate campaign information and associate with changes in customer behavior;
ability to correlate in near real
-
time.


Impact
:
Top line revenue







Reference Architecture

And

Feature Overview

INSIGHT THAT MATTERS

Proprietary & Confidential

Attivio Integration with Existing MPP/ADBMS Infrastructures

MPP Infrastructure

Real
-
time Feed

Batch Loads

Big Data Analytics
Store
(ADBMS)

BI Tools


(Cognos, BO, Tableau)

Ad
Hoc

Analysis Tools

UIA Applications

Packaged Applications

SEARCH

ACTIVE DASHBOARD

Attivio Active Intelligence
Engine (AIE)

High
-
speed, Dynamic

Columnar
Data Store

Data Compression

Proprietary & Confidential

Attivio: Unified Information Access

MPP Infrastructure

Real
-
time Feed

Batch Loads

Attivio Active Intelligence
Engine (AIE)

BI Tools


(Cognos, BO, Tableau)

Ad
Hoc

Analysis Tools

UIA Applications

Packaged Applications

SEARCH

ACTIVE DASHBOARD

Universal Information Repository

Search API

ODBC/JDBC

Big Data Analytics
Store
(ADBMS)

High
-
speed, Dynamic

Columnar
Data Store

Data Compression

AIE

Workflow

Recommendation Module

Text Analytics/Data Mining • Classification Module • Alerts

Proprietary & Confidential

Attivio: Unified Information Access

MPP Infrastructure

Real
-
time Feed

Batch Loads

BI Tools


(Cognos, BO, Tableau)

Ad
Hoc

Analysis Tools

UIA Applications

Packaged Applications

SEARCH

ACTIVE DASHBOARD

Big Data Analytics
Store
(ADBMS)

High
-
speed, Dynamic

Columnar
Data Store

Data Compression

Attivio Active Intelligence
Engine (AIE)

Universal Repository

Search API

ODBC/JDBC

AIE

Workflow

Recommendation Module

Text Analytics/Data Mining • Classification Module • Alerts

End
-
user interfaces
included with AIE

Shared
-
nothing parallel architecture
optimized for structured and
unstructured information that
supports keyword search and SQL

Proprietary & Confidential

Attivio: Text Analytics

MPP Infrastructure

Real
-
time Feed

Batch Loads

BI Tools


(Cognos, BO, Tableau)

Ad
Hoc

Analysis Tools

UIA Applications

Packaged Applications

SEARCH

ACTIVE DASHBOARD

Big Data Analytics
Store
(ADBMS)

High
-
speed, Dynamic

Columnar
Data Store

Data Compression

Attivio Active Intelligence
Engine (AIE)

Universal Information Repository

Search API

ODBC/JDBC

AIE

Workflow

Recommendation Module

Text Analytics/Data Mining • Classification Module • Alerts

Classification Engine m
odule
automates
categorization
and
tagging of all content

Recommendation
Engine m
odule
provides
recommendations based
on observed user
behavior, etc.

Text Analytics enrich unstructured content
with “why” dimensions for analytics insight:


Entity Extraction


Sentiment Analysis … and more

Proprietary & Confidential

Attivio: Workflow and Alerts

MPP Infrastructure

Real
-
time Feed

Batch Loads

BI Tools


(Cognos, BO, Tableau)

Ad
Hoc

Analysis Tools

UIA Applications

Packaged Applications

SEARCH

ACTIVE DASHBOARD

Big Data Analytics
Store
(ADBMS)

High
-
speed, Dynamic

Columnar
Data Store

Data Compression

Attivio Active Intelligence
Engine (AIE)

Universal Information Repository

Search API

ODBC/JDBC

AIE

Workflow

Recommendation Module

Text Analytics/Data Mining • Classification Module • Alerts

Automated processing of
information
(content, data or
queries) in
desired sequences

Triggers
within workflows can
send alerts directly to
users

Proprietary & Confidential

Attivio: XT Connector Modules

MPP Infrastructure

Real
-
time Feed

Batch Loads

BI Tools


(Cognos, BO, Tableau)

Ad
Hoc

Analysis Tools

UIA Applications

Packaged Applications

SEARCH

ACTIVE DASHBOARD

Big Data Analytics
Store
(ADBMS)

High
-
speed, Dynamic

Columnar
Data Store

Data Compression

Attivio Active Intelligence
Engine (AIE)

Universal Information Repository

Search API

ODBC/JDBC

AIE

Workflow

Recommendation Module

Text Analytics/Data Mining • Classification Module • Alerts

Load data from
:


Hadoop


Cloudera


ADBMS stores via JDBC (Vertica
,
Aster Data, Greenplum,
etc
.)


Push
MapReduce output to AIE via
client APIs


E
xisting
data marts, data cubes or
warehouses

Proprietary & Confidential

AIE XT Modules Pricing

Connector

Price *

Hive,

HDFS,
HBase

Cloudera

MapReduce

SDK

Module

Price *

ODBC/JDBC

Recommendation
Engine

Classification

Module

Connectors

Modules

Service

Price *

XT Services Pack

Professional Services

**

XT Services
Pack
-

up to 250 hours professional services:



Extreme
Information analysis assessment & report




Configuration
of up to 3 DB connections (max 10M
records



Configuration
of up to 2 file system stores (max 10M
documents



Configuration
of up to 3 key XT Module connectors




Includes
security/ACL configuration




Post
connectivity test & quality check

* Refer directly to Partner
Price Book for all pricing.

Proprietary & Confidential

XT Module Availability

Connector

Delivery

timeframe

Hive,

HDFS,
HBase

Fall 2011

Cloudera

Fall 2011

MapReduce

SDK

Fall

2011

Module

Delivery

timeframe

ODBC/JDBC

AIE 3.0 (Sept 2011)

Recommendation
Engine

Immediate (services
delivery)

Classification

Module

GA

Connectors

Modules

Big Data

Competitive Landscape

INSIGHT THAT MATTERS

Proprietary & Confidential

Current Ecosystem

27

Data
Source

ETL

MDM

MPP/
NoS
QL

DW

ADBMS

BI/Repo
rting

Analytics

Data
Mining

IBM

Websphere
;
Filenet

; DB2

InfoSphere

DataStage

InfoSphere

Infosphere

BiigInsights

(
Hadoop
)

DB2

Netezza

Cognos

Cognos

SPSS

SAP

myBusiness

Suite; Sybase
AS

Netweaver

Netweaver

/
BI

Sybase AS

Sybase IQ;
BI
Accelerator;

HANA

Netweaver

/ BI;BO

Netweaver

BI;
BO;
myCRM

Oracle

11g; Oracle
Applications

Oracle
Warehouse

Builder

Oracle MDM
Suite

Qest

/
Cloudera


11g

Exadata

Oracle BI

Oracle

Analytics;
Siebel Analytics;
Hyperion

Oracle Data

Mining

MS

SQL
-
Server;

ERP;
SharePoint;
Exchange

SSIS SQL
-
Server

SQL
-
Server
MDM

LINQ

to
HPC/Project
Daytona

SQL
-
Server

PDW

SQL

Server
Analysis
Services

MS BI

MS BI

MS BI

SAS

DataFlux

DataFlux

Industry BI

Industry and
Horizontal
Analytics

Enterprise

Miner

Informatica

PowerCenter

Informatica

MDM

ILM
-

DB
Archive

TeraData

Cloudera


TeraData

Aster Data

Teradata

BI

Teradata

Anayltics

Talend

Open Source
ETL

Talend

MDM

Big Data Player

Proprietary & Confidential

Current Ecosystem

28

Data
Source

ETL

MDM

MPP/
NoS
QL

DW

ADBMS

BI/Report
ing

Analytics

Data
Mining

Pentaho

Data

Integration

BI
-
Suite

BI
-

Suite
-

Analysis

BI
-
Suite
-

Data Mining

JasperSoft

DI

BI

BI

clikView

BI

BI

EMC

Documentum

EMC®
Greenplum
®
HD
Community
and
Enterprise
Edition

GreenPlum


Info Builders

iWay
:
WebFocus

iWay

Reporting

Pred.
Anayltics

MicroStrategy

BI

Analytics

Cloudera

CDH2/CDH3

Proprietary & Confidential

Open Source Players


NoSQL


Hadoop

(
Hbase
, HDFS, Hive)


HPCC (LexisNexis)


Cassandra


CouchDB


Memcached


Membase


MongoDB


Redis


Riak


ADBMS


InfoBright


Proprietary & Confidential

Competitive Comparison

Feature

AIE

Hadoop

Netezza

Greenplum

AsterData

Vertica

Exadata

Endeca

Exalead

SQL

MapReduce

Keyword

search

Hybrid
Structured/
Unstructure
d

Queries

Shared
-
nothing

architecture

Text
Analytics

Natural
Language

Processing

Open BI Tool
Support

Real
-
time

Ingestion

In
-
Engine

Analytics

In
-
Engine

ETL

Pricing
Model

Per
Applic
ation

Open
Source

Capacity
Based

Open
Source/Cap
acity Based

Capacity
Based

Capacity
Based

Capacity
Based

Capacity
Based

Capacity
Based





















































































Proprietary & Confidential

Endeca SWOT

Strengths


Dominates
eCommerce
solutions


Large eCommerce customer base


Newer
Latitude

product targeted to BI market


Most direct competitor to AIE


Offers some combination of inverted index (like AIE)
with columnar
DB store
(“
hybrid search
-
analytical
database
”)


Like AIE, no data model required, as in a typical DW


Presents itself as an Agile Enterprise BI solution


Offers very good drag
-
and
-
drop UI builder




Proprietary & Confidential

Endeca SWOT

Weaknesses


Does not support SQL or ODBC/JDBC


a major
deficiency for any product in BI space


Integration with 3
rd

party tools requires
xQuery


Endeca Latitude
competes

with BI tool vendors;
Attivio AIE
encourages

using BI tools as front end to
AIE, serving as the information source


In a 2/25/09 live seminar,
QlikTech

noted they
replaced Endeca at Harvard University.


Endeca needed six servers for what they are doing on one
QlikView server.
QlikTech

speaker said that “The
cost for
QlikView is less than the cost to renew
Endeca.”





GTM Enablement Tools

INSIGHT THAT MATTERS

Proprietary & Confidential

AIE for Big Data: Marketing Assets Review


Prospect Deck


White Paper


Data Sheet


Battle Card


Solution Prompters


FAQs

Q&A

INSIGHT THAT MATTERS

Proprietary & Confidential

Next Steps


Attivio to provide Big Data GTM Collateral


One
-
on
-
one GTM planning sessions with the Attivio
team