new in IBM InfoSphere Information Server - FTP Directory Listing

radiographerfictionData Management

Oct 31, 2013 (3 years and 9 months ago)

229 views

What’s new in IBM InfoSphere
®

Information Server v9.1
“Anywhere integration” for a new era of computing
2 What’s new in IBM InfoSphere Information Server v9.1
Contents
2 InfoSphere Information Server v9.1: Robust “anywhere
integration” capabilities for a new era of computing
4 InfoSphere Information Server for Data Integration:
Agile integration capabilities

Agile governance
– InfoSphere Data Click
– Enriched control of the operational environment

Dynamic integration
– Workload management
– Anywhere integration supports big data via Hadoop
– Anywhere integration supports balanced optimization
for DB2 for z/OS
– Private cloud support via IBM PureApplication System
– Business rules and policy management
10 InfoSphere Information Server for Data Quality: Sustainable
quality capabilities

New data validation capabilities
– Flexible data validation rule output table configuration
– Registration and reuse of data validation rule output tables
– Sequencing of data validation rules
– Distinct output records for data validation rule execution
– Impact analysis for data validation rules

InfoSphere Data Quality Console

InfoSphere QualityStage Standardization Rules Designer
15 InfoSphere Business Information Exchange: Business-
driven governance capabilities

Business-driven information integration
– Information governance policies and rules
– Advanced term relationships
– Data lineage enhancements
– BPMN2 support
– OSLC support

Simple access for enterprise collaboration
– Single sign-on support
– Blueprint viewer
23 Appendix: Feature-to-product mapping table
InfoSphere
®
Information Server v9.1:
Robust “anywhere integration” capabilities
for a new era of computing
Today, as a new era of computing unfolds with an explosion
in the volume, variety and velocity of data, integrating trusted
information is more important than ever so firms can be sure
that critical projects and key analytics initiatives will be successful.
In this new era of computing, it is now even more critical for
organizations of all sizes to have robust information integration
capabilities that support their business requirements. They need
to successfully and flexibly integrate data anywhere it resides,
with sources ranging from mainframe to big data, while applying
governance and data-quality best practices. InfoSphere Information
Server v9.1 delivers “anywhere” information integration capabilities
that an organization can use to succeed as the volume of data and
sources grows with accelerating speed and complexity. Because
IBM developed InfoSphere Information Server v9.1 with a focus
on agile integration, business-driven governance and sustainable
data quality, organizations gain the agile yet sophisticated
information integration capabilities necessary to thrive in
today’s exceedingly information-rich environment.
With the end-to-end information integration capabilities of
IBM InfoSphere Information Server, companies are able to
better understand, cleanse, monitor, transform and deliver their
data, as well as collaborate to bridge the gap between business
and IT. The “anywhere integration” capabilities of InfoSphere
Information Server v9.1 help firms ensure that the information
that drives their business and strategic initiatives – from big data
and point-of-impact analytics to master data management and
data warehousing – is trusted, consistent and governed.
InfoSphere Information Server, a highly scalable and flexible
integration platform, helps firms ensure that the information
that drives their business is trusted, consistent and governed
in real time. Since its inception, InfoSphere Information Server
has been a massively parallel processing (MPP) platform able
to support everything from small to very large data volumes
to meet your requirements, regardless of complexity.
IBM Software 3
The capabilities of InfoSphere Information Server v9.1 are
available in four essential packages that help firms target key
information challenges:

InfoSphere Information Server for Data Integration
Transform data in any style and deliver it to any system,
supporting faster time-to-value and reduced risk for IT.

InfoSphere Information Server for Data Quality
Establish and manage high-quality data, turning a deluge
of data into trusted information.

InfoSphere Business Information Exchange
Understand data and foster collaboration between IT
and line-of-business teams to narrow the communication
gap and create business-driven information integration.

InfoSphere Information Server Enterprise Edition
Gain the capabilities of all three individual packages in
one comprehensive package so firms can start information
integration efforts in one area, and then be ready to expand
when needed to further optimize results.
Figure 1:
InfoSphere Information Server Packages: InfoSphere Information Server for Data Integration, InfoSphere Information Server for Data Quality, InfoSphere Business
Information Exchange, and InfoSphere Information Server Enterprise Edition
*Separately orderable
InfoSphere Information Server Enterprise Edition
InfoSphere Business
Information Exchange
• Helps foster collaboration between business
and IT teams
• Enables creation of common business
language, rules and policies
• Automates discovery of relationships across
data sources
InfoSphere Information
Server for Data Quality
• Helps in understanding the structure of the
data and its relationships within and
across systems
• Provides data quality validation rules and
metrics to monitor and maintain quality
• Delivers comprehensive and customizable data
cleansing capabilities: automates source data
investigation, information standardization and
record matching based on business rules
InfoSphere Information
Server for Data Integration
• Transforms data in any style and delivers
it to any system
• Includes a web-based operations console
for easy runtime management
• Provides hundreds of built-in transformation
functions
• Supports multiple batch and real-time
data delivery options
• Provides balanced optimization* capabilities,
such as ETL, ELT or mixed TETL
• Massive scalability
• Broad, native connectivity to heterogeneous
data sources
*Separately orderable
4 What’s new in IBM InfoSphere Information Server v9.1
This white paper describes what’s new in InfoSphere Information
Server v9.1.
InfoSphere Information Server for Data
Integration: Agile integration capabilities
Wherever its information resides, a business can integrate it
quickly and flexibly with InfoSphere Information Server for
Data Integration v9.1. Easily manage information provisioning
for a data warehouse, integrate big data, consolidate applications,
deploy to a private cloud or maximize resources by leveraging
balanced optimization capabilities for IBM DB2
®
on System z.
Agile Integration and Provisioning
InfoSphere Data Click
InfoSphere Data Click is a new feature that simplifies information
provisioning – something that is often a recurring drain on
the enterprise’s IT staff. Because of the depth of skill required,
information provisioning used to be something left to technically
skilled ETL engineers. With InfoSphere Data Click capabilities,
novice or line-of-business users are able to easily retrieve data
and populate new systems, freeing up expert technical resources
to focus on activities with higher returns for the business.
To streamline and simplify these processes, the intuitive
user interface presents simple policy choices that users select,
entirely avoiding the need to write any code. In fact, the
InfoSphere Data Click experience can provide a privileged
user the ability to move thousands of tables, or load a data mart,
with two simple clicks. Additionally, InfoSphere Data Click then
automatically registers the design and operational metadata to
support built-in governance. InfoSphere Data Click helps speed
time-to-value, increases business agility and lowers costs by
shrinking the time required to complete tasks — from days
or weeks to minutes or hours.
In this exciting first release, InfoSphere Data Click helps
improve the timeliness of IBM PureData System for Analytics
(formerly IBM Netezza) warehouse environments by
intelligently selecting the right data integration engine for
moving information — batch workloads leveraging the scalable
InfoSphere DataStage runtime and real-time workloads based
on InfoSphere Data Replication log-based replication. These
capabilities provide for processing optimization by reducing the
time and processing required from source to target, reducing
network traffic and making data available when a full extract
is not possible. These levels of automation and the extreme
simplicity of InfoSphere Data Click set a new standard within
the data integration market.
IBM Software 5
 
Figure 2:
InfoSphere Data Click — easily select the target destination for data with a single click
6 What’s new in IBM InfoSphere Information Server v9.1
 
Figure 3:
InfoSphere Data Click – review settings for the offload request.
IBM Software 7
Enriched control of the operational environment
InfoSphere Information Server v9.1 includes new capabilities
that enrich the level of information and control available for
the data integration and data quality runtime environment.
With these features, users can quickly understand and respond
to changes and conditions in their information processing
environment. Enhancements in this release include:

Web-based full log accessibility
In addition to the succinct set of most relevant messages
provided in the previous release, the operations console now
also includes the display for the full log so that all messages are
accessible. This is particularly useful for helping subject matter
experts, who may work offsite or have only browser access,
investigate problems when they arise.

Web-based job runtime management
Administration and management of the operational
environment are simplified by extending the operations console
web interface to provide job controls so entitled users can run,
stop, reset or schedule integration jobs in their projects.

Operational/runtime reports
Cognos BI customers have the opportunity to gain additional
insight from the data integration operational environment
through a variety of new pre-built sample reports. These reports
explore job statistics across the environment — by dimensions
such as elapsed runtime and CPU usage — as well as data-store
analytics that illustrate what data has moved around the enterprise,
including quantity and frequencies. With these reports, firms
can easily answer questions such as “What jobs are consuming
resources above a certain level?” and “How much data in my
customer table is coming from this particular source?”
Dynamic integration
Workload management
In this release, InfoSphere Information Server supports
policy-driven control of system resources and prioritization
of different classes of workloads. With this, the InfoSphere
DataStage administrator can proactively manage the distribution
of resources and payloads, which is particularly useful where
multiple teams share a common hardware infrastructure.
Customers can use new workload management capabilities to
optimize hardware utilization and prioritize mission-critical
tasks, throttle job activities where resources exceed specified
thresholds, and assess, assign and reassign the priority of jobs
as new jobs are submitted into the queue. These gains are
realized through the following features:

System-level job policies
Administrators can define the limits that govern when new
jobs may be executed. If the system has already reached the
maximum number of jobs, other jobs submitted will be queued
until activity reaches an acceptable level. Additionally, InfoSphere
Information Server will monitor the rate at which new jobs are
run to guard against multiple project teams in a shared services
environment flooding the system with more activity requests
then the operating system can service. InfoSphere Information
Server will pace that activity so that only a specified number
of jobs are executed in a defined interval of time.

System-level resource policies
The data integration engine tracks resource consumption on
symmetric multiprocessing (SMP) environments. Administrators
can configure it to hold jobs in queue until the thresholds,
CPU or memory are within acceptable levels. Setting such
a policy will limit the amount of context switching and memory
swapping to reduce system overhead and execute tasks more
efficiently. (In a clustered environment, IBM already offers
similar capabilities via integration with resource manager
technologies. Contact your IBM Account Specialist for
more details.)
8 What’s new in IBM InfoSphere Information Server v9.1

Queue-based policies
The system administrator can define any number of queues to
manage each project team individually or delineate jobs based
on workload priority (high, medium and low). The queue-based
policies then govern the mix of jobs from each queue that will be
executed once the system has availability. Organizations can use
these controls to ensure that project teams with strict service-
level agreements can achieve their objectives on time, by slowing
down other workloads without similar imposed requirements.

User control and information
As required, privileged users may promote jobs ahead in the
queue or move them to different queues. All other users may
investigate the list of jobs in each queue and visualize system
resource usage and thresholds.
Anywhere integration supports big data
Organizations exploring big data analytics, such as Apache
Hadoop for data at rest or streaming technology for data in
motion, face many of the same challenges as they do with other
analytical environments. These challenges include determining
the location of the information sources needed for analysis, how
that information can be moved into the analytical environment,
how it must be reformatted so that it becomes easier and more
efficient to explore, and what data should be persisted to quickly
get to the next level of analysis.
InfoSphere Information Server v9.1 includes new capabilities
that organizations need to integrate the extreme volume, variety
and velocity of big data – from new and emerging big data
sources. Building on previously released InfoSphere BigInsight-,
Cloudera-, Apache-, and Hortonworks-certified Hadoop Data
File System (HDFS) support, this release includes:
 
Figure 4:
List of queued jobs for workload management
IBM Software 9

Balanced optimization for Hadoop
When a data integration job includes a big data source,
InfoSphere Information Server now can push the processing to
the data. Using the same common set of InfoSphere DataStage
stages and links to build the data integration logic, developers
may now choose to run the entire logic, or only portions of
that logic, as a MapReduce job that will execute directly on
the Hadoop platform. When the sources and targets of the
integration task are Hadoop data stores, this approach will
yield significant performance gains, as well as savings in
network resource consumption.

IBM InfoSphere Streams integration
For big data projects that focus on real-time analytical
processing, IBM now offers direct data flow integration
between InfoSphere Information Server and InfoSphere
Streams to combine the power and reach of both platforms.
With this feature, organizations can use standard data
integration conventions to gather information from across
the enterprise and pass that information to the real-time
analytical processes. Similarly, when InfoSphere Streams
finds records of insight, that data can now be passed directly
to a running data-integration job and made available to data
stores or applications across the information landscape, using
the full depth and breadth of InfoSphere Information
Server connectivity.

Big data job sequencing
InfoSphere Information Server now allows any InfoSphere
BigInsights or Cloudera-certified Oozie-contained MapReduce
job to be included in the job sequencer. This feature provides
end-to-end workflow across heterogeneous topologies executed
in both InfoSphere Information Server and Hadoop.

Big-data governance
InfoSphere Information Server also supports big data-related
governance features, such as impact analysis and data lineage,
on any big data integration points, thus providing enterprises
the ability to deliver on the promises of massively scalable
analytics, without sacrificing organizational insight into the
information infrastructure.
Anywhere integration supports balanced optimization
for DB2 for z/OS
Organizations leveraging DB2 for z/OS as a robust data
warehousing platform can now also exploit its power as part
of the data integration process. InfoSphere Information Server
v9.1 has expanded Balanced Optimization features to execute
data transformation tasks on the DB2 for z/OS database itself.
The developer expresses the logic of the data integration
processes in the current flow-oriented convention of InfoSphere
Information Server, which then optimizes the logic to benefit
from pushing desired parts of the processing to the database.
Users can choose to execute their jobs in the optimal location
for any given point in time: in the source database, in the
high-performance engine of InfoSphere Information Server,
in the target database or in any combination of these locations.
In many cases, pushing the processing in this way close to
the data, will save tremendous amounts of time and system
resources, and therefore create savings while delivering the
same business value.
Private cloud support via IBM PureApplication

System
InfoSphere Information Server Enterprise Hypervisor Edition
v9.1 delivers end-to-end information integration capabilities as
a system pattern that supports IBM PureApplication System, as
well as supporting shared services or competency center contexts.
By introducing InfoSphere Information Server Enterprise
Hypervisor Edition, IBM becomes the only information
integration vendor that includes comprehensive information
integration capabilities as part of an expert integrated system.
10 What’s new in IBM InfoSphere Information Server v9.1
Clients may leverage InfoSphere Information Server Enterprise
Hypervisor Edition:

As part of their IBM PureApplication System deployment,
supplying optimized information integration features

As part of a private cloud or shared services environment to
achieve greater speed and flexibility when provisioning and
expanding InfoSphere Information Server use within a
horizontal shared infrastructure

To quickly establish a private cloud development and test
environment devoted to information integration

As a way to accelerate fixed-time projects

To benefit from IBM’s best-practice reference architecture
Business rules management
With the rapidly increasing volume, variety and velocity of data,
organizations are facing the need to adapt quickly to changing
business requirements. Business users must make decisions
quickly to effect organizational success, so business-driven
organizations seek to provide these individuals with the right
tools to effect that change. Organizations must move from code
development, and the associated longer software development
life cycles, to the agility of business rules management.
InfoSphere Information Server v9.1 can now integrate directly
with IBM Operational Decision Management (formerly ILOG
JRules), providing the ability for non-technical subject-matter
experts to govern key logic constructs in their organization.
In this way, the line of business defines policies and automated
decisions in a form that is understandable to the non-technical
user. These same rules can quickly be promoted to the operational
environment and consumed by InfoSphere Information Server
to leverage within the data integration process — extending the
value of a business rule engine into an agile transformational
component of a data integration platform. Organizations make
a giant leap forward in bridging the gap between business people
and IT by implementing decision logic using IBM Operational
Decision Management within InfoSphere Information Server to:

Enable line-of-business subject-matter experts to create and
maintain decision logic, accelerating the implementation
of changes to meet evolving market, regulatory or
competitive conditions

Improve collaboration between business and IT teams, sharing
visibility and governance controls for the management of
decision logic

Separate decisions from processes and applications, facilitating
change and reuse across business systems
The following IBM Operational Decision Management features
are supported in InfoSphere Information Server v9.1:

Allow data integration specialists to explore rules available on
the Rule Team Server during the design process, and easily
select those applicable to the given transformation process.

Integrate the record data moving through the data integration
process with the “in,” “out” and “inout” parameters of the
selected rule sets.

Support three execution modes of the rule engine (core engine,
managed J2SE RES XU and managed J2EE RES XU).

Support rules generated in Java XOM and dynamic XOM object
models, as well as Java-bean compliant and non-compliant
rule-set parameter Java classes.

The metadata importer auto converts rule-set parameters and
its methods into matching database columns, automatically
configuring stages for the specified rule set.
InfoSphere Information Server for Data
Quality: Sustainable quality capabilities
To help create and maintain quality data, InfoSphere Information
Server for Data Quality provides capabilities to analyze, cleanse,
monitor and manage data in both batch and real time. Two aspects
set the InfoSphere Information Server approach to data quality
apart. First, InfoSphere Information Server for Data Quality
delivers unique business and IT collaboration capabilities to align
quantitative metrics with business and governance objectives.
Second, the product provides rich and comprehensive
capabilities that users can use to quickly and easily design,
manage and monitor data quality in alignment with the
business policies the organization defines.
IBM Software 11
New data validation capabilities
With the v9.1 release of InfoSphere Information Server for Data
Quality, data validation rules have been enhanced so users can
more easily access and manage both the data validation rules
and any records that violate the rules. These enhancements
provide for multiple rules output violations to a single table,
which saves users significant time in finding and working with
exception records. Additionally, users can define the sequence
in which rules should run, giving them greater flexibility and
insight when filtering data violations and performing drill-down
analysis. The specific enhancements include:
Flexible data validation rule output table configuration
With InfoSphere Information Server for Data Quality v9.1,
users can define custom names for data validation rule output
tables that typically store the exception records. They also
can append or overwrite previous output records after the
execution of a rule.
Another option is to use the same named output table for
multiple data validation rules that then append or overwrite
previous rule-execution results. As a result, the output table
stores output records from different rules in a single place
with a common table definition.
 
Figure 5:
Configuring simple user-named output tables
12 What’s new in IBM InfoSphere Information Server v9.1
Registration and reuse of data validation rule output tables
Named output tables of data validation rules are automatically
registered in the metadata repository and accessible and reusable
in other data validation rules. One data validation rule can now
be executed over the output of another data validation rule.
Sequencing of data validation rules
Users can specify a sequence of data validation rules, rule
sets or metrics in a group. The product exposes an HTTP API
and command-line interface to create, view and modify these
sequences of rules. Together with the more flexible configuration
of rules, this feature enables a powerful sequencing of rules
where the first data validation rule creates an output that can
be accessed by the second rule in the sequence.
Distinct output records for data validation rule execution
Users can determine whether to exclude duplicates when storing
the output records after a data validation rule execution.
Impact analysis for data validation rules
Users can search, browse and view data validation rules in
InfoSphere Information Server for Data Quality. They can
also identify which data rule definition is used in which
InfoSphere Information Server for Data Integration jobs.
When a rule definition changes, developers are able to quickly
understand which jobs are impacted and need to be changed.
 
Figure 6:
Fields for defining a data validation rule
IBM Software 13
InfoSphere Data Quality Console
The new InfoSphere Data Quality Console provides a unified
environment in which users assess and monitor how data quality
is progressing within the business, proactively increasing
data-quality awareness.
The InfoSphere Data Quality Console provides data quality
summary charts that direct users to the most critical data quality
information. For in-depth analysis, classification and prioritization
of data quality events, the console provides comprehensive
search, filtering and drill-down functionality to understand
where and to what severity data quality is being compromised.
InfoSphere QualityStage Standardization
Rules Designer
A new feature, the InfoSphere QualityStage Standardization
Rules Designer, provides an agile and business-focused
environment to customize data standardization processes
for particular business needs, such as data enrichment or
data cleansing.
 

Figure 7:
InfoSphere Data Quality Console dashboard view
14 What’s new in IBM InfoSphere Information Server v9.1
The standardization rules designer includes:

An intuitive framework to design, maintain and execute
standardization rules for data quality

A web-based user interface for users to quickly begin the
classification process by changing or adding value definitions
to their data

Drag-and-drop capabilities for users to easily manage the
rules that handle their records without needing to write
any pattern action language code

Team collaboration, with the ability to work on any revision
of the rule.
InfoSphere Business Information Exchange:
Business-driven governance capabilities
Wherever information will directly impact a business — known
as the point of impact — it is critical to firms that the information
be correct, consistent and complete. By leveraging business-
driven integration and governance, including new capabilities
in InfoSphere Business Information Exchange v9.1 that provide
capabilities to create consistent business policies and rules,
employees can be more confident making decisions at the
point of impact.
 
Figure 8:
Definition of a data standardization rule
IBM Software 15
Business-driven information integration
Information governance policies and rules
InfoSphere Business Information Exchange v9.1 now provides
support for creating information governance policies and rules,
enabling companies to define the manner in which information
should be structured, stored, transformed and moved.
Information governance policies are expressed in straightforward
language and can contain sub-policies and references to
information governance rules. Information governance rules, also
expressed in simple language, describe the measurable criteria by
which information can be determined to be compliant with a
business objective.
Information governance policies and rules are available from
all interfaces where categories and terms are available.
 
Figure 9:
Example of an information governance policy details screen.
16 What’s new in IBM InfoSphere Information Server v9.1
There are two types of assigned relationships between technical
metadata assets and information governance rules:
1. Governs — This relationship expresses the intended
governance behavior for data sources and terms. For example,
the information governance rule “Customer Information must
have phone number” may govern a database table. This
expresses the intended behavior or characteristic of this
data source.
2. Implemented by — This relationship expresses the fulfillment
of an information governance rule by an active process or routine,
such as a data integration job, data cleansing rule or an MDM
service. For example, the information governance rule
“Customer Information must have phone number” may be
implemented by a job that loads a database table.
 
Figure 10:
Example of an information governance rule details screen
IBM Software 17
Advanced term relationships
Language is expressed via interrelated concepts. Many times,
in order to better understand a particular concept, a user finds
it helpful to understand its type and what characteristics it has.
The following relationships are available in the glossary:
1. Is A Type Of
2. Has Types
3. Has
4. Is Of
Type relationships are hierarchical, enabling the user to see all
of the supertypes of a term, and “has” relationships can be
derived by the “type” relationships. Figure 11 provides an
example, using the hypothetical term “Drug Administration.”
 
Figure 11:
Example of a term details screen, defining “Is A Type Of” and “Has A” relationships
18 What’s new in IBM InfoSphere Information Server v9.1
The term “Drug Administration” is defined with Is A Type Of
“Healthcare Treatment,” which is defined with Has An “Effective
Date,” “Expiration Date” and “Supervising Physician.” Therefore,
“Drug Administration” also derives these three terms because
of its relationship with “Healthcare Treatment.”
These relationships are also displayed in the glossary’s Eclipse
plug-in for IBM InfoSphere Data Architect and IBM Rational
Software Architect. When dragging and dropping a term with
Has A relationships onto the modeling canvas, the modeler can
populate contained attributes (logical model, physical model or
UML model) automatically with the associated Has A terms.
Data lineage enhancements
The linking services that weave together the InfoSphere
Information Server for Data Integration routines to create
lineage have undergone significant improvements, including
the new ability to select only the projects to be linked, without
unlinking unselected projects. This enhancement yields superior
performance for the linking services by linking only changes
to projects instead of linking the entire project. This capability
is available in all InfoSphere Information Server packages.
IBM InfoSphere Information Server v9.1 supports these
additional stages and asset types:

InfoSphere Information Analyzer rules stage

Big data file stage (BDFS)

Java rule stage

InfoSphere Streams stage

InfoSphere Streams end-point assets
BPMN2 support
Leveraging the BPMN2 capabilities in Rational Software
Architect version 8, the glossary’s Eclipse-based integration
with Rational Software Architect supports the drag-and-drop
of glossary terms to BPMN2 elements. This behavior is in
accordance with the existing glossary’s UML Eclipse plug-in
modeling behavior.
IBM Software 19
OSLC support
Open Services for Lifecycle Collaboration is a method for
displaying information from different systems via web services.
In the InfoSphere Business Information Exchange v9.1 release,
the glossary may consume OSLC services from Rational Asset
Manager and Rational Software Architect Data Manager, as
demonstrated in Figure 12.
Until release v9.1, terms could only be used to describe and
govern technical metadata content stored in the InfoSphere
Information Server metadata repository, as well as URL links
to other servers. This new capability, with its dynamic display
of external content, extends the scope of terms and information
governance rules to provide comprehensive, business-driven
information development and governance.
 
Figure 12:
Consuming OSLC content from Rational Software Architect Data Manager
20 What’s new in IBM InfoSphere Information Server v9.1
Simple access for enterprise collaboration
Single sign-on support
More consumable and accessible than ever, IBM InfoSphere
Business Information Exchange v9.1 supports Windows-based
single sign-on. For organizations using Active Directory, single
sign-on support can achieve this via the SPNEGO connection
mechanism and Kerberos authentication.
Blueprint web viewer
Creating a well-documented, end-to-end blueprint of the
architectural vision for an information project can help IT
departments align business requirements with enterprise and
reference architecture, while encouraging a standardized
approach to minimize rework. By representing the vision across
different domains — including data integration, data archiving,
test data management, application retirement, data warehousing
and business intelligence — blueprints can help data architects
avoid siloed thinking and misalignment with business priorities.
Blueprints also support project execution, based on proven
methodologies. Architects can use blueprints to leverage best
practices, as defined in the appropriate method, and create
documentation regarding the outcome, as well as “who, what,
when and why.” Blueprinting capabilities play a critical role in
keeping the project vision synchronized with the actual solution
by linking the blueprint to business and technical artifacts
throughout the solution lifecycle. With InfoSphere Information
Server, firms can now publish blueprints to the common
metadata repository and share them through web interfaces.
This helps the company enable enterprise collaboration and
consistent development of information flows. Users can publish
their blueprints to InfoSphere Information Server, where any
user may view them. The ability to view blueprints is available
in all InfoSphere Information Server v9.1 packages.
IBM Software 21
 
Figure 13:
Example of InfoSphere Information Server blueprint
22 What’s new in IBM InfoSphere Information Server v9.1
Appendix: Mapping of products to new
features in IBM InfoSphere Information
Server v9.1
Package Product New feature Page
InfoSphere Information
Server for Data Integration
InfoSphere DataStage, InfoSphere Metadata
Workbench, InfoSphere Data Replication
InfoSphere Data Click 4
InfoSphere DataStage
Enriched control of the operational environment 7
Workload management 7
Big data support 8
Private cloud support via IBM PureApplication System 9
InfoSphere DataStage Balanced Optimization*
Balanced Optimization for DB2 for z/OS 9
Balanced Optimization for Hadoop 9
InfoSphere Blueprint Director Blueprint web viewer 20
InfoSphere Metadata Workbench
Impact analysis for data validation rules 12
Data lineage enhancements 18
Single sign-on support 20
InfoSphere Information
Server for Data Quality
InfoSphere QualityStage
InfoSphere Data Quality Console 13
InfoSphere QualityStage Standardization Rules Designer 13
InfoSphere Information Analyzer
Flexible data validation rule output table configuration 11
Registration and reuse of data validation rule output tables 12
Sequencing of data validation rules 12
Distinct output records for data validation rule execution 12
InfoSphere Metadata Workbench
Impact analysis for data validation rules 12
Data lineage enhancements 18
Single sign-on support 20
InfoSphere Blueprint Director Blueprint web viewer 20
*Separately orderable
IBM Software 23
Package Product New feature Page
InfoSphere Business
Information Exchange
InfoSphere Business Glossary
Information governance policies and rules 15
Advanced term relationships 17
BPMN2 support 18
OSLC support 19
Single sign-on support 20
InfoSphere Metadata Workbench
Impact analysis for data validation rules 12
Data lineage enhancements 18
Single sign-on support 20
InfoSphere Blueprint Director Blueprint web viewer 20
InfoSphere Information
Server Enterprise Edition
All products named above All features named above —
IMW14649-USEN-00
© Copyright IBM Corporation 2012
IBM Global Services
Route 100
Somers, NY 10589
U.S.A.
Produced in the United States of America
October 2012
All Rights Reserved
IBM, the IBM logo, ibm.com and InfoSphere are trademarks or registered
trademarks of International Business Machines Corporation in the United
States, other countries, or both. If these and other IBM trademarked terms are
marked on their first occurrence in this information with a trademark symbol
(® or ™), these symbols indicate U.S. registered or common law trademarks
owned by IBM at the time this information was published. Such trademarks
may also be registered or common law trademarks in other countries.
A current list of IBM trademarks is available on the Web at “Copyright
and trademark information” at ibm.com/legal/copytrade.shtml
Other company, product and service names may be trademarks or service
marks of others.
References in this publication to IBM products and services do not
imply that IBM intends to make them available in all countries in which
IBM operates.
Please Recycle