Semantic Data Management

Alex EvangData Management

Sep 9, 2011 (6 years and 14 days ago)

864 views

1

Semantic Data Management

Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

2

Source: Oracle customer

Enterprise Information Challenge

3

Vision of Semantically Linked Data

The Network of Collaborative Knowledge

A web of data that can be
intelligently processed by machines
delivering an environment that:



Intelligent



Solves Problems



Relevant



Answers Questions



Contextual



Easily Searchable



Interconnected



Let Information follow you

“The next generation of information workers will expect a highly visual, connected,
contextual information workplace they can take anywhere.”
-

Forrester

4

Social Semantic Enterprise

Island: Java

Semantic
Tagging


Concept definitions
with unique meaning


Language
independent

or

Relational
Navigation


Recommendations


Find related content
& groups by
navigating
connected entities

to

John

Chambers

or

Cisco CEO

Semantic
Search


Search for concepts


Return semantically
related results

via

Social Connectivity through Semantic Enablement

Semantic

Tagging

5

Semantic Web


Integrate domains of knowledge
through common volcabularies (ie
SKOS)



Manage
relationships

between
collections of images and
associated metadata



RDF as flexible and extensible
data model supports powerful
search and end
-
user discovery of
related content



Rich platform for data integration,
data repurposing, and better
quality control and classification

Semantic Aggregation & Navigation of Data

6

The Web is about links…


The Semantic Web is about the
relationships in those links


Dan Brickley

7

Metadata of documents

8

Linking Data Resources



User


Data

Ontologies

RDF

Query & results

Structured &
Unstructured
Data Sources

9

Relational to Ontological Mapping

Drug

Neuron

Pathological

Agent

Receptor

Channel

inhibits

inhibits

Agent

Neuronal

Property

Pathological

Change

involves

involves

inhibits

Compartment

has

is_located_in

is_located_in

Courtesy: SenseLab, Yale University

10

Importance of SKOS


SKOS provides a simple bridge between the

print


world


and the (Semantic) Web



Thesauri, glossaries, etc, from the library
community


can be made available



LOC is a good example



SKOS can also be used to organize tags,
annotate


other vocabularies,


11

Ontology Integration: Bioinformatics

Source: Siderean Software

12




Storage & Loading



Native W3C RDF graph data store




Fast Bulk, batch & Incremental load




Query



SQL: SEM_MATCH graph pattern query



SPARQL: supported via Jena plug
-
in



Reasoning



RDF, OWL Prime, RDF++ semantic rules



Forward chaining inference model



User defined rule base



Scalability



Scales to billions of triples



Partitioning, RAC, Adv. Compression



Standards & Interoperability



Aligned with W3C specifications



Supported by leading semantic tools

Oracle 11
g

RDF/OWL Graph Data Management

Structured DBMS, Unstructured, Spatial, RSS, email, Documents

13

Semantic Technologies Adopters

Hutchinson 3G
Austria

Life Sciences

Clinical Medicine

Publishing

Thomson Reuters

Education

Telecomm

11g Reference

11g Reference

11g Reference

11g Reference

Defense/

Intelligence

11g Reference

14

People

Information

Communities

The Power of Semantic Connections

My View

Delivering relevant, contextual content based on
relationships, profile & activity

15

Cisco Knowledge Network

RDF Store

Social Graph

Entity and Property
Extraction

Enterprise
Vocabulary Import

e.g. “job roles”,
“customer accounts”

e.g. “product catalog”,
“Directory”


User Entered Tags



Import

Structured Data

Content Repositories

Transactional
Applications

e.g. ”web content”,
“wiki topics”,
“expertise”

P
eople

C
ommunities

I
nfo

16

Use Case: Integrative Data Mining and Ad Hoc Query

Discovery


Clinical


Public

Proteomics




Imaging

In
-
Vitro/Vivo

Genotyping

SNPs/Haplotypes



XREP

plus

IQ

Proteomics

Informatics

System/Semantic Integration Layer

Integrative Informatics

PGI


Lipid

Informatics


Tailored Therapeutic Workbench (TTW)

Integrative Data Mining/Query System

Lipidomics

Gene Expression






Info

Mining

Text

Translational Informatics

Source: WWW2007, Eli Lilly,
S.Stephens

17

Use Case: Drug Discovery / Target Assessment Tool

Source: WWW2007, Eli Lilly,
S.Stephens

18

Why is Oracle Spatial unique in Semantics
Data Management?


Oracle 11g is the only

commercial database

with
native RDF/OWL semantics capability.


Can readily scale to ultra
-
large repositories (+1 billion)


Growing ecosystem of 3
rd

party tools partners


Leverages Oracle Partitioning and Advanced
Compression. RAC is also supported.


Semantics customers expect to deploy very large
servers

19

Transform & Edit
Tools



RDF/OWL Data
Management


SQL & SPARQL
Query


Inferencing


Semantic Rules


Scalability &
Security


Semantic Indexing


OBIEE, Analytics


Graph
Visualization

Cytoscape


Social Network
Analysis



Metadata Registry


Faceted Search


Load, Query

& Inference


Applications &
Analysis

Tools


Extraction, Modeling, Reasoning & Discovery
Workflow

Other Data
Formats

RSS, email

Transaction

Systems

Data
Sources

Unstructured
Content

Entity Extraction
& Transform



OpenCalais



Linguamatics



GATE

Ontology
Engineering



TopQuadrant



Ontoprise

Categorization



Cyc

Custom Scripting


Partner
Tools

Partner/Oracle
Tools

20

Contacts and Resources:

Product Management: Xavier.Lopez@oracle.com


White papers, documentation, discussion forum, sample
code, downloads, press releases:


Google: “Oracle RDF”



http://www.oracle.com/technology/tech/semantic_technologies/index.html