Semantic Data lives everywhere on the Web

observancecookieSecurity

Nov 5, 2013 (3 years and 7 months ago)

60 views

Ben Szekely

Cambridge Semantics



17/09/07 / Page
1

ben@cambridgesemantics.com

Semantic Data lives everywhere on the Web


Ben Szekely

Cambridge Semantics



17/09/07 / Page
2

ben@cambridgesemantics.com

In a perfect world we would have...


Download Semantic Data here

Ben Szekely

Cambridge Semantics



17/09/07 / Page
3

ben@cambridgesemantics.com

The Web is a far from perfect world

Web presentation techniques are great at making data look
nice but are also great at hiding or destroying the
semantics all together

Rich data mired in HTML tables

XML and RDF turned into human
-
only readable data by
style sheets and other transforms

Links to Semantic data are hidden in non
-
visible portions of
the page

A Web page hiding rich Semantic data can be a tough
nut to crack

Ben Szekely

Cambridge Semantics



17/09/07 / Page
4

ben@cambridgesemantics.com

The Non
-
semantic shell of Semantic Data


Ben Szekely

Cambridge Semantics



17/09/07 / Page
5

ben@cambridgesemantics.com


“Semantic Web Through the Eyes of a
Parrot”

Where is the
nice tasty
Semantic
Data?

Ben Szekely

Cambridge Semantics



17/09/07 / Page
6

ben@cambridgesemantics.com

The Nala Semantic Data Collection Plugin

Mozilla Firefox plugin client

OpenAnzo.org
-

RDF Database and Semantic Web
application Server

Breaks through the non
-
Semantic shell around Semantic
data in Web pages

Collects, stores and indexes Semantic data on the client
and in a central database

Allows collaboration, rich query, and discovery of new
relationships between objects from different Web pages

Ben Szekely

Cambridge Semantics



17/09/07 / Page
7

ben@cambridgesemantics.com

A first look at Nala


Ben Szekely

Cambridge Semantics



17/09/07 / Page
8

ben@cambridgesemantics.com

Semantic Web Technology isn't Magic

The Semantic Web

Unattainable, hollistic world of connected Semantic data

Organizations and their IT departments abandon
10
+ years of infrastructure and database
investment

Global agreement on ontologies

Software agents can crawl and make perfect sense of the world with little human guidance

everyone head for the hills!

Semantic Web Technologies

Powerful, concrete tools that can help realize many of the goals of The Semantic Web in
particular situations, systems, and applications

Driven by adoption of important technology standards (HTTP, RDF, SPARQL, OWL, LSID)


Do not require universal agreement and adoption of domain standards (ontologies,
vocabularies, schemas)


Enable organizations to captilize on infrastructure and data investments by exposing existing
data as Semantic data on the Web

Ben Szekely

Cambridge Semantics



17/09/07 / Page
9

ben@cambridgesemantics.com

Semantic Web Technology in Nala

Semantic Web Technology

Resource Description Framework (RDF)


Named graphs

SPARQL


Query language and protocol for RDF

LSID Resolution

Other Technology

HTTP, AJAX, REST, Atom Publishing Protocol

Screen scraping

Text Indexing

'The Machine Readable Web'
-

Lee Feigenbaum and Elias Torres

Part 1
-

http://www.ibm.com/developerworks/library/wa
-
hmach1/

Part 2
-

http://www.ibm.com/developerworks/library/wa
-
hmach2/


Ben Szekely

Cambridge Semantics



17/09/07 / Page
10

ben@cambridgesemantics.com

A more involved scenario

Two different sources have information about the same biological concept.

Bioguid ( Rod Page)


HTTP uri, proprietary RDF, some standard predicates

Herbimi ( Kevin Richards)


LSID resolution

How can we

Crack through the different non
-
Semantic shells?

View this data merged in the same place?

Map key touch points where the ontologies intersect or relate?

Export the merged data for use in our other applications?

Ben Szekely

Cambridge Semantics



17/09/07 / Page
11

ben@cambridgesemantics.com

The state of Nala

Available in TDWG lsid source control

https://lsids.svn.sourceforge.net/svnroot/lsids/trunk/demo/org.gbif.nala

Firefox Installer available soon

Public server installed

tdwgbox.tdwg.org

Future work

Use cases?

Additional development?

Domain
-
specific clients

Ben Szekely

Cambridge Semantics



17/09/07 / Page
12

ben@cambridgesemantics.com

openanzo.org

Open and inclusive community of Semantic Web Technology adopters and
engineers

Pure open source development community

Flagship project:
Anzo RDF Database and Semantic Web Application
Server

Based on IBM Semantic Layered Research Platform project

Founded and supported by new Boston, MA
-
based venture Cambridge
Semantics Inc.

We are looking for contributors, testers, adopters and critics

If you are building (Semantic (Web) applications), give us a try

Documentation, releases, support are coming soon

Ben Szekely

Cambridge Semantics



17/09/07 / Page
13

ben@cambridgesemantics.com

Cracking the shell

By adopting particular Semantic Web technologies as a community,
we can release the potential wealth of knowledge locked in the data
we have siloed in databases and Web pages.