Files by Linked Open Data

steelsquareInternet και Εφαρμογές Web

20 Οκτ 2013 (πριν από 3 χρόνια και 11 μήνες)

332 εμφανίσεις

Enrichment of Library Authority
Files by Linked Open Data
Sources


Gerd Zechmeister

Semantic Web Company


http://www.semantic
-
web.at



Presentation agenda

1.
About us

2.
LOD2 Project

3.
Demonstration Scenario

4.
Process & Results

5.
Summary & Outlook

© Semantic Web Company


http://www.semantic
-
web.at/

2

About us


Based in Vienna (privately held)


20 specialists from several fields


Focus: Semantic (web) technologies &
search applications


1st project based on semantic technologies in
2001


Foundation of
Semantic Web School
in 2004



Semantic Web Company GmbH

since 2008


PoolParty

development started in 2007, on
the market since 2009


© Semantic Web Company


http://www.semantic
-
web.at/

3

© Semantic Web Company


http://www.semantic
-
web.at/

4

PP Thesaurus Manager

© Semantic Web Company


http://www.semantic
-
web.at/

5

1.
Each

concept

in
one

or

many

concept

schemes

2.
Each

concept

has

one

URI

3.
Each

concept

has

one

ore

more

labels

4.
(
Poly
-
)
Hierarchical

and

non
-
hierachical

relations

5.
Matching

between

concepts

from

various

sources

1.

2.

3.

4.

5.

SKOSsy

© Semantic Web Company


http://www.semantic
-
web.at/

6


Select
DBPedia

categories


Choose extraction
depth, data to
extract and format
(TTL,
TriG

etc.)


Extract it and
import it into
PoolParty as Seed
Thesaurus


FP7 project (2010
-
2014)


15 partners (technology researchers,
companies and service providers) from
11 European countries plus 1
associated partner from Korea


Coordinated by the AKSW research
group at the University of Leipzig

© Semantic Web Company


http://www.semantic
-
web.at/

7

LOD Life
-
Cycle
Management

© Semantic Web Company


http://www.semantic
-
web.at/

8


Extraction of RDF
from text, XML and
SQL


Querying and
Exploration using
SPARQL


Authoring of Linked
Data using a
Semantic Wiki


Semi
-
automatic link
discovery between
Linked Data sources


Knowledge
-
base
Enrichment and
Repair


Demonstration
Scenario


Alignment


Example

Data
vs

LOD
resources

in SKOS


Identification

of

matching

concepts



Enrichment


Addition
of

matches

to

Example

Data
dump


© Semantic Web Company


http://www.semantic
-
web.at/

9

Demonstration
Scenario


Applied tools and frameworks



© Semantic Web Company


http://www.semantic
-
web.at/

10

Tool/Framework

Function

Using

SKOS Thesauri

as

graph
/SPARQL
endpoint


Creating

example

data

as

graph
/SPARQL
endpoint

Comparing

data

to

detect

matching

concepts

Extracting

categories

from

DBPedia

to

import

it

as

Thesaurus
into

PoolParty

Demonstration
Scenario


Example Data


Schlagwortnormdatei (SWD = keyword
authority file) from DNB data dump


166.414 concepts in German with
alignments to LCSH, RAMEAU etc.


Expressed in SKOS (hierarchical and
associative relations)

© Semantic Web Company


http://www.semantic
-
web.at/

11

Demonstration
Scenario


SKOS vocabularies for alignment


Standard Thesaurus Economy (STW)


6520 concepts with english/german prefLabel


European Union Thesaurus (EUROVOC)


6797 concepts with multilingual prefLabel


Extracted concepts from DBPedia via
SKOSsy: „Economy“


13294 concepts in German

© Semantic Web Company


http://www.semantic
-
web.at/

12

Process & Results:
preparational steps

1.
Download


SWD
data

dump

from

DNB
server

2.
Evaluation


SKOS
compatibility

3.
Transformation


SWD
data

as

SPARQL
endpoint

4.
Vocabulary

selection


Focus on Economy
vocabularies


© Semantic Web Company


http://www.semantic
-
web.at/

13

Process & Results:

Alignment


Specification in SILK workbench


Define data sources: SWD & EUROVOC


Define tasks: compare all skos:prefLabels
and deliver all matching links


Initiate process and create output file


© Semantic Web Company


http://www.semantic
-
web.at/

14

SILK Workbench

© Semantic Web Company


http://www.semantic
-
web.at/

15

Alignment SWD
vs

EUROVOC

SILK Workbench

© Semantic Web Company


http://www.semantic
-
web.at/

16

Alignment SWD
vs

EUROVOC

Process & Results:

Alignment

© Semantic Web Company


http://www.semantic
-
web.at/

17

SWD

166414
cs
.

STW

6520
cs
.

EUROVOC

6797
cs
.

DPPedia

Wirtschaft

13294
cs
.

3440
matching links

2169

1318

Process & Results:

Enrichment

© Semantic Web Company


http://www.semantic
-
web.at/

18

Upload of
exactmatches

to the
SWD graph in
Virtuoso

Process & Results:

Enrichment

© Semantic Web Company


http://www.semantic
-
web.at/

19

Subject

Predicate

Object

<http://d
-
nb.info/gnd/4000
107
-
6>

<
skos:exactMatch
>

<http://de.dbpedia.org/resource/Abfallwirtschaft>

<http://d
-
nb.info/gnd/4000
107
-
6>

<
skos:exactMatch
>

<http://eurovoc.europa.eu/1158>

<http://d
-
nb.info/gnd/4000
107
-
6>

<skos:exactMatch>

<http://zbw.eu/stw/descriptor/13325
-
0>

© Semantic Web Company


http://www.semantic
-
web.at/

20

SWD

DBPedia

EUROVOC

STW

Process & Results:

Enrichment

© Semantic Web Company


http://www.semantic
-
web.at/

21

<skos:Concept rdf:about="http://d
-
nb.info/gnd/4000107
-
6">


<skos:definition xml:lang="de">Weiter als im Gabler definiert, auch für öffentliche
Abfallwirtschaft</skos:definition>


<dnb:hasCoordinatedConcept
-
of>


<dnb:CoordinatedConcept>


<dnb:coordination
-
of rdf:resource="http://d
-
nb.info/ddc
-
sg/360"/>


<dnb:coordination
-
of rdf:resource="http://d
-
nb.info/gnd/4000107
-
6"/>


<dnb:det2 rdf:resource="http://d
-
nb.info/ddc/class/363.728"/>


</dnb:CoordinatedConcept>


</dnb:hasCoordinatedConcept
-
of>


<skos:related rdf:resource="http://d
-
nb.info/gnd/4000100
-
3"/>


<skos:related rdf:resource="http://d
-
nb.info/gnd/4076573
-
8"/>


<dcterms:identifier>(DE
-
588)040001075</dcterms:identifier>


<dcterms:identifier>(DE
-
588c)4000107
-
6</dcterms:identifier>


<skos:broader rdf:resource="http://d
-
nb.info/gnd/4220414
-
8"/>


<skos:prefLabel xml:lang="de">Abfallwirtschaft</skos:prefLabel>


<skos:exactMatch rdf:resource="http://de.dbpedia.org/resource/Abfallwirtschaft">


<skos:exactMatch rdf:resource="http://eurovoc.europa.eu/1158">


<skos:exactMatch rdf:resource="http://zbw.eu/stw/descriptor/13325
-
0">

</skos:Concept>

Summary & Outlook


Playground for future scenarios


Linked Open Library Data


LOD2 technology stack components


Further applications


Executing tasks for regular updates


Link exchange with LOD providers


Integration of data and cross
-
media (e.g.
geo
-
references, images, AV files)


Expansion of authority files for
cataloguing (e.g. multilingual searches)



© Semantic Web Company


http://www.semantic
-
web.at/

22

Get in contact!

© Semantic Web Company


http://www.semantic
-
web.at/

23

Semantic Web Company GmbH

Mariahilfer Strasse 70/8

1070 Vienna
-

Austria

http://www.semantic
-
web.at/


http://poolparty.biz/


http://twitter.com/semwebcompany




Gerd Zechmeister

Research & Development Manager

g.zechmeister@semantic
-
web.at