Presentation

obtainablerabbiΔιαχείριση Δεδομένων

31 Ιαν 2013 (πριν από 4 χρόνια και 11 μήνες)

129 εμφανίσεις






Data curation in an existing infrastructure:


Stellenbosch University

1
st

African Digital Curation Conference


12


13 February 2008


Wouter Klapwijk

Senior software specialist

Library and Information Service

Overview

1.
Organisational objectives: current status


2.
Example of digital curation in practice



Overview

1.
Organisational objectives: current status


2.
Example of digital curation in practice



Organisational objectives: current status


Current digital curation practices focused on
supporting the e
-
Science/e
-
Research support
framework on campus


Acknowledge the fact that digital curation is more
than just ingesting, preserving and disseminating
research output


Started experimenting with replication technology


LOCKSS


Institutional Repository (IR) will interface with
Research Management System (RIMS)


SA RIMS
(InfoEd) project


Dspace


RIMS interaction via Staging Area
needs attention

Organisational objectives: current status


Library Service, Dept of Research Development
and the Dept of Information Technology: policy
framework


Institutional policies need to be in place


Human Resources: no dedicated programmers, no
dedicated repository administrator, no dedicated
systems administrator, 2 part time staff, no real
budget

Research Support Technology Framework

Do Research

Toolbox

Web survey tool (
SUrvey
)

Citation tools (e.g.
Endnote
)

SAS, SPSS, Matlab
, etc.

Federated Search (
MetaLib
)

Federated Identity Management

Inter
-
institutional ID management

Collaboration

Environment

e.g. For Centres of Excellence, Inter
-
institution

Web & video conferencing, messaging,

document collaboration, blogs, wikis

websites

High
-
speed

Internet

SANReN

Seacom

Remote connectivity

(e.g. SCN)


Institutional

Repository

Lab notes

Preservation

Security & Publishing

Research outputs
:

Articles

Data sets

ETD


Concept

Research Lifecycle Management

Grant

Application

Ethics

Protocol

Negotiate

Approve

Internal

Review

Submit

IP/Ethics/

Contract

Review

Manage

Project &

Contract

Reporting

Research

Output

(Inter)National

Resources








Library systems

Funding search/alerts

Expertise directories

Compiled by Ralph Pina, Stellenbosch University IT Division

e
-
Portfolio


Self
-
maintained for

marketing

e
-
Profile

Hi
-
Perf Computer Cluster

Institutional Repository

INSTITUTIONAL RESEARCH REPOSITORY


SU FEDERATED INTERFACE


OAI
-
PMH Service provider

LEARNING OBJECT REPOSITORY


Storage of learning objects to support the process

of Learning and Teaching (interface with WebCT)

NON
-
ACADEMIC REPOSITORY


For the submission, archival and retrieval of

Digital Objects

REPOSITORY …

RESEARCH REPOSITORY


Centre/s of Excellence

Departmental Research

… etc.

{ CREST / Department of Research Support }

ELECTRONIC THESES / DISSERTATIONS


For the submission of postgraduate research


which is not part of the
Research Repository

Implementation
-

technology plan


Dspace digital repository system


Standardize on version 1.5 for 3 years


Need to workshop OAIS framework (in South
African context)


Full OAIS
-
compliance in Dspace Release 2


Replication of ETD’s with LOCKSS


Proof of
Concept planned for 2009


Some work done with LOCKSS on format
migration


Multiple instances of Dspace


more flexibility,
more personalization

Implementation


virtual server setup


VMWare guest

etd.sun.ac.za

(with tomcat and handle server)

VMWare guest

research.sun.ac.za

(with tomcat and handle server)

VMWare guest

lib.sun.ac.za

(with tomcat and handle server)

DSPACE 1.5 SERVER

SQL SERVER

Linux

Bitstream storage

VMWare server

Linux / BSD

Metadata storage

PostgreSQL

mySQL

METADATA

ISO 19115

EAD

Dublin Core

etc.

SAN

(OPTIONAL)

Overview

1.
Organisational objectives: current status


2.
Example of digital curation in practice



Overview

1.
Organisational objectives: current status


2.
Example of digital curation in practice



A case study


DST
-
NRF Centre of Excellence for Invasion
Biology (
CIB
)


Prepared a set of Use Cases


metadata
requirements, access, permission, roles and
responsibilities


Dedicated Collection Administrator


Datasets


Dublin Core


Publications


ISO
-
19115 (spatial data)

A Case Study
-

managed access

Level

Metadata

Authors

Projects

Theses

Publications

Datasets

L0

L1

L2

L3

L4

No Access

Access

To be confirmed

View if it is owner

Permissions according to levels

(extracted from the CIB Use Cases)

A Case Study


metadata requirements

Conditional statements:
language: documented if not defined by the encoding
standard
characterSet: documented if ISO 10646-1 not used
and not defined by the encoding standard
hierarchyLevel: documented if hierarchyLevel not
equal to "dataset"?
hierarchyLevelName: documented if hierarchyLevel
not equal to "dataset"?
MD_SpatialRepresentation
(from Spatial representation information)
<<Abstract>>
MD_ApplicationSchemaInformation
(from Appli cation schema information)
MD_PortrayalCatalogueReference
(from Portrayal catalogue information)
MD_MetadataExtensionInformation
(from Metadata extensi on information)
MD_ContentInformation
(from Content information)
MD_ReferenceSystem
(from Reference system information)
DQ_DataQuality
(from Data quality information)
MD_Distribution
(from Distri buti on information)
MD_MaintenanceInformation
(from Mai ntenance information)
MD_Metadata
+ fileIdentifier [0..1] : CharacterString
+ language [0..1] : CharacterString
+ characterSet [0..1] : MD_CharacterSetCode = "utf8"
+ parentIdentifier [0..1] : CharacterString
+ hierarchyLevel [0..*] : MD_ScopeCode = "dataset"
+ hierarchyLevelName [0..*] : CharacterString
+ contact : CI_ResponsibleParty
+ dateStamp : Date
+ metadataStandardName [0..1] : CharacterString
+ metadataStandardVersion [0..1] : CharacterString
0..*
+spatialRepresentationInfo
0..*
0..*
+applicationSchemaInfo
0..*
0..*
+portrayalCatalogueInfo
0..*
0..1
+metadataMaintenance
0..1
0..*
+metadataExtensionInfo
0..*
0..*
+contentInfo
0..*
0..*
+referenceSystemInfo
0..*
0..*
+dataQualityInfo
0..*
0..1
+distributionInfo
0..1
MD_Constraints
(from Constraint information)
0..*
+metadataConstraints
0..*
MD_Identification
(from Identification information)
<<Abstract>>
0..*
+resourceMaintenance
0..*
1..*
+identificationInfo
1..*
0..*
+resourceConstraints
0..*
A Case Study


metadata requirements

Link to:


1.
Interface design


2.
Data Dictionary


3.
Dspace Administrator view




Thank you

wklap@sun.ac.za


+27 21 808
-
4378