ForeCite: towards a reader-centric scholarly digital library

Thuy Dung Nguyen, Min-Yen Kan, Dinh-Trung Dang, Markus Hänse, Ching Hoi Andy Hong, Minh-Thang Luong, Jesse Prabawa Gozali, Kazunari Sugiyama & Yee Fan Tan

Minh-Thang Luong @ WING meeting
14 May 2010

3/18/2013

2

FCWeb

FCReader

FCNote



- Author-centric vs. reader-centric
- Piecemeal solution vs. unified approach

Overview

- ForeCite
  - Architecture
  - Design principles
- FCWeb
- FCReader
- FCNote
- Backend components


ForeCite - Architecture

- Ruby on Rails framework
- Model-View-Controller (MVC) design pattern
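
The MVC split named above can be illustrated with a small, language-neutral sketch. This is not ForeCite's Rails code: the `Paper`, `PaperView`, and `PaperController` names and their fields are hypothetical, chosen only to show which responsibility sits in which layer.

```python
from dataclasses import dataclass


@dataclass
class Paper:
    """Model: holds the data for one paper (hypothetical fields)."""
    title: str
    authors: list[str]


class PaperView:
    """View: turns a model into text for display."""

    def render(self, paper: Paper) -> str:
        return f"{paper.title} ({', '.join(paper.authors)})"


class PaperController:
    """Controller: takes a request, looks up the model, hands it to the view."""

    def __init__(self, papers: dict[int, Paper], view: PaperView) -> None:
        self.papers = papers
        self.view = view

    def show(self, paper_id: int) -> str:
        paper = self.papers.get(paper_id)
        if paper is None:
            return "Paper not found"
        return self.view.render(paper)


# Illustrative in-memory data; a real application would use a database.
papers = {1: Paper("An example paper", ["A. Author", "B. Author"])}
controller = PaperController(papers, PaperView())
print(controller.show(1))  # An example paper (A. Author, B. Author)
```

In Rails the same roles are played by ActiveRecord models, view templates, and controller classes.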


ForeCite - Design Principles

- Web services (WS) platform: WS server as a broker + rudimentary load balancer
- Reader-centric platform: explore, read, and synthesize
- Deeply interlinked components: FCWeb, FCReader, and FCNote (FCW/R/N)
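
One way to picture a WS server that acts as a broker with rudimentary load balancing is a registry that hands each request for a service to the next worker in turn. The sketch below assumes round-robin dispatch; the slides do not say which policy ForeCite uses, and the `Broker` class, the `keyphrases` service name, and the worker functions are all hypothetical.

```python
from itertools import cycle
from typing import Callable

Handler = Callable[[str], str]


class Broker:
    """Routes each request to one of the workers registered for a service."""

    def __init__(self) -> None:
        self._workers: dict[str, cycle] = {}

    def register(self, service: str, workers: list[Handler]) -> None:
        if not workers:
            raise ValueError("at least one worker is required")
        # cycle() yields the workers in order, forever: simple round-robin.
        self._workers[service] = cycle(workers)

    def call(self, service: str, payload: str) -> str:
        if service not in self._workers:
            raise KeyError(f"unknown service: {service}")
        worker = next(self._workers[service])
        return worker(payload)


def make_worker(name: str) -> Handler:
    """Build a stand-in worker that just reports which instance ran."""
    return lambda payload: f"{name} handled {payload!r}"


broker = Broker()
broker.register("keyphrases", [make_worker("worker-A"), make_worker("worker-B")])
results = [broker.call("keyphrases", f"doc{i}") for i in range(3)]
for line in results:
    print(line)
```

Clients only ever talk to the broker, so workers can be added or removed without changing client code.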


Table 2: Publicly available ForeCite web services

FCWeb - Homepage

- Search interface to ForeCite (FC)
- Paper statistics, for a more dynamic presence
  - Most read: FCReader
  - Recently added: FCWeb & FCNote
  - Recently annotated: FCReader & FCNote
- User statistics
  - Most active: based on the number of public annotations
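
The "most active" ranking above is described only as being based on the number of public annotations. A minimal sketch of such a count follows; the record layout (`user` and `public` keys), the sample data, and the `most_active` function are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical annotation records; only public ones count toward activity.
annotations = [
    {"user": "alice", "public": True},
    {"user": "bob", "public": True},
    {"user": "alice", "public": True},
    {"user": "bob", "public": False},
    {"user": "carol", "public": False},
]


def most_active(records, top_n=3):
    """Return (user, public annotation count) pairs, most active first."""
    counts = Counter(r["user"] for r in records if r["public"])
    return counts.most_common(top_n)


ranking = most_active(annotations)
print(ranking)  # [('alice', 2), ('bob', 1)]
```

Users with no public annotations, such as `carol` here, do not appear in the ranking at all.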


FCWeb - Views

- Paper (left): view metadata
  - keywords, reference and citation info from KEA & ParsCit
  - annotations and user reviews made in FCReader & FCNote
- Person (right): regards users as both authors and readers


FCReader - Document annotation

- Reading environment + making sense of the content
- Highlights & anchor notes, reviews (saved into FCNote)
- Supports collaborative annotations


FCReader - Document analysis

- Rhetorical structure (RAZ)
- Keywords (KEA)
- Logical structure (ParsCit++)

FCNote

- Bibliographic manager + note-taking system
- Access to documents’ metadata & own annotations (online)
  - Synchronization mechanism
  - http://FCNote.comp.nus.edu.sg
- Key difference on the client side (offline)
  - Self-contained HTML page
  - Heavily modified from TiddlyWiki



FCNote - Note creation & document ingestion

- Normal vs. research notes
- Research notes:
  - Title suggestions
  - Paper metadata
  - 2-level tagging: tasks & tags
  - Share notes (FCReader, FCWeb)
- Document ingestion:
  - PDF or BIB (single/multiple)
  - Paper metadata, keywords, references automatically extracted (for PDF)
  - Local document management
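
The 2-level tagging of research notes listed above could be modeled as a task at the first level with free-form tags beneath it. That reading of "tasks & tags" is an assumption, as are the `ResearchNote` fields and the `index_notes` helper in this sketch; it is not FCNote's actual data model.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class ResearchNote:
    """A research note filed under one task and any number of tags."""
    title: str
    task: str
    tags: set[str] = field(default_factory=set)


def index_notes(notes):
    """Build a task -> tag -> [note titles] lookup."""
    index = defaultdict(lambda: defaultdict(list))
    for note in notes:
        for tag in sorted(note.tags):
            index[note.task][tag].append(note.title)
    return index


notes = [
    ResearchNote("Note A", task="survey", tags={"citation", "parsing"}),
    ResearchNote("Note B", task="survey", tags={"citation"}),
    ResearchNote("Note C", task="thesis", tags={"keyphrase"}),
]
index = index_notes(notes)
print(index["survey"]["citation"])  # ['Note A', 'Note B']
```

Browsing by task first and then by tag maps directly onto the two dictionary levels.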


Backend components

- KEA: Automatic Keyphrase Extraction
- ParsCit: Reference String Parser
- ParsCit++: Section Labeler
- RAZ: Robust Argumentative Zoning
- Kairos: Crawler for Paper Metadata
- Record Matching and Canonicalization
- FireCite: Browser-based Metadata Extraction from Publication Web Pages
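
Record matching and canonicalization, listed above, is about recognizing that differently formatted records refer to the same paper. The slides do not describe ForeCite's method, so the sketch below swaps in a deliberately simple stand-in: normalize each title into a key and group records that share it. The `canonical_key` and `group_records` functions and the `source` labels are hypothetical.

```python
import re
import unicodedata
from collections import defaultdict


def canonical_key(title: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    text = unicodedata.normalize("NFKD", title)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^a-z0-9]+", " ", text.lower())
    return " ".join(text.split())


def group_records(records):
    """Group records whose titles normalize to the same key."""
    groups = defaultdict(list)
    for record in records:
        groups[canonical_key(record["title"])].append(record)
    return dict(groups)


records = [
    {"source": "crawler",
     "title": "ForeCite: Towards a Reader-centric  Scholarly Digital Library"},
    {"source": "upload",
     "title": "ForeCite - towards a reader-centric scholarly digital library."},
    {"source": "upload", "title": "A different paper"},
]
groups = group_records(records)
for key, members in groups.items():
    print(key, [m["source"] for m in members])
```

Exact-key grouping misses typos and truncated titles; a production matcher would typically add fuzzy comparison over several fields.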


Q & A

Thank you!
