WAIM 2011 / MSRA

jumentousmanlyInternet and Web Development

Oct 21, 2013 (3 years and 10 months ago)

97 views


WAIM 2011

/ MSRA

SUMMER SCHOOL

Sep. 12~1
6
, 2011.
Computer School,
Wuhan University

Homepage:
http://totemdb.whu.edu.cn/summerschool

Special thanks for sponsorship from Microsoft

Research

Asia
(MSRA)


OVERVIEW


Lectures are given by renowned researchers from Microsoft

and

top

universities
. Lectures
combine theory and foundations, concepts and
algorithms, as well as applications and case
studies. Lectures also feature advanced topics, research questions, and group discussions.
Particular emphasis is put on encouraging participants to actively participate in discussions,
brainstorming, and break
-
out sessions.




























SCHEDULE


Sep.
12

Morning

Registration

Afternoon

Opening Ceremony

Chaired by
Prof. Zhiyong Peng,
Wuhan University

The Logic of Queries: from Data Bases to Data
Streams
and the WEB

Prof. Carlo Zaniolo,
UCLA

Sep.
13

Morning

The Logic of Queries: from Data Bases to Data
Streams and the WEB

Prof. Carlo Zaniolo,
UCLA

Afternoon

Big Data and Cloud Computing

Prof. Amr Abbadi, UCSB

Sep.
14

Morning

Information Diffusion In
Social Networks

Prof. Amr Abbadi, UCSB

Afternoon

Graph Database Engine and Computation Platform

Knowledge Bases: Linking database, AI, and
information retrieval

Dr. Haixun Wang,
Microsoft Research

Sep.
1
5

Morning

WAIM 2011
Keynote Speeches: 1.
Analytics

for
Info
-
plosion including information diffusion
studies for the 3.11 Disaster

Prof.
Masaru
Kitsuregawa,
University
of Tokyo.

WAIM 2011
Keynote Speeches:
2
.
Data
-
driven
Modeling and Analysis of Online Social Networks

Prof.
Divyakant Agrawal
,
UCSB.

Afternoon

Research Sessions:

1(a)
Query Processing
, 1(b) Uncertain Data, 1(c)
Social Media(1);

2(a) Semantics, 2(b) Data Mining(1), 2(c) Social
Media(2);

3(a) Semantics, 3(b) Data Mining(1), 3(c) Social
Media(2);


Sep.
1
6

Morning

WAIM 2011
Keynote Speeches:
3
.
Reflections on
Big Data and Enterprise Analytics

Dr.
Surajit Chaudhuri,
Microsoft Research

WAIM 2011
Keynote Speeches:
4
.

Using the Web
for Collaborative Language Learning and Teaching

Prof.
Werner

Winiwarter, University
of Vienna

Afternoon

Research Sessions:

4(a) Data Management, 4(b) Graph Data, 4(c)
Name Disambiguation
;

5(a) Semantics, 5(b) Data Mining(1), 5(c) Social
Media(2);

6(a) Semantics, 6(b) Data Mining(1), 6(c) Social
Media(2);


Closing Ceremony

Chaired by
Dr. Haixun Wang,

Microsoft

Research








APPLICATION




We will have

about

1
5
0 participants.




The registration is free.



Qualified participants should be graduate students or undergraduate students
recommended

for
master

s degree candidate
.



Participants will have the opportunity to
present as an observer

in WAIM 2011 without
paying registration fee
.



We are going to give out an MSRA Student Award after the summer school. If you would like
to be considered for
the award, please submit your CV and a one page proposal


it can be
anything that you think is doable, and will make great impact in data management (in the
most broad sense of data management).


Important Dates

Date

Content

Jul. 10

Applications open

Sep
.
8

Applications close


If you have any questions, please do not hesitate to contact to Dr. Ming Zhong for detailed
explanation.

Phone: 151
-
027
-
02596

Email: mike.clark.whu@gmail.com











WAIM 2011 Summer School

Application Form





报名表请邮寄:
武汉市武昌区珞珈山武汉大学计算机学院

钟鸣

邮政编码:
430072

Please mail this form to:

Dr.
Ming Zhong. Computer School, Wuhan University. Wuhan, Hubei, PRC. 430072



报名截止时间:
2011

9

8



Deadline:

Sep
.
8
, 2011.

姓名

name


性别

sex


出生年月

birthday


称谓

title



单位

institution


通讯地址

address



邮编

postcode


电子邮件

E
mail

电话

phone

传真

f
ax

主要研究兴趣

primary
research
interests


发表论文情况
publication list


招生审批结果

(
勿填
)

acceptance

(do
not

fill)





WAIM 2011 Summer School

Accommodation Form





我们将安排
参与者
入住位于武汉大学计算机学院附近的茶港商务酒店
,请将住宿回执与
报名表一同邮寄


We will provide accommodation to participants in Cha Gang Business Hotel, which is located
near the Computer School of Wuhan University.
P
lease mail this form to us along with the
application form.


A.

个人信息

/ Personal Information



⼠family me




⼠first me

出生日期

⼠bi牴桤ay

护照或身份证号

⼠灡獳sort


国籍

⼠湡tion

性别

⼠獥x

地址

⼠a摤re獳


电话

⼠灨o湥


传真

⼠fax

电子邮件

⼠email


B.

酒店预订

/
B
ooking
H
otel

茶港商务酒店

Cha Gang Business Hotel

单人间

/ single room
(¥
198


人数

⼠灥r獯湳

双人间

/ double room
(¥
248


人数

⼠灥r獯湳

入住时间

⼠捨c捫ci渠nate


离开时间

⼠捨c捫ct⁤ te

总天数

⼠total⁤ ys


C.

注意

/ Notice



本组委会
仅安排
入住


须自付房费


The organization
committee

will

only

book hotel rooms for
you
, and
you

need to pay the
hotel expense by yourself
.



为您方便考虑
,请尽快报名

若酒店已订满请自订酒店
,请见谅。

For your
convenience
, please
registe
r

as soon as possible. If the hotel has no room left,
please book a
nother

ho
tel by yourself. Sorry for that.






COURSE INTRODUCTION


Course 1: The Logic of Queries: from Data Bases to Data Streams and the WEB

Lecturer: Prof. Carlo Zaniolo, UCLA


Logic
-
based query languages represented a cornerstone of the relational database model
introduced by E.F. Codd, and were then significantly extended with the development of Datalog
and other deductive database languages

that support rules and recursive queries. This line of
research produced elegant semantics and implementation techniques, and delivered the
enabling technology for the efficient implementation of recursive queries with stratified negation
supported by com
mercial DBMS and specified by SQL standards. After a lull of several years, the
spread of web
-
age information systems and applications has generated a vigorous resurgence of
interest leading to a "Springtime for Datalog" according to Joseph M. Hellerstein.

It was indeed in
his lab that the idea of logic
-
based declarative specification and design of Internet protocols and
services first emerged; more recent work seeks to extend this approach and develop
Datalog
-
based foundations for parallel and distributed
programming languages. On the Semantic
Web front, a novelty of great interest is represented by the introduction of Linear Datalog for
expressing and supporting efficiently subsets of description logic and reasoning for ontological
queries [2]. Two other s
imple extensions of Datalog of signi_cant practical interest have been
proposed very recently: one is intended for complex graph queries, including page
-
rank queries,
while the other supports more powerful continuous queries on data streams. Finally, novel

challenges and opportunities are emerging at the implementation level, making it possible to
support recursive queries in a Map
-
Reduce environment. This tutorial will provide a unified
treatment of these recent advances by building on the formal foundatio
ns of Datalog (e.g., its
fixpoint and non
-
monotonic semantics).


Lecturer

Biography
:
Carlo Zaniolo


Carlo Zaniolo is currently a Professor of Computer Science at the University of California, Los
Angeles. He is the first holder of the N. E. Friedmann Chair

in Knowledge Science, and director of
the UCLA Web Information System Laboratory. Before joining UCLA, Carlo Zaniolo acquired
twenty+ years of R&D experience in the computer and telecommunication industry. From 1971
till 1976 he was with Burroughs Corpora
tion, Pasadena, California, and from 1976 till 1980 with
the Sperry Research Center, in Sudbury, Massachusetts. From 1980 till 1984, he was a senior
member of the technical staff with AT&T Bell Laboratories, in Murray Hill, NJ. In 1984, he joined
MCC, a re
search consortium of US high
-
tech companies located in Austin Texas. At MCC, Carlo
Zaniolo served as Associate Director in the Advanced Computer Technology Program, and as the
technical leader and manager of the LDL++ project, a leading research project on

logic and
databases. Carlo Zaniolo received a Ph.D. degree in Computer Science from the University of
California, Los Angeles, in 1976, and an M.S. degree in Electrical Engineering from Padua
University, Italy, in 1969. Carlo Zaniolo's research activities

cover several areas of computing,
including data mining, database management systems, intelligent information systems, and data
stream management systems
-
he published more than 200 refereed publications and a graduate
textbook entitled "Advanced Database
Systems," Morgan Kaufmann in 1997. He served as
program chair (or co
-
chair) of the International Conference Very Large Data Bases in 1981 and
1994 (VLDB 1981 & VLDB 1994), the International ACM Conference on Management of Data in
1986 (SIGMOD 1986), the In
ternational Conference on Extending Database Technology in 2000
(EDBT 2000). He also served as general chair or co
-
chair of the 1990 North American Conference
on Logic Programming (NACLP 1990), the 1996 International Workshop on Logic in Databases (LID
196
6) and the IEEE International Conference on Data Mining 2008 (ICDM 2008), and the
HotSWUp 2011 Workshop. He an associate editor for the International Journal on Very Large
Data Bases (VLDB Journal). He served on the PC of more than 140 international confer
ences.


Course 2: Big Data and Cloud Computing

Lecturer: Prof. Amr Abbadi, UCSB


Cloud computing is an extremely successful paradigm of service oriented computing, and has
revolutionized the way computing infrastructure is used. This success has seen a
proliferation in
the number of applications which leverage various cloud platforms, resulting in a tremendous
increase in the scale of the data generated as well as consumed by such applications. Scalable
database management systems, both for update intens
ive application workloads, as well as
decision support systems for descriptive and deep analytics, are thus a critical part of the cloud
infrastructure. The quest for conquering the challenges posed by the management of big data
has led to a plethora of sy
stems. This tutorial aims to clarify some of the critical concepts in the
design space of big data and cloud computing such as: the appropriate systems for a specific set
of application requirements, the research challenges in data management for the cloud
, and
what is novel in the cloud for database researchers? We also aim to address one basic question:
Whether cloud computing poses new challenges in scalable data management or it is just a
reincarnation of old problems? We provide a comprehensive backgro
und study of
state
-
of
-
the
-
art systems for scalable data management and analysis. We also identify the critical
aspects in the design of different systems and the applicability and scope of these systems. A
thorough understanding of current solutions and a
precise characterization of the design space
are essential for clearing the "cloudy skies of data management" and ensuring the success of
DBMSs in the cloud, thus emulating the success enjoyed by relational databases in traditional
enterprise settings.


Co
urse 3: Information Diffusion In Social Networks

Lecturer: Prof. Amr Abbadi, UCSB


With hundreds of millions of users worldwide, social networks provide great opportunities for
social connection, learning, political and social change, as well as individual

entertainment and
enhancement in a wide variety of forms. In fact, social networks have already emerged as a
significant medium for the widespread distribution of news and instructions in mass convergence
events. In light of these notable outcomes, unders
tanding information diffusion over online social
networks is a critical research goal. This greater understanding can be achieved through data
analysis, the development of reliable models that can predict outcomes of social processes, and
ultimately the cr
eation of applications that can shape the outcome of these processes. In this
tutorial, we aim to provide an overview of such recent research based on a wide variety of
techniques such as optimization algorithms, data mining, data streams covering a large
number
of problems such as influence spread maximization, misinformation limitation and study of
trends in online social networks.


Lecturer

Biography
:
Amr Abbadi


Amr El Abbadi

is currently a Professor in the Department of Computer Science at the University
of California, Santa Barbara. He has held visiting professor positions at the University of Campinas
in Brazil, IBM Almaden Research Center, the Swedish Institute of Computer

Science in Stockholm,
at IRISA at the University of Rennes in France, and a visiting Professor in Cairo University. He was
Vice Chair of ICDCS 1999, Vice Chair for ICDE 2002, the Americas Program Chair for VLDB 2000,
the Program co
-
Chair for SIGSPATIAL 20
10 and SOCC 2011. He served as a board member of the
VLDB Endowment from 2002 to 2008, and is currently on the ACM SIGMOD Advisory Board. In
2007, Prof. El Abbadi received the UCSB Senate Outstanding Mentorship Award for his excellence
in mentoring graduat
e students. He is an ACM fellow. His research interests lie in the broad area
of scalable database and distributed systems.


Course 4: Reflections on Big Data and Enterprise Analytics (Also WAIM 2011 Keynote Speech)

Lecturer: Dr. Surajit Chaudhuri, Micros
oft Research


Lecturer

Biography
:
Surajit Chaudhuri


Surajit Chaudhuri is a Principal Researcher and the Research Manager overseeing data
management research activities at Microsoft Research, Redmond. His areas of interest include
self
-
tuning technology fo
r databases, query optimization, data cleaning, and multi
-
tenant
database systems. Working with his colleagues in Microsoft Research and the Microsoft SQL
Server team, he helped incorporate the Index Tuning Wizard
--
and subsequently Database Engine
Tuning A
dvisor
--
into Microsoft SQL Server. He initiated a project on data cleaning at Microsoft
Research whose technology now ships in Microsoft SQL Server Integration Services. He also led
the development of CMT, a conference management web service hosted by Micr
osoft Research
since 1999 for the academic community. Surajit is an ACM Fellow, received the ACM SIGMOD
Contributions Award in 2004, the VLDB 10
-
Year Best Paper Award in 2007, and the ACM SIGMOD
Edgar F. Codd Innovations Award in 2011. He is currently a me
mber of the VLDB Endowment
Board and the ACM SIGMOD Advisory Board. He was the Program Committee Chair for ACM
SIGMOD 2006, a Program
-
Committee Co
-
Chair of ACM SIGKDD 1999 and ACM Symposium of
Cloud Computing 2010. He has served on the editorial boards of
ACM TODS and IEEE TKDE.
Surajit received his Ph.D. from Stanford University and B.Tech. from the Indian Institute of
Technology, Kharagpur.


Course 5: Graph Database Engine and Computation Platform

Lecturer: Dr. Haixun Wang, Microsoft Research


Graph is an

abstract data structure that has high expressive power. Many real
-
life applications can
be modeled by graphs, including biological networks, semantic web and social networks. Thus, a
graph engine is important to many applications. Currently, there are sev
eral players in this field,
including Neo4j, HyperGraphDB, InfiniteGraph, etc. Neo4j is a disk
-
based transactional graph
database. HyperGraphDB is based on key/value pair store Berkeley DB. InfiniteGraph is a
distributed system for large graph data analysi
s. In 2009, Google announced Pregel as its large
scale graph processing platform. Pregel is a batch system, and it does not support online query
processing or graph serving. Trinity is a graph database and computation platform over
distributed memory cloud
. As a database, it provides features such as highly concurrent query
processing, transaction, consistency control. As a computation platform, it provides synchronous
and asynchronous batch
-
mode computations on large scale graphs. Trinity can be deployed o
n
one machine or hundreds of machines.


Course 6: Knowledge Bases: Linking database, AI, and information retrieval

Lecturer: Dr. Haixun Wang, Microsoft Research


Integrating, representing, and reasoning over human knowledge is a computational grand
challen
ge for the 21st century. In this course, I will introduce the Probase project at Microsoft
Research Asia. The goal of the Probase project is to enable machines to understand human
communications. Much interest has been devoted to building universal ontolog
ies, either
automatically constructed or built by community effort, but these have limited scope. Freebase,
the best
-
known community
-
built taxonomy, contains approximately 1,500 concepts, a far cry
from covering everything that exists. Probase is a univers
al, probabilistic taxonomy more
comprehensive than any current taxonomy. It contains more than 2 million concepts, harnessed
automatically from a corpus of 1.68 billion web pages and two years’ worth of search
-
log data. It
enables probabilistic interpretat
ions of this information. The probabilistic nature enables it to
incorporate heterogeneous information naturally. I will explain how the core taxonomy, which
contains hypernym
-
hyponym relationships, is constructed and how it models knowledge's
inherent unc
ertainty, ambiguity, and inconsistency.


Lecturer

Biography
:
Haixun Wang


Haixun Wang is a lead researcher at Microsoft Research Asia in Beijing, China. Before joining
Microsoft, he had been a research staff member at IBM T. J. Watson Research Center for 9 years.
He was Technical Assistant to Stuart Feldman (Vice President of Co
mputer Science of IBM
Research) from 2006 to 2007, and Technical Assistant to Mark Wegman (Head of Computer
Science of IBM Research) from 2007 to 2009. Haixun Wang has published more than 120
research papers in referred international journals and conferenc
e proceedings. He is associate
editor of IEEE Transactions of Knowledge and Data Engineering (TKDE), Journal of Computer
Science and Technology (JCST). He is PC co
-
Chair of CIKM 2012, ICMLA 2011, WAIM 2011. Haixun
Wang got the ER 2008 Conference best paper

award (DKE 25 year award), and ICDM 2009 Best
Student Paper run
-
up award.