jiawei han - Computer Science

desertcockatooΔιαχείριση Δεδομένων

20 Νοε 2013 (πριν από 3 χρόνια και 11 μήνες)

112 εμφανίσεις

5143 Cass Avenue


431 State Hall


Detroit, Michigan 48202


+1.313.577.2477


Fax +1.313.577.6868


http://w
ww.cs.wayne.edu




D
EPARTMENT OF
C
OMPUTER

S
CIENCE



Mining and Searching Graphs in Biological Databases



Jiawei Han

Department of Computer Science

University of Illinois at Urbana
-
Champaign

http://www.cs.uiuc.edu/~hanj


Tuesd
ay,
October 4
, 2005

3:00PM 110

Purdy
-
Kresge Library


Abstract
:

Recent research on pattern discovery has progressed from mining frequent item sets and sequences
to mining structured patterns including trees, lattices, and graphs. As a general data structu
re, graph
can model complicated relations among data with wide applications in bioinformatics. However,
mining and searching large graphs in graph databases is challenging due to the presence of an
exponential number of frequent subgraphs.


In this talk
, we present our recent progress on developing efficient and scalable methods for mining
and searching of graphs in large biological databases. We first introduce gSpan, an efficient method
for mining all the frequent graph patterns in graph databases, by

extension of a depth
-
first frequent
pattern growth method, developed in our previous research. Then we introduce CloseGraph, an
efficient method for mining closed frequent graph patterns. A graph g is closed in a database if there
exists no proper super
graph of g that has the same support as g. After that, we introduce a graph
indexing method, gIndex and a graph approximate searching method, grafil, both taking advantages
of frequent graph mining to construct a compact but highly effective graph index a
nd perform
similarity search with such indexing structures. These methods facilitate mining and querying graph
patterns in large biological databases. Our performance study shows the high promise of our
approach.


Bio:

Jiawei Han, Professor, Department o
f Computer Science, University of Illinois at Urbana
-
Champaign.
He has been working on research into data mining, data warehousing, database systems, spatial and
multimedia databases, deductive and object
-
oriented databases, Web databases, and biological
databases, with over 300 journal and conference publications. He has chaired or served in many
program committees of international conferences and workshops, including 2001 and 2002 SIAM
-
Data Mining Conference (PC co
-
chair), 2004 and 2002 International Co
nferences on Data
Engineering (PC vice
-
chair), 2005 International Conferenceon Data Mining (PC co
-
chair), ACM
SIGKDD conferences, and ACM SIGMOD conferences.He also served or is serving on the editorial
boards for Data Mining and Knowledge Discovery, IEEE
Transactions on Knowledge and Data
Engineering, and Journal of Intelligent Information Systems and the Editor
-
In
-
Cheif of ACM
Transactions on Knoweldge Discovery from Data. He is an ACM Fellow and has received 2004 ACM
SIGKDD Innovations Award. His textb
ook "Data Mining: Concepts and Techniques" (Morgan
Kaufmann, 2001) has been popularly used for data mining courses in universities
.