Regie Felix

throneharshBiotechnology

Oct 2, 2013 (3 years and 11 months ago)

95 views

Regie Felix, B.S. Bioinformatics at CSUSB

Machine Learning Research for the Center of Bio
-
Image Informatics in UC Santa Barbara

Mentor: Nazli Dereli


Private Investigator: Dr. Ambuj Singh

Title:
Time Series Analysis and Machine Learning Techniques on
Various Datasets


Time series is a sequence of data that is

taken in consistent time intervals.

One is able to analyze
trends within the data and use them to predict what will happen in the future. This process is
called time series analysis, which consist
s of three steps: preprocessing, analysis, and
diagnostic
checking. We analyzed a dataset that expressed the amount of air passengers from 1949 to 1960

and developed a
model that
correctly
illustrated the fluctuations of the
data
.
Another type of
analysis
is categorizing the data via time series classification. Machine learning techniques, such
as decision trees and artificial neural networks, are used for this type of classification.

For this
analysis, we used a UCI KDD dataset of EEG sensor values of 20 p
atients (10 alcoholic and 10
non
-
alcoholics) while they were looking at three different stimuli: one picture, two pictures that
match,
and
two pictures that do not match. Our goal was to correctly classify the data so that by
just the EEG results, the mode
l would be able to predict the status of the patient.

Our accuracy
for both decision trees and ANN were low at first; we then tried revised our project by trying
different sensors, increasing the number of data points, and different classifiers.

This
class
ification project on
-
going; we are still trying more machine learning techniques to increase
the accuracy of our model.