Document Type
Dissertation
Degree
Doctor of Philosophy (PhD)
Major/Program
Computer Science
First Advisor's Name
Tao Li
First Advisor's Committee Title
Committee Chair
Second Advisor's Name
Sundaraja Sitharama Iyengar
Second Advisor's Committee Title
Committee Member
Third Advisor's Name
Shu-Ching Chen
Third Advisor's Committee Title
Committee Member
Fourth Advisor's Name
Jinpeng Wei
Fourth Advisor's Committee Title
Committee Member
Fifth Advisor's Name
Wensong Wu
Fifth Advisor's Committee Title
Committee Member
Keywords
Data Mining, Temporal Data Mining
Date of Defense
3-23-2015
Abstract
Many systems and applications are continuously producing events. These events are used to record the status of the system and trace the behaviors of the systems. By examining these events, system administrators can check the potential problems of these systems. If the temporal dynamics of the systems are further investigated, the underlying patterns can be discovered. The uncovered knowledge can be leveraged to predict the future system behaviors or to mitigate the potential risks of the systems. Moreover, the system administrators can utilize the temporal patterns to set up event management rules to make the system more intelligent.
With the popularity of data mining techniques in recent years, these events grad- ually become more and more useful. Despite the recent advances of the data mining techniques, the application to system event mining is still in a rudimentary stage. Most of works are still focusing on episodes mining or frequent pattern discovering. These methods are unable to provide a brief yet comprehensible summary to reveal the valuable information from the high level perspective. Moreover, these methods provide little actionable knowledge to help the system administrators to better man- age the systems. To better make use of the recorded events, more practical techniques are required.
From the perspective of data mining, three correlated directions are considered to be helpful for system management: (1) Provide concise yet comprehensive summaries about the running status of the systems; (2) Make the systems more intelligence and autonomous; (3) Effectively detect the abnormal behaviors of the systems. Due to the richness of the event logs, all these directions can be solved in the data-driven manner. And in this way, the robustness of the systems can be enhanced and the goal of autonomous management can be approached.
This dissertation mainly focuses on the foregoing directions that leverage tem- poral mining techniques to facilitate system management. More specifically, three concrete topics will be discussed, including event, resource demand prediction, and streaming anomaly detection. Besides the theoretic contributions, the experimental evaluation will also be presented to demonstrate the effectiveness and efficacy of the corresponding solutions.
Identifier
FI15032122
Recommended Citation
Jiang, Yexi, "Temporal Mining for Distributed Systems" (2015). FIU Electronic Theses and Dissertations. 1909.
https://digitalcommons.fiu.edu/etd/1909
latex related files
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons
Rights Statement
In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/
This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).