Document Type
Dissertation
Major/Program
Computer Science
First Advisor's Name
Raju Rangaswami
First Advisor's Committee Title
Committie Chair
Second Advisor's Name
Evangelis Christidis
Third Advisor's Name
Giri Narasimhan
Fourth Advisor's Name
Jiri Schindler
Fifth Advisor's Name
Kaushik Dutta
Keywords
storage systems, on-disk data layout, disk access patterns, disk performance, storage optimizations, tree/graph layout
Date of Defense
7-28-2009
Abstract
Disk drives are the bottleneck in the processing of large amounts of data used in almost all common applications. File systems attempt to reduce this by storing data sequentially on the disk drives, thereby reducing the access latencies. Although this strategy is useful when data is retrieved sequentially, the access patterns in real world workloads is not necessarily sequential and this mismatch results in storage I/O performance degradation. This thesis demonstrates that one way to improve the storage performance is to reorganize data on disk drives in the same way in which it is mostly accessed. We identify two classes of accesses: static, where access patterns do not change over the lifetime of the data and dynamic, where access patterns frequently change over short durations of time, and propose, implement and evaluate layout strategies for each of these. Our strategies are implemented in a way that they can be seamlessly integrated or removed from the system as desired. We evaluate our layout strategies for static policies using tree-structured XML data where accesses to the storage device are mostly of two kinds - parent-tochild or child-to-sibling. Our results show that for a specific class of deep-focused queries, the existing file system layout policy performs better by 5-54X. For the non-deep-focused queries, our native layout mechanism shows an improvement of 3-127X. To improve performance of the dynamic access patterns, we implement a self-optimizing storage system that performs rearranges popular block accesses on a dedicated partition based on the observed workload characteristics. Our evaluation shows an improvement of over 80% in the disk busy times over a range of workloads. These results show that applying the knowledge of data access patterns for allocation decisions can substantially improve the I/O performance.
Identifier
FI09082405
Recommended Citation
Bhadkamkar, Medha, "Improving Storage Performance Through Layout Optimizations" (2009). FIU Electronic Theses and Dissertations. 98.
https://digitalcommons.fiu.edu/etd/98
Rights Statement
In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/
This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).