Principles for the Design and Operation of Elastic Scientific Applications on Distributed Systems

Rajan Pandiarajan, Dinesh

doi:10.7274/kh04dn42580

PandiarajanDR042015D.pdf (1.76 MB)

Principles for the Design and Operation of Elastic Scientific Applications on Distributed Systems

thesis

posted on 2015-04-17, 00:00 authored by Dinesh Rajan Pandiarajan

Scientific applications often harness the concurrency in their workloads to par- tition and operate them as independent tasks and achieve reasonable performance. To improve performance at scale, the partitions are operated in parallel on large pools of resources in distributed computing systems, such as clouds, clusters, and grids. However, the exclusive and on-demand deployment of applications on these platforms presents challenges. The target hardware is unknown until runtime and variable between deployments when applications are deployed on these platforms. So operating parameters such as the number of partitions and the instances to provision for execution must be determined at runtime for efficient operation. In this work, I build and demonstrate elastic applications to provide the desired characteristics for operation on distributed computing systems. I present case-studies of elastic applications from different scientific domains and draw broad observations on their design and the challenges to their efficient operation. I develop and evaluate techniques at the middleware and the application layer to achieve efficient operation. In effect, the presented techniques create self-operating elastic applications that dy- namically determine the partitions of their workloads and the scale of resources to utilize. I conclude by showing that self-operating applications achieve high time- and cost-efficiency in their deployed environments in distributed computing systems.

History

Date Modified

2017-06-02

Defense Date

2015-04-06

Research Director(s)

Douglas Thain

Committee Members

Scott Emrich Aaron Striegel Jesus Izaguirre

Degree

Doctor of Philosophy

Degree Level

Doctoral Dissertation

Language

English

Alternate Identifier

etd-04172015-014334

Publisher

University of Notre Dame

Program Name

Computer Science and Engineering

Usage metrics

Keywords

resource allocation cloud computing workload partitioning Elastic applications concurrent applications scientific applications data partitioning distributed computing

Licence

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Principles for the Design and Operation of Elastic Scientific Applications on Distributed Systems

History

Date Modified

Defense Date

Research Director(s)

Committee Members

Degree

Degree Level

Language

Alternate Identifier

Publisher

Program Name

Usage metrics

Categories

Keywords

Licence

Exports