University of Notre Dame

Design of a Data Repository for a Long-Running Physics Experiment

thesis
posted on 2010-12-14, 00:00 authored by Michael Albrecht
Dataset sizes for scientific experiments are expanding at a prodigious rate. Even small-scale laboratories can produce terabytes of raw data each year. This data needs to be stored, but also needs to be analyzed in order to make it anything other than a waste of space. Furthermore, in areas like physics, scientists are frequently looking for interesting events or trends amongst a sea of boring data, making visualization and mass analysis very important.

One experiment that follows this pattern is the Gamma Ray Astrophysics experiment at Notre Dame (GRAND). In this work I discuss the needs and constraints of data repositories for data-intensive scientific experiments in the context of developing such a system for GRAND. Challenges such as storing large datasets, interface design, fast data analysis, and large-scale data visualization are examined, and solutions are presented in the form of distributed storage and parallel computation.

History

Date Modified

2017-06-02

Research Director(s)

Douglas Thain

Committee Members

Scott Emrich, Greg Madey

Degree

  • Master of Science in Computer Science and Engineering

Degree Level

  • Master's Thesis

Language

  • English

Alternate Identifier

etd-12142010-161930

Publisher

University of Notre Dame

Program Name

  • Computer Science and Engineering
