University of Notre Dame
Browse
SteinhaeuserK092011D.pdf (7.85 MB)

Viewing the World Through a Network Lens

Download (7.85 MB)
thesis
posted on 2011-04-15, 00:00 authored by Karsten Steinhaeuser
Most conventional data structures and data analysis methods were designed with simple transaction data in mind. However, data miners are increasingly presented with more complex datasets that have embedded within them some relationships or dependencies. Incorporating these relationships into the data mining process can pose both algorithmic as well as computational challenges, but there is also a tremendous opportunity to leverage them as an additional source of information. Indeed, we believe that there is relational structure in every dataset, which can be exploited for analysis and learning if a suitable data representation is used. In this dissertation, we take a look at the world through a 'network lens', that is, we advocate the use of networks for representing and analyzing complex datasets from various domains. First, we propose a methodological advance in the form of a novel algorithm for identifying community structure in networks that is relevant across many domains. Second, we present applications wherein we impose the network view on datasets that do not contain explicit relationships and show how the 'network lens' brings into focus some interesting and potentially useful patterns in the data. Specifically, in climate science we demonstrate the value of networks as a unified framework for descriptive analysis and predictive modeling, which has led to some novel insights in the domain.

History

Date Modified

2017-06-02

Defense Date

2010-12-07

Research Director(s)

Jessica Hellman

Committee Members

Nitesh Chawla Patrick Flynn Auroop Ganguly Edward Bensman Kevin Bowyer

Degree

  • Doctor of Philosophy

Degree Level

  • Doctoral Dissertation

Language

  • English

Alternate Identifier

etd-04152011-122049

Publisher

University of Notre Dame

Program Name

  • Computer Science and Engineering

Usage metrics

    Dissertations

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC