SteinhaeuserK092011D.pdf (7.85 MB)
Viewing the World Through a Network Lens
thesis
posted on 2011-04-15, 00:00 authored by Karsten SteinhaeuserMost conventional data structures and data analysis methods were designed with simple transaction data in mind. However, data miners are increasingly presented with more complex datasets that have embedded within them some relationships or dependencies. Incorporating these relationships into the data mining process can pose both algorithmic as well as computational challenges, but there is also a tremendous opportunity to leverage them as an additional source of information. Indeed, we believe that there is relational structure in every dataset, which can be exploited for analysis and learning if a suitable data representation is used. In this dissertation, we take a look at the world through a 'network lens', that is, we advocate the use of networks for representing and analyzing complex datasets from various domains. First, we propose a methodological advance in the form of a novel algorithm for identifying community structure in networks that is relevant across many domains. Second, we present applications wherein we impose the network view on datasets that do not contain explicit relationships and show how the 'network lens' brings into focus some interesting and potentially useful patterns in the data. Specifically, in climate science we demonstrate the value of networks as a unified framework for descriptive analysis and predictive modeling, which has led to some novel insights in the domain.
History
Date Modified
2017-06-02Defense Date
2010-12-07Research Director(s)
Jessica HellmanCommittee Members
Nitesh Chawla Patrick Flynn Auroop Ganguly Edward Bensman Kevin BowyerDegree
- Doctor of Philosophy
Degree Level
- Doctoral Dissertation
Language
- English
Alternate Identifier
etd-04152011-122049Publisher
University of Notre DameProgram Name
- Computer Science and Engineering
Usage metrics
Categories
No categories selectedLicence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC