University of Notre Dame
Browse
HoensTR042012D.pdf (1.46 MB)

Living in an Imbalanced World

Download (1.46 MB)
thesis
posted on 2012-04-19, 00:00 authored by Thomas Ryan Hoens

Classification is one of the most fundamental tasks in the machine learning and data mining communities. One of the most common challenges faced when trying to perform classification is the class imbalance problem. The introduction of class imbalance into the classification problem poses serious and interesting challenges which must be met in order to provide knowledge. An orthogonal problem to class imbalance arises due to concept drift in data streams. Due to the complexity of each of the issues---much less both in tandem---the combination of class imbalance and concept drift are very understudied.

In this dissertation we discuss classification in an imbalanced world from a variety of angles. First, we propose methods to overcome class imbalance in its simplest incarnation. In subsequent chapters, we remove restrictions in order to provide novel solutions and insights. By the end of this dissertation, we will present a wide variety of solutions to the class imbalance problem, including the combination of class imbalance and concept drift.

Before beginning, however, we present intuitive (and mathematical) definitions of class imbalance and concept drift, as well as an overview of the state of the art methods in each community.

History

Date Modified

2017-06-02

Defense Date

2012-03-28

Research Director(s)

Nitesh V. Chawla

Committee Members

David A. Cieslak Kevin W. Bowyer W. Philip Kegelmeyer

Degree

  • Doctor of Philosophy

Degree Level

  • Doctoral Dissertation

Language

  • English

Alternate Identifier

etd-04192012-034057

Publisher

University of Notre Dame

Program Name

  • Computer Science and Engineering

Usage metrics

    Dissertations

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC