University of Notre Dame
Browse

Shopping with Networks: An Approach to Market Basket Analysis

Download (722.78 kB)
thesis
posted on 2009-04-16, 00:00 authored by Troy Raeder
The market basket problem, the search for meaningful associations in customer purchase data, is one of the oldest problems in data mining. The typical solution involves the mining and analysis of association rules, which take the form of statements such as ``people who buy diapers are likely to buy beer.' It is well-known, however, that typical transaction datasets can support hundreds or thousands of obvious association rules for each interesting rule, and filtering through the rules is a non-trivial task. One may use an interestingness measure to quantify the usefulness of various rules, but there is no single agreed-upon measure and different measures can result in very different rankings of association rules.

In this thesis, we take a different approach to mining transaction data. By modeling the data as a product network, we discover more expressive communities (clusters) in the data, which can then be targeted for further analysis. We demonstrate that the network based approach can isolate influence among products without excessive ambiguous associations. We further consider a collaborative marketplace, where it may be beneficial for the market for stores to share their product networks. To that end, we propose a robust privacy preserving protocol that encourages stores to share their product network without compromising their individual information. We demonstrate the effectiveness of the product networks and the privacy preserving protocol on a real-world store data. Finally, we build upon our experience with product networks to propose a comprehensive analysis strategy by combining both traditional and network-based techniques.

History

Date Modified

2017-06-05

Research Director(s)

Nitesh V. Chawla

Committee Members

Patrick J. Flynn Marina Blanton

Degree

  • Master of Science in Computer Science and Engineering

Degree Level

  • Master's Thesis

Language

  • English

Alternate Identifier

etd-04162009-105752

Publisher

University of Notre Dame

Additional Groups

  • Computer Science and Engineering

Program Name

  • Computer Science and Engineering

Usage metrics

    Masters Theses

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC