New Approaches in Data for AI-Driven Forensics Applications

Shen, Bingyu

doi:10.7274/tb09j389k3v

New Approaches in Data for AI-Driven Forensics Applications

thesis

posted on 2021-11-30, 00:00 authored by Bingyu Shen

Development in deep learning algorithms significantly boosts the performance of digital forensics applications, such as media phylogeny, optical character recognition(OCR), face recognition, and scene text spotting. As the neural networks achieved advanced results, the depth, and width of the architecture inevitably increase. Neural networks are data-driven which indicates that the more complex a neural network is, the more data it requires to train to converge. Therefore it is of great importance to generate and provide enough training data with good quality to models.

In this work, I studied various approaches to collect, augment and generate training data to help to solve forensics tasks using deep learning algorithms. I firstly explored traditional and state-of-the-art Natural Language Processing algorithms and applied them to forensics tasks in real-world settings. Then I integrated a language model into a handwritten text recognition task to correct texts recognized by OCR algorithms. The training process is benefited from generating synthetic texts with spelling errors.

Moving from language-related forensics tasks to forensics in the computer vision area, I attempted to propose methods to generate data that can help the training process of various deep learning algorithms. I developed a 3D engine-powered virtual city generator for synthetic data collection. This virtual city generator can create an unlimited amount of synthetic data for various city scene-related computer vision tasks. I then applied it to a forensics study relates to the human perception of synthetic faces. Finally, I proposed a method to solve the arbitrarily shaped scene text detection problem. Scene texts provide important forensics information and how to extract them accurately has been an active research area for years. I proposed a modified loss function that can help the regression of polygon shapes which significantly simplifies the model architecture and expedite the training process. Therefore we can achieve arbitrarily shaped scene text detection with a comparable performance of state-of-the-art algorithms at a real-time speed. The training process is supported by a new synthetic data generation approach that focused on curved and irregular-shaped text instances. In the end, I created a new large-scale synthetic scene texts dataset that contributes to the research community.

History

Date Modified

2021-12-17

Defense Date

2021-09-17

CIP Code

40.0501

Research Director(s)

Walter Scheirer

Committee Members

Adam Czajka Alice J. O’Toole Kevin Bowyer

Degree

Doctor of Philosophy

Degree Level

Doctoral Dissertation

Alternate Identifier

1288686271

Library Record

6155255

OCLC Number

1288686271

Rights Statement

https://creativecommons.org/licenses/by-nc/3.0/

Additional Groups

Computer Science and Engineering

Program Name

Computer Science and Engineering

New Approaches in Data for AI-Driven Forensics Applications

History

Date Modified

Defense Date

CIP Code

Research Director(s)

Committee Members

Degree

Degree Level

Alternate Identifier

Library Record

OCLC Number

Rights Statement

Additional Groups

Program Name

Usage metrics

Categories

Keywords

Licence

Exports