Carnegie Mellon University
Browse
Feng_CMU-CS-21-120.pdf (16.15 MB)

Human-efficient Discovery of Edge-based Training Data for Visual Machine Learning

Download (16.15 MB)
thesis
posted on 2022-05-04, 18:56 authored by Ziqiang FengZiqiang Feng

Deep learning enables effective computer vision without hand crafting feature extractors. It has great potential if applied to specialized domains such as ecology, military, and medical science. However, the laborious task of creating labeled training sets of rare targets is a major deterrent to achieving its goal. A domain expert’s time and attention is precious. We address this problem by designing, implementing, and evaluating Eureka, a system for human-efficient discovery of rare phenomena from unlabeled visual data. Eureka’s central idea is interactive contentbased

search of visual data based on early-discard and machine learning. We first demonstrate its effectiveness for curating training sets of rare objects. By analyzing contributing factors to human efficiency, we identify and evaluate important systemlevel optimizations that utilize edge computing and intelligent storage. Lastly, we extend Eureka to the task of discovering temporal events from video data.

History

Date

2021-08-23

Degree Type

  • Dissertation

Department

  • Computer Science

Degree Name

  • Doctor of Philosophy (PhD)

Advisor(s)

Mahadev Satyanarayanan

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC