Series
GVU Technical Report Series

Series Type
Publication Series
Description
Associated Organization(s)
Associated Organization(s)
Organizational Unit

Publication Search Results

Now showing 1 - 6 of 6
  • Item
    Learning for Ground Robot Navigation with Autonomous Data Collection
    (Georgia Institute of Technology, 2005) Su, Jie ; Rehg, James M. ; Bobick, Aaron F.
    Robot navigation using vision is a classic example of a scene understanding problem. We describe a novel approach to estimating the traversability of an unknown environment based on modern object recognition methods. Traversability is an example of an affordance jointly determined by the environment and the physical characteristics of a robot vehicle, whose definition is clear in context. However, it is extremely difficult to estimate the traversability of a given terrain structure in general, or to find rules which work for a wide variety of terrain types. However, by learning to recognize similar terrain structures, it is possible to leverage a limited amount of interaction between the robot and its environment into global statements about the traversability of the scene. We describe a novel on-line learning algorithm that learns to recognize terrain features from images and aggregate the traversability information acquired by a navigating robot. An important property of our method, which is desirable for any learning-based approach to object recognition, is the ability to autonomously acquire arbitrary amounts of training data as needed without any human intervention. Tests of our algorithm on a real robot in complicated unknown natural environments suggest that it is both robust and efficient.
  • Item
    On the Design of Cascades of Boosted Ensembles for Face Detection
    (Georgia Institute of Technology, 2005) Brubaker, S. Charles ; Wu, Jianxin ; Sun, Jie ; Mullin, Matthew D. ; Rehg, James M.
    Cascades of boosted ensembles have become popular in the object detection community following their highly successful introduction in the face detector of Viola and Jones. Since then, researchers have sought to improve upon the original approach by incorporating new methods along a variety of axes (e.g. alternative boosting methods, feature sets, etc). We explore several axes that have not yet received adequate attention in this context: cascade learning, stronger weak hypotheses, and feature filtering. We present a novel strategy to determine the appropriate balance between false positive and detection rates in the individual stages of the cascade, enabling us to control our experiments to a degree not previously possible. We show that while the choice of boosting method has little impact on the detector's performance and feature filtering is largely ineffective, the use of stronger weak hypotheses based on CART classifiers can significantly improve upon the standard results.
  • Item
    A Variational inference method for Switching Linear Dynamic Systems
    (Georgia Institute of Technology, 2005) Oh, Sang Min ; Ranganathan, Ananth ; Rehg, James M. ; Dellaert, Frank
    This paper aims to present a structured variational inference algorithm for switching linear dynamical systems (SLDSs) which was initially introduced by Pavlovic and Rehg. Starting with the need for the variational approach, we proceed to the derivation of the generic (model-independent) variational update formulas which are obtained under the mean field assumption. This leads us to the derivation of an approximate variational inference algorithm for an SLDS. The details of deriving the SLDS-specific variational update equations are presented.
  • Item
    Fast Asymmetric Learning for Cascade Face Detection
    (Georgia Institute of Technology, 2005) Wu, Jianxin ; Brubaker, S. Charles ; Mullin, Matthew D. ; Rehg, James M.
    A cascade face detector uses a sequence of node classifiers to distinguish faces from non-faces. This paper presents a new approach to design node classifiers in the cascade detector. Previous methods used machine learning algorithms that simultaneously select features and form ensemble classifiers. We argue that if these two parts are decoupled, we have the freedom to design a classifier that explicitly addresses the difficulties caused by the asymmetric learning goal. There are three contributions in this paper. The first is a categorization of asymmetries in the learning goal, and why they make face detection hard. The second is the Forward Feature Selection (FFS) algorithm and a fast caching strategy for AdaBoost. FFS and the fast AdaBoost can reduce the training time by approximately 100 and 50 times, in comparison to a naive implementation of the AdaBoost feature selection method. The last contribution is Linear Asymmetric Classifier (LAC), a classifier that explicitly handles the asymmetric learning goal as a well-defined constrained optimization problem. We demonstrated experimentally that LAC results in improved ensemble classifier performance.
  • Item
    Oil Painting Assistance Using Projected Light: Bridging the Gap Between Digital and Physical Art
    (Georgia Institute of Technology, 2005) Flagg, Matthew ; Rehg, James M.
    This paper presents a novel interactive system for guiding artists to paint using traditional media and tools. The enabling technology is a multi-projector display capable of controlling the appearance of an artist's canvas. Artists are guided by this display-on-canvas to execute painting techniques. The artist paints according to a linear process of painting by numbers, one layer at a time. Each layer is painted using a set of interaction modes. Preview mode shows the entire layer as the current painting goal. Blank mode shows the state of the painting. Color selection mode displays where to paint a certain color, orientation mode shows how to paint it, and texture highlight mode enhances the texture of the paint following its application. These interaction modes enable the novice to focus on painting sub-tasks in order to simplify the painting process while providing technical guidance ranging from high-level composition to detailed brushwork. In addition to assisting artists for painting, we discuss how our system could be extended to sculpture, woodwork, and other areas of the fine arts.
  • Item
    Boosted Bayesian Network Classifiers
    (Georgia Institute of Technology, 2005) Jing, Yushi ; Pavlovic, Vladimir ; Rehg, James M.
    The use of Bayesian networks for classification problems has received significant recent attention. Although computationally efficient, the standard maximum likelihood learning method tends to be suboptimal due to the mismatch between its optimization criteria (data likelihood) and the actual goal of classification (label prediction accuracy). Recent approaches to optimizing classification performance during parameter or structure learning show promise, but lack the favorable computational properties of maximum likelihood learning. In this paper we present Boosted Bayesian Network Classifiers, a framework to combine discriminative data-weighting with generative training of intermediate models. We show that Boosted Bayesian network Classifiers encompass the basic generative models in isolation, but improve their classification performance when the model structure is suboptimal. This framework can be easily extended to temporal Bayesian network models including HMM and DBN. On a large suite of benchmark data-sets, this approach outperforms generative graphical models such as naive Bayes, TAN, unrestricted Bayesian network and DBN in classification accuracy. Boosted Bayesian network classifiers have comparable or better performance in comparison to other discriminatively trained graphical models including ELR-NB, ELR-TAN, BNC-2P, BNC-MDL and CRF. Furthermore, boosted Bayesian networks require significantly less training time than all of the competing methods.