GIST

Gesture Interpretation using Spatio-Temporal analysis

Note: This page describes some work I did in 1997, I will put new results on this work soon!


GIST

  1. Overview
  2. Motion Segmentation
  3. Feature Detection
  4. Motion Trajectory
  5. Spatio-Temporal Analysis
  6. Content-Based Video Retrieval


  1. Overview

    GIST (Gesture Interpretation using Spatio-Temporal analysis) project is an attempt to recognize and interpret sign gestures of American Sign Language from a video sequence based on an integrated method of motion segmentation, shape, size and color. A multi-scale motion segmentation based on Ahuja's New Transform is applied to a video sequence to get motion regions and their correspondence across frames. Regions of interest, such as fingertip, palm and elbow, are extracted from motion segmented images by formulating and solving a constraint satisfaction problem. From these joints, pixel trajectories are extracted. A spatio-temporal analysis based on time-delay neural network is applied to classify these patterns. The ultimate goal of GIST is to allow content-based video retrieval based on video clips and better understanding of motion segmentation.

    Summary:

  2. Motion Segmentation

  3. Feature Detection

  4. Motion Trajectory

  5. Spatio-Temporal Analysis

  6. Content-Based Video Retrieval