Pattern Recognition and Image Processing Laboratory
Department of Computer Science
A714 Wells Hall
Michigan State University
East Lansing MI 48824-1027
George Stockman, Director
(517) 355-5240, FAX: 432-1061
stockman@cps.msu.edu
http://web.cps.msu.edu/prip
Laboratory Brochure
17 April 1995
Introduction
The Pattern Recognition and Image Processing Laboratory of the
Department of Computer Science at Michigan State University is
located in room 350 of the Engineering Building. The PRIP lab
supports the research of faculty, visiting scholars, graduate and
undergraduate students in the areas of pattern recognition, image
processing, computer vision, and vision-guided robotics.
Image processing is the manipulation and transformation of sensed
images without attaching domain knowledge to the image.
Removing random and structured noise, converting from color to
grayscale, enhancing contrast, and reducing resolution are examples
of image processing. Image processing is often done before pattern
recognition to simplify the extraction of features.
Pattern recognition is the process of grouping or categorizing data
based on similar sets of features. Pattern recognition methods apply
to a broad range of problems in addition to machine vision. Humans
reason using general features such as edges, shapes, colors, and
shadows. Pattern recognition can make this higher level of
abstraction usable in computer vision tasks.
Computer vision includes all areas of machine understanding of the
external environment in response to visual input. Computer vision,
like human visual perception, is the primary source for knowledge of
a changing external world.
Vision-guided robotics is a particular application of computer vision
to the problem of guiding mobile robots and robot manipulators.
This application may involve handling both uncertainty in the
precise location of a robot or manipulator and computer-controlled
interaction with the environment being viewed.
Since its founding in 1978, PRIP lab faculty members have received
over five million dollars in research and equipment grants. Research
projects have been funded by the National Science Foundation,
State of Michigan, Department of Defense, Babcock & Wilcox,
DuPont, General Motors, IBM, Innovision, Institute for Defense
Analysis, Massey-Ferguson, NASA, Northrop, Peugeot S.A.,
Siemens, and Texas Instruments.
PRIP faculty have advised a large number of Ph.D. students, and
have published over one hundred journal articles, papers in
conference proceedings, and book chapters. Six books have been
edited or written by PRIP faculty.
PRIP faculty also provide consultation and assistance to projects
located in the Composite Materials and Structures Center, the
Center for Microbial Ecology, and the Departments of Crop and Soil
Science, Agricultural Engineering, Civil Engineering, Mechanical
Engineering, and Biomechanics.
Faculty
University Distinguished Professor Anil K. Jain is Lab Director, an
Associate Editor of the IEEE Transactions on Neural Networks, The
Journal of Pattern Recognition, Pattern Recognition Letters, the
Journal of Mathematical Imaging, and the Journal of Intelligent
Systems.
Professor George Stockman is an Associate Editor of The Journal of
Pattern Recognition.
Assistant Professor John Weng is an Associate Editor of the IEEE
Transactions on Image Processing.
Visiting Scholars and Students
These visiting scholars and graduate students are currently working
with PRIP faculty.
- Jinlong James Chen
- Shaoyun Chen
- Yuntao Cui
- Chitra Dorai
- David Hammond
- Lin Hong
- Sally Howden
- Wey Shiuan Hwang
- Kalle Karu
- Aniati Murni (visiting scholar)
- Sharathcha Pankanti
- Yogesh Pathak
- Nalini Kanta Ratha
- Ron Sass
- Daniel Swets
- Patchrawat Uthaisombut
- Aditya Vailaya
- Gang Wang
- Marilyn Wulfekuhler
- Dr. Bin Yu (visiting scholar)
- Yu Zhong
Equipment
The Department of Computer Science operates its own computer
facilities for students, faculty, and staff. The facilities consist largely
of Sun servers and workstations, with workstations from Apple,
DEC, and SGI for special applications. A full-time Computing
Facilities Coordinator and ten half time graduate assistants keep the
equipment running around the clock.
The PRIP lab operates a wide range of equipment to support the
activities of its researchers. The lab has a full-time system
administrator and a part-time graduate assistant to see that its
equipment is running and to help researchers in its use. The
following is a list of the major equipment available in the PRIP lab.
- Servers
- Sun SPARCserver 690MP file and SunOS compute
server: 4 CPUs, 96 MB memory, 1,200 MB swap, 14 GB
user disk space
- Sun SPARCstation 20 Solaris compute server: 160 MB
memory, 800 MB swap
- Color Workstations
- (5) Sun SPARCstation 2 with 32 MB memory
- (6) Sun SPARCstation 10 with 32 MB memory
- Sun SPARCstation 20 with 24-bit graphics, 32 MB
memory, CD-ROM drive
- Sun SPARCstation 20 with 8-bit graphics, 32 MB
memory, CD-ROM drive
- Sun 4/330 with 40 MB memory, VME bus
- Splash 2 attached processor on one of the SPARCstation
2 machines. Splash 2 uses Xilinx 4010 FPGA based
processing elements.
- Macintosh IIfx with 20 MB memory, Sharp JX300 color
scanner, frame grabber, tape drive
- Macintosh 8100 with 32 MB memory, Radius
VideoVision Studio Pro Pak, DAT drive
- Output Devices
- Apple LaserWriter IIg printer
- Apple LaserWriter Pro630 printer
- Tektronix Phaser III PXi color printer
- (2) Sony GVM-2000 Trinitron color monitor
- Data Acquisition Devices
- Geometric Research 100X laser range scanner (White
Scanner)
- Identix TV-555 Touch View fingerprint scanner
- HP ScanJet IIcx 24-bit color scanner
- Sony SVO-150 VHS VCR
- Sun SunVideo realtime framegrabber
- Sun VideoPix framegrabber
- A variety of Cohu and Panasonic color cameras with a
selection of lenses in a variety of focal lengths
- Robots
- LABMATE Mobile platform with: Sun SPARCstation
1, two SunVideo framegrabbers, serial/parallel expansion
board; Rhino Robots pan/tilt head; infrared and sonar
sensors
- Unimation PUMA 560 Robot arm and controller, which
is interfaced to the Sun 4/330 using RCCL/RCI software
- Storage Devices
- Contemporary Cybernetics CY-ASP/5 dual 8mm tape
drive with hardware compression
- Pioneer DRM-604X CD-ROM changer
- Sun QIC-150 quarter-inch cassette tape drive
- Optical Equipment
- Bencher copystand and lights
- Two optical tables, stages, and accessories
Software Packages
The PRIP lab has a large selection of software on its Sun Unix
systems. Some of the major packages are:
- Allegro Common Lisp 4.2
- CorelDRAW 3.0
- GBB 2.2 from Blackboard Technology Group
- HIPS 2 from SharpImage Software
- AL TE X2e and the TE X tools
- Libraries:
- eispack
- lapack
- linpack
- minpack
- Numerical Recipes in C
- nswc
- Mathematica 2.2 from Wolfram Research
- SNNS 3.1
- S-Plus 3.2 from Mathsoft
- Sun C, C++, and FORTRAN compilers
- Synopsys 3.2
- Xact 5.1
In addition to these packages, the PRIP group has accumulated
over the years a number of useful programs written in the course of
research projects.
Datasets
The PRIP lab has online a number of useful datasets:
- USPS Set 1-08/92 Address Image Set
- University of Washington English Document Image Database
I, Volumes 1 and 2
- Concordia University character database
- Pat Flynn's range images
- Face databases: Danno, MIT, and Weizmann
- NIST Special Databases 4 and 9 (MFCP and FIGS
fingerprints)
Current Research Projects
Abstracts of current research projects with recent papers. Theses
are listed separately at the end.
3-D Surface and Motion From Images
Two types of problems are being addressed. For monocular
sequences, the task is to compute 3-D motion and structure
parameters of the scene using the results from stereo matching and
image matching. Our contributions include: uniqueness of the
solution from line features as well as its stability; closed-form
solutions from planar scenes and their optimality; significant
improvement in the stability of the closed-form solution using point
features; introduction of optimization techniques so that the
sensitivity to noise approaches a theoretical lower bound
(Cramer-Rao bound); new methods for estimating errors in the
closed-form and the optimized solutions; and a dynamic model for
the modeling, estimation and prediction of 3-D motion using long
image sequences. The algorithms developed have been tested with
synthesized data and real-world images.
For stereo image sequences, the contributions include a closed-form
approximate matrix-weighted solution for motion and structure
from consecutive stereo image pairs, which is better than existing
solutions based on feature points with typical stereo setups; and a
recursive-batch technique for dealing with long stereo image
sequences, which takes advantage of two very different schemes:
Kalman filtering and batch processing.
We have also developed algorithms to solve the correspondence
problem (binocular stereo) and the structure from motion problem.
Our first method is based on multiple image attributes and is one of
the first algorithms that can deal with large image disparities and
compute dense 3-D depth maps. Another method that we have
developed is based on the windowed Fourier Phase (WFP). The
WFP is quasi-linear and spatially dense, with spatial period and
slope controlled by the selected frequency. The WFP includes the
zero-crossings and the peaks as special cases, but it contains
additional information essential for stable matching. Theoretically,
the WFP is complete in representing the signals up to a
multiplicative constant. The implementation of the matching
algorithm resembles that of a neural network and is well suited for
commercially available parallel frame-rate hardware. Experiments
have achieved good results with random-dot images and natural
images, used either as stereograms or as consecutive views of a
moving scene.
- N. Cui, J. Weng, and P. Cohen, "Extended structure and
motion analysis from monocular image sequences," Computer
Vision, Graphics, and Image Processing: Image Understanding,
vol. 59, March 1994.
- J. Weng, "Image matching using the windowed Fourier phase,"
International Journal of Computer Vision, vol. 11, no. 3,
pp. 211-236, 1993.
- J. Weng, "Windowed Fourier phase: completeness and signal
reconstruction," IEEE Transactions on Signal Processing,
vol. 41, pp. 657-666, February 1993.
- J. Weng and T. S. Huang, "3-D motion analysis from image
sequences using point correspondences," in Handbook of
Pattern Recognition and Computer Vision (C. H. Chen, L. F.
Pau, and P. S. Wang, Eds.), World Scientific, 1993.
- J. Weng, T. S. Huang, and N. Ahuja, Motion and Structure
from Image Sequences. Springer-Verlag, 1993.
- J. Weng, N. Ahuja, and T. S. Huang, "Optimal motion and
structure estimation," IEEE Transactions on Pattern Analysis
and Machine Intelligence, vol. 15, pp. 864-884, September 1993.
- J. Weng, P. Cohen, and N. Rebibo, "Motion and structure
estimation from stereo image sequences," IEEE Transactions
on Robotics and Automation, vol. 8, pp. 362-382, June 1992.
- J. Weng, T. S. Huang, and N. Ahuja, "Motion and structure
from line correspondences: closed-form solution, uniqueness,
and optimization," IEEE Transactions on Pattern Analysis and
Machine Intelligence, vol. 14, pp. 318-336, March 1992.
- J. Weng, N. Ahuja, and T. S. Huang, "Motion and structure
from point correspondences: planar surfaces," IEEE
Transactions on Signal Processing, vol. 39, pp. 2691-2717,
December 1991.
- J. Weng and P. Cohen, "Robust motion estimation using stereo
vision," in Proceedings of the IEEE International Workshop on
Robust Computer Vision, (Seattle, WA), pp. 367-388, October
1990.
- J. Weng, T. S. Huang, and N. Ahuja, "Motion and structure
from two perspective views: algorithm, error analysis and error
estimation," IEEE Transactions on Pattern Analysis and
Machine Intelligence, vol. 11, pp. 451-476, May 1989.
- J. Weng, T. S. Huang, and N. Ahuja, "3-D motion estimation,
understanding and prediction from noisy image sequences,"
IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 9, no. 3, pp. 370-389, 1987.
Applications of Pattern Recognition and Image Processing
Many projects have been undertaken in cooperation with faculty
outside the Department of Computer Science. In addition to
providing solutions to specific problems, such projects help create a
large database of images useful in verifying our pattern recognition
and image processing algorithms. Current areas of research are in
remote sensing for land use planning, measurement of root systems
for analyzing plant growth, analysis of bacterial culture images,
detection of structures in NMR brain scans, processing of technical
drawings, classification of fingerprints, analysis of sequences of
turbulent flow images, and precise measurement of the human body.
- N. K. Ratha, S. Chen, and A. K. Jain, "Adaptive flow
orientation based feature extraction in fingerprint images,"
Pattern Recognition. To appear.
- K. Karu and A. K. Jain, "Fingerprint classification," Tech.
Rep. CPS-95-9, Michigan State University, 1995.
- A. S. Solberg, A. K. Jain, and T. Taxt, "Multiscale
classification of remotely sensed data: fusion of Landsat TM
and SAR images," IEEE Transactions on Geoscience and
Remote Sensing, vol. 32, pp. 768-778, July 1994.
- O. D. Trier and A. K. Jain, "Goal-directed evaluation of
binarization methods," in Proceedings of the NSF/ARPA
Workshop on Performance Versus Methodology in Computer
Vision, (Seattle, WA), pp. 206-217, June 1994.
- M.-P. Dubuisson, A. K. Jain, and M. K. Jain, "Segmentation
and classification of bacterial culture images," Journal of
Microbiological Methods, vol. 19, pp. 279-295, 1994.
- Q. Huang and G. Stockman, "Generalized tube model:
Recognizing 3-D elongated objects," in Proceedings of the IEEE
conference on Computer Vision and Pattern Recognition, (New
York, NY), June 1993.
- A. K. Jain and T. Newman, "Range-intensity histogram for
segmenting LADAR images," Pattern Recognition Letters,
vol. 13, pp. 41-56, January 1992.
- J. Ton, J. Sticklen, and A. K. Jain, "Knowledge-based
segmentation of Landsat images," IEEE Transactions on
Geoscience and Remote Sensing, vol. 29, pp. 222-232, March
1991.
- J. Ton and A. K. Jain, "Registering Landsat images using point
pattern matching," IEEE Transactions on Geoscience and
Remote Sensing, vol. 27, pp. 642-651, November 1989.
- A. K. Jain, Ed., Real-time Object Measurement and
Classification. Springer-Verlag, 1988.
Camera System Calibration
Two types of calibration problems are being considered, one is
about image position and the other is about image intensity. For the
former, new techniques have been developed to compensate for both
radial and tangential distortions in a camera lens and to determine
the spatial relationships between the camera and the 3-D world. The
experiments have shown that the distortion compensation leads to a
significant improvement in accuracy. A new normalized measure has
been introduced that can be used to objectively evaluate and
compare the accuracies of various calibration techniques, despite the
parameter differences among the camera systems. Intensity
calibration is necessary because of the peripheral attenuation in
images due to optical lens, whose effect typically results in darker
corners in images. We have developed a model for such peripheral
attenuation. An intensity calibration method has also been
developed to compensate the intensity peripheral attenuation.
- J. Weng, "Calibration for peripheral attenuation in intensity
images," in Proceedings of the First International Conference on
Image Processing, (Austin, Texas), pp. 992-996, November 1994.
- J. Weng, P. Cohen, and M. Herniou, "Camera calibration with
distortion models and accuracy evaluation," IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 14,
pp. 965-980, October 1992.
- J. Weng, P. Cohen, and M. Herniou, "Stereo camera calibration
with nonlinear corrections," in Proceedings of the 10th
International Conference on Pattern Recognition, (Atlantic City,
NJ), pp. 246-253, June 1990.
Comprehensive Learning Theory and SHOSLIF
Comprehensive visual learning is the treatment of theories and
techniques for computer vision systems to automatically learn to
understand comprehensive visual information with minimal
human-imposed rules about the visual world. The concept of
comprehensive learning here implies two coverages: comprehensive
coverage of visual world and comprehensive coverage of vision
algorithm. This project investigates reasons for the shortcomings of
currently prevailing approaches to computer vision and introduces
the promising direction of comprehensive learning towards
overcoming these difficulties. The SHOSLIF (Self Organizing
Hierarchical Optimal Subspace Learning and Inference Framework)
is a framework that aims to provide a unified theory and
methodology for comprehensive visual learning. Its objective is not
just to attack a particular vision problem, but a wide variety of
vision problems. It addresses critical problems such as how to
automatically select the most useful features; how to automatically
organize visual information using a coarse-to-fine space partition
tree that results in a very low, logarithmic time complexity for
retrieving data from a large visual knowledge base, and how to
achieve invariance based on learning. This framework has been used
to build the following systems: SHOSLIF-O, SHOSLIF-M,
SHOSLIF-N, and SHOSLIF-R. The predecessor of the SHOSLIF
is the Cresceptron system.
- J. Weng, N. Ahuja, and T. S. Huang, "Learning recognition and
segmentation using the cresceptron," International Journal of
Computer Vision. To appear.
- J. Weng, "Toward comprehensive visual learning," in Early
Visual Learning (S. K. Nayar and T. Poggio, Eds.), New York:
Oxford University Press, 1995. To appear.
- J. Weng, "On comprehensive visual learning," in NSF/ARPA
Workshop on Performance versus Methodology in Computer
Vision, (Seattle, WA), pp. 152-166, June 1994. Invited paper.
- J. Weng, "SHOSLIF: A framework for object recognition from
images," in Proceedings of the IEEE International Conference on
Neural Networks, (Orlando, FL), pp. 4204-4209, June 1994.
Invited paper.
- J. Weng, "SHOSLIF: A learning system for vision and control,"
in Proceedings of the IEEE Annual Workshop on Architectures
for Intelligent Control Systems, (Columbus, Ohio), August 1994.
Invited paper.
- J. Weng, N. Ahuja, and T. S. Huang, "Learning recognition and
segmentation of 3-D objects from 2-D images," in Proceedings of
the 4th International Conference on Computer Vision, (Berlin,
Germany), pp. 121-128, May 1993.
Document Image Analysis
We have shown that the general framework of using Gabor filters to
characterize image texture is applicable in several different document
image analysis problems. In particular, we have considered the
following three problems: text-graphics separation, address block
location, and bar code localization. In each one of these problems,
the text content or the bar code in the image is considered to define
a unique texture that can be easily characterized by a small number
of Gabor filters. Both supervised and unsupervised methods have
been used to identify regions of text or bar code in the input
document images. The same filter parameters have been used in all
the three problem domains. Experimental results demonstrate the
generality and effectiveness of our approach for segmentation and
classification of document images. Recent work emphasizes skew
detection, separating handwritten and machine printed characters,
and page layout segmentation.
- O. Trier, T. Taxt, and A. K. Jain, "Data capture from maps
based on gray level topographic analysis," in Proceedings of the
International Conference on Document Analysis and
Recognition, (Montreal), 1995. To appear.
- Y. Zhong, K. Karu, and A. K. Jain, "Locating text in complex
color images," Pattern Recognition, 1995. To appear.
- A. K. Jain and Y. Chen, "Address block location using color and
texture analysis," CVGIP: Image Understanding, September
1994.
- A. K. Jain and Y. Chen, "Bar code localization using texture
analysis," in Proceedings of the 2nd Conference on Document
Analysis and Recognition, (Tsukuba City, Japan), pp. 41-44,
October 1993.
- A. K. Jain and S. Bhattacharjee, "Address block location on
envelopes using Gabor filters," Pattern Recognition, vol. 25,
pp. 1459-1477, 1992.
- A. K. Jain and S. Bhattacharjee, "Text segmentation for
automatic document processing," Machine Vision and
Application, vol. 5, pp. 169-184, 1992.
- A. K. Jain, S. Bhattacharjee, and Y. Chen, "On texture in
document images," in Proceedings of the IEEE conference on
Computer Vision and Pattern Recognition, (Urbana, IL),
pp. 677-680, June 1992.
- T. Taxt, P. J. Flynn, and A. K. Jain, "Segmentation of
document images," IEEE Transactions on Pattern Analysis and
Machine Intelligence, vol. 11, pp. 1322-1329, December 1989.
Face and Object Recognition from Intensity Images
The objective of this project is to recognize and segment objects
from images, using the SHOSLIF approach. Our system uses the
theories of optimal projection for optimal feature selection and a
hierarchical structure for low computational complexity. The system
can proceed under a supervised, unsupervised, or hybrid learning
mode. In the supervised mode, a hierarchy of class labels is provided
with each training image. No class labels are given under the
unsupervised learning mode, and some training images are labeled in
the hybrid mode. In a preliminary experiment, we have trained the
system on a diverse set of objects from natural scenes, ranging from
human faces to street signs to aerial photographs. Eight hundred
images were used for training, 712 human face images (356 classes)
and 82 other objects (41 classes). The disjoint test set consists of 78
faces and 38 other objects, among which 91% of the human faces
were correctly recognized by the top choice (in terms of similarity)
and for the other objects, the corresponding recognition rate is 87%.
Efforts are also underway toward communicating with computers
using facial expressions, authorization using recognition of learned
faces, and general modeling for presentation and animation. The
ability to sense faces in both 2-D and 3-D and the capability of
representing faces for computer manipulation are central to these
applications.
- P. Ballard and G. Stockman, "Control of a computer via facial
aspect," IEEE Transactions on Systems, Man and Cybernetics,
vol. 25, pp. 669-677, April 1995.
- D. Swets and J. Weng, "SHOSLIF-O: SHOSLIF for object
recognition (phase I)," Tech. Rep. CPS-94-64, Michigan State
University, December 1994.
Image Retrieval using Color, Shape and Texture
Content-based image retrieval has evolved as an interesting and
challenging area of research. Large image databases are used in a
number of applications, including criminal identification, multimedia
encyclopedia, geographic information system, online applications for
art and art history, medical image archives, and trademarks. With
the increase in the amount of image data, a fast and automatic
procedure is required for indexing and retrieval.
Most of the recent work has concentrated on developing a single
concise feature like color, shape or texture for retrieval. Single
feature-based indexing and retrieval might lack sufficient
discriminatory information and might not be able to accommodate
large orientation and scale changes. Different features have different
invariance properties and therefore, they should be integrated for
better retrieval results.
Our goal is to develop an efficient content-based image retrieval
scheme for an image database. We have built an image database of
trademark images. Our database currently consists of over 400
logotypes. We are able correctly to retrieve images from this
database on the basis of color, shape and texture.
- A. K. Jain and A. Vailaya, "Image retrieval using color and
shape," Tech. Rep. CPS-95-17, Michigan State University, 1995.
- A. K. Jain, Y. Zhong, and S. Lakshmanan, "Object matching
using deformable template," Tech. Rep. TR-CPS-94-66,
Michigan State University, 1995.
Industrial Inspection
Inspection is the process of determining if a product (part, object,
or item) deviates from a given set of specifications. Inspection
usually involves measurement of specific features such as assembly
integrity, surface finish, geometric dimensions, and so on. Automatic
inspection is desirable because human inspectors are not consistent,
and it has been reported that human visual inspection is at best
80% effective. This level of effectiveness can be achieved only if a
rigidly structured set of inspection checks is implemented. Many
inspection tasks are time-consuming or boring for humans to
perform. For example, human visual inspection has been estimated
to account for 10% or more of the total labor cost of manufactured
products. Some manufactured part defects are too subtle for
detection by a human eye. Machine vision results in lower labor
costs and improved quality. Finally, automatic inspection allows
objects to be inspected in environments unsafe for humans.
We have been working on a variety of inspection tasks, including
nondestructive testing of composite materials, automatic inspection
of surface finish, and locating defects in metal castings using range
images. We work closely with Innovision Corporation, which designs
and builds real-time inspection systems.
P. Uthaisombut, D. Guyer, and G. Stockman, "Using machine
vision to inspect cherries for cracks and bruises," Tech. Rep.
PRIP, Michigan State University, March 1995.
T. Newman and A. K. Jain, "A survey of automated visual
inspection," CVGIP: Image Understanding, vol. 61, pp. 231-262,
March 1995.
T. Newman and A. K. Jain, "Biderectional template matching
for 3-D CAD-based inspection," in Proceedings of the SPIE
conference on Machine Vision Applications in Industrial Vision,
(San Jose, CA), February 1994.
T. Newman and A. Jain, "CAD-based inspection of 3-D objects
using range images," in Proceedings of the 2nd CAD-Based
Vision Workshop, (Champion, PA), pp. 236-243, February 1994.
A. K. Jain and M.-P. Dubuisson, "Segmentation of X-ray and
C-scan images of fiber reinforced composite materials," Pattern
Recognition, vol. 25, pp. 257-270, March 1992.
A. K. Jain, M.-P. Dubuisson, and M. S. Madhukar, "Multisensor
fusion for nondestructive inspection of fiber reinforced composite
materials," in Proceedings of the 6th Conference of the American
Society of Composites, (Albany, NY), pp. 941-950, October 1991.
A. K. Jain, F. Farrokhnia, and D. Alman, "Texture analysis of
automotive finishes," in Proceedings of Vision90, (Detroit, MI),
pp. 8.1-8.16, November 1990.
Integration of Vision Modules
The issue of integration of vision modules in a total system context
is being addressed. Individual cues from visual modules are fallible
and often ambiguous. As a result, only integrated vision systems
can be expected to give reliable performance in practice. The design
of such systems is challenging because each vision module works
under different and possibly conflicting sets of assumptions. We
have proposed and implemented a multiresolution system that
integrates perceptual grouping, segmentation, stereo, shape from
shading, and line labeling modules. We demonstrate the efficacy of
our approach using images of several different realistic scenes. The
output of the integrated system is shown to be relatively insensitive
to the constraints imposed by the individual modules. The
numerical accuracy of the recovered depth is assessed in the case of
synthetically generated data. Finally, we have quantitatively
evaluated our approach by reconstructing geons from the depth data
obtained from the integrated system. Presently, we are exploring
the following enhancements to the existing implementation:
inclusion of more feedback paths, inclusion of more vision modules,
and a more objective evaluation of our results.
- D. Trytten and M. Tuceryan, "The construction of labeled line
drawings from intensity images," Pattern Recognition, vol. 28,
no. 2, pp. 171-198, 1995.
- S. Pankanti and A. K. Jain, "Integrating vision modules: Stereo,
shading, grouping, and line labeling," IEEE Transactions on
Pattern Analysis and Machine Intelligence, 1995. To appear.
- S. Pankanti, A. K. Jain, and M. Tuceryan, "On integration of
vision modules," in Proceedings of the IEEE conference on
Computer Vision and Pattern Recognition, (Seattle, WA), June
1994.
- D. Trytten and M. Tuceryan, "Segmentation and grouping of
object boundaries using energy minimization," in Proceedings of
the IEEE conference on Computer Vision and Pattern
Recognition, (Maui, HI), pp. 730-731, June 1991.
Autonomous Navigation of Mobile Robots
Our research in vision-guided mobile robots includes object
recognition, path planning, pose estimation, and sensor fusion. The
lab has an autonomous robot, called ROME, built upon a
LABMATE mobile robot base from Transitions Research
Corporation. This platform serves as a testbed for experiments
utilizing infrared object detection, ultrasound range finding, and
stereo vision. An onboard Sun SPARCstation with SunVideo
hardware and extra serial and parallel ports for controlling the
robot and receiving sensor data provides the computing power.
Our current work uses the SHOSLIF framework for autonomous
navigation. The task is to control a mobile robot to navigate
autonomously in an unstructured (i.e., unknown) environment based
on only visual images. No active sensors, such as sonar or infrared
proximity sensors, are necessary. The navigation control signals are
used to correct heading direction, speed and step distance. In the
learning phase, ROME was manually controlled to take pictures at
typical positions of a hallway section for training. The intended
control signal associated with each scene was also recorded as a
desired output. ROME went through three training drives inside a
campus building, during which a total of 363 training images was
taken, 280 of which were taken from two straight hallways and 83
from a turn. In the test phase, we let ROME navigate
autonomously along three straight hallways and two turns, including
two trained hall ways and one trained turn. In more than 30 runs,
ROME successfully navigated along straight sections and turns that
it had not learned.
- S. Chen and J. Weng, "SHOSLIF-N: SHOSLIF for autonomous
navigation (phase I)," Tech. Rep. CPS-94-62, Michigan State
University, December 1994.
- H. Dulimarta and A. K. Jain, "Mobile robot localization in
indoor environment," in Proceedings of the 3rd International
Conference on Automation, Robotics, and Computer Vision,
(Singapore), November 1994.
- J. Courtney and A. K. Jain, "Mobile robot localization via
classification of multisensor maps," in Proceedings of the IEEE
International Conference on Robotics and Automation, (San
Diego), May 1994.
- H. Dulimarta and A. K. Jain, "Modular agents for robot
navigation," in Proceedings of the SPIE Conference on Sensor
Fusion VI, vol. 2059, (Boston, MA), September 1993.
Markov Random Field Models
We have examined the role of Markov Random Field (MRF) models
in image segmentation and image synthesis. In addition to image
modeling, MRF models are useful for incorporating contextual
information in decision making. We are particularly interested in
using MRF models for texture classification and segmentation, and
sensor fusion. Two of our main research thrusts in this area are
parameter estimation and parallel implementation.
- S. Lakshmanan, A. K. Jain, and Y. Zhong, "Multi-resolution
image representation using Markov random fields," in
Proceedings of the First International Conference on Image
Processing, vol. III, (Austin, TX), pp. 855-859, November 1994.
- R. Chellappa and A. K. Jain, Eds., Markov Random Fields:
Theory and Applications. New York: Academic Press, 1993.
- B. Gunsel and A. K. Jain, "Visual surface reconstruction using
stochastic models," in Proceedings of the 11th International
Conference on Pattern Recognition, (The Hague), pp. 343-346,
August 1992.
- J. Mao and A. K. Jain, "Texture classification and
segmentation using multiresolution simultaneous autoregressive
models," Pattern Recognition, vol. 25, no. 2, pp. 173-188, 1992.
- S. Nadabar and A. K. Jain, "Parameter estimation in MRF line
process models," in Proceedings of the IEEE conference on
Computer Vision and Pattern Recognition, (Urbana, IL),
pp. 528-533, June 1992.
- S. Nadabar and A. K. Jain, "Edge detection and labeling by
fusion of intensity and range images," in Proceedings of SPIE:
Applications of Artificial Intelligence X, vol. 1708, (Orlando,
FL), pp. 108-119, April 1992.
- A. K. Jain and S. Nadabar, "MRF model-based segmentation
of range images," in Proceedings of the 3rd International
Conference on Computer Vision, (Osaka, Japan), pp. 667-671,
December 1990.
- R. C. Dubes, A. K. Jain, S. G. Nadabar, and C. C. Chen,
"MRF model-based algorithms for image segmentation," in
Proceedings of the 10th International Conference on Pattern
Recognition, (Atlantic City, NJ), pp. 808-814, 1990.
- R. C. Dubes and A. K. Jain, "Random field models in image
analysis," Journal of Applied Statistics, vol. 16, no. 2,
pp. 131-164, 1989.
- R. C. Dubes and C. C. Chen, "Experiments in fitting discrete
Markov random fields to textures," in Proceedings of the IEEE
conference on Computer Vision and Pattern Recognition, (San
Diego, CA), June 1989.
Matching Rigid Moving Objects
In this project we are interested in developing methods for tracking
rigid moving objects having arbitrary curved surfaces. Intended
applications for this project include intelligent traffic systems,
security systems, and general robot vision. Motion of the moving
objects in a sequence of color images is used to perform image
segmentation and boundary extraction. Motion-based and
color-based segmentations are integrated to obtain a reliable contour
of the object. A 3-D object is modeled by a set of different 2-D
silhouettes. The silhouette of the object observed from any given
viewpoint is derived by the curvature method of Basri and Ullman.
The derived silhouette is then fitted to the observed silhouette to
determine the object pose: fitting is carried out by Newton's
method for nonlinear least-squares minimization of fitting
parameters. Two different approaches to matching are used. One
approach derives salient local features and uses them as matching
primitives. In the second approach, correspondence is guided by
template matching, where the similarity measure is based on the
minimization of the overall Euclidean distance between the derived
silhouette and the observed silhouette. Some of these algorithms
have been successfully tested with images of moving vehicles on
highway ramps and city streets.
- M.-P. Dubuisson and A. K. Jain, "Contour extraction of moving
objects in complex outdoor scenes," International Journal of
Computer Vision, vol. 14, pp. 83-105, 1995.
- J. L. Chen and G. Stockman, "Determining pose of 3-D objects
with curved surfaces," IEEE Transactions on Pattern Analysis
and Machine Intelligence, 1995. To appear.
- M.-P. Dubuisson, A. K. Jain, and W. C. Taylor, "A vision-based
vehicle matching system," in IEEE Symposium on Intelligent
Vehicles, (Paris, France), pp. 266-271, 1994.
- M.-P. Dubuisson and A. K. Jain, "Fusing color and edge
information for object matching," in Proceedings of the First
International Conference on Image Processing, vol. III, (Austin,
TX), pp. 982-986, 1994.
- M.-P. Dubuisson and A. K. Jain, "A modified Hausdorff distance
for object matching," in Proceedings of the 12th International
Conference on Pattern Recognition, (Jerusalem, Israel),
pp. 566-568, October 1994.
- M.-P. Dubuisson and A. K. Jain, "2-D matching of 3-D moving
objects in color outdoor scenes," in Proceedings of the IEEE
conference on Computer Vision and Pattern Recognition,
(Seattle, WA), pp. 887-891, June 1994.
- J. L. Chen, G. Stockman, and K. Rao, "Recovering and tracking
pose of curved 3-D objects from 2-D images," in Proceedings of
the IEEE conference on Computer Vision and Pattern
Recognition, (New York, NY), June 1993.
- M.-P. Dubuisson and A. K. Jain, "Object contour extraction
using color and motion," in Proceedings of the IEEE conference
on Computer Vision and Pattern Recognition, (New York, NY),
pp. 471-476, June 1993.
Modeling and Recognition of 3-D Objects
We are investigating extraction and evaluation of features for
recognition of three-dimensional objects and construction of object
models. The major thrusts of the research are: sensing and feature
extraction, modeling of 3-D objects, recognition and pose estimation
using matching of object and model features, and integrating
mechanical CAD techniques in object modeling. This research will
have impact in manufacturing environments for automation of
bin-picking, inspection, assembly, and sorting.
We are currently investigating a new approach for the
representation and recognition of 3-D objects with free-form surfaces
from dense range data. Our surface representation scheme, cosmos,
describes an object concisely in terms of maximal surface patches of
constant shape index. These maximal patches are mapped onto the
unit sphere via their orientations, and aggregated via shape spectral
functions. Surface properties such as area, curvedness, and
connectivity that are required to capture local and global
information are also built into the representation. The scheme yields
not only a meaningful and rich description useful for recovering the
object, but provides a set of powerful indexing primitives for
matching. The intended application of this research is automatic
recognition of manufactured and natural objects with free-form
surfaces.
- C. Dorai and A. K. Jain, "COSMOS a representation scheme
for free-form surfaces," in Proceedings of the 5th International
Conference on Computer Vision, (Boston), June 1995.
- J. Mao, A. Jain, and P. Flynn, "Integration of multiple feature
groups and multiple views into interpretation table-based 3-D
object recognition system," in CVGIP: Image Understanding,
May 1995. To appear.
- J. L. Chen and G. Stockman, "Indexing to model aspects using
invariant contour features," Tech. Rep. PRIP, Michigan State
University, November 1994.
- C. Dorai, J. Weng, and A. K. Jain, "Optimal registration of
multiple range views," in Proceedings of the 12th International
Conference on Pattern Recognition, vol. I, (Jerusalem, Israel),
pp. 569-571, October 1994.
- N. S. Ra ja and A. K. Jain, "Obtaining generic parts from range
data using a multi-view representation," in CVGIP: Image
Understanding, vol. 60, pp. 44-64, July 1994.
- J. L. Chen and G. Stockman, "Matching curved 3-D object
models to 2-D images," in Proceedings of the 2nd CAD-Based
Vision Workshop, (Champion, PA), pp. 210-218, February
1994.
- T. Newman, P. Flynn, and A. K. Jain, "Model-based
classification of quadric surfaces," CVGIP: Image
understanding, vol. 58, pp. 235-249, September 1993.
- S. Pankanti, C. Dorai, and A. K. Jain, "Robust feature
detection for 3-D object recognition," in Proceedings of 2031
SPIE Conference on Geometric Methods In Computer Vision
II, (San Diego, CA), pp. 366-377, July 1993.
- S. W. Chen and A. K. Jain, "Strategies of multi-view and
multi-matching for 3-D object recognition," CVGIP: Image
Understanding, vol. 57, no. 1, pp. 121-130, 1993.
- A. K. Jain and P. J. Flynn, Eds., 3-D Object Recognition
Systems. New York: Elsevier, 1993.
- G. Lee and G. Stockman, "Detecting wings in quadric surface
scenes," in Proceedings of SPIE: Applications of Artificial
Intelligence X, vol. 1708, (Orlando, FL), pp. 335-344, April
1992.
- N. S. Ra ja and A. K. Jain, "Recognizing geons from
superquadrics fitted to range data," Image and Vision
Computing, vol. 10, no. 3, pp. 179-190, 1992.
- P. Flynn and A. K. Jain, "3-D object recognition using
invariant feature indexing of interpretation tables," CVGIP:
Image Understanding, vol. 55, pp. 119-129, March 1991.
- P. Flynn and A. K. Jain, "CAD-based computer vision: From
CAD models to relational graphs," IEEE Transactions on
Pattern Analysis and Machine Intelligence, vol. 13,
pp. 114-132, February 1991.
- G. Stockman, G. Lee, and S. W. Chen, "Reconstructing line
drawings from wings: the polygonal case," in Proceedings of the
3rd International Conference on Computer Vision, (Osaka,
Japan), pp. 526-529, December 1990.
- G. Hu and G. Stockman, "3-D surface solution using structured
light and constraint propagation," IEEE Transactions on
Pattern Analysis and Machine Intelligence, vol. 11,
pp. 390-402, April 1989.
- A. K. Jain and R. Hoffman, "Evidence-based recognition of 3-D
objects," IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 10, pp. 783-802, November 1988.
- G. Stockman, "Object recognition and localization via pose
clustering," Computer Vision, Graphics, and Image Processing,
vol. 40, pp. 361-387, 1987.
Motion Event Recognition
The objective of this project is to understand temporal events such
as hand signs, facial expressions, lip motion during speech, human
body motions, as well as other events that involve more than one
object, such as, "Tom points to Jim." Clearly, understanding of
temporal events requires the capability of recognizing static objects
in conjunction with their changes. A preliminary investigation of
hand sign recognition has been conducted as the phase I of the
project. In this experiment 504 training samples were used which
contain 28 hand signs such as "angry," "any,' "boy," "yes," "cute,"
"fine," "funny," "girl," "happy," "hi," etc, from which the system
reached a correct recognition rate of 98% for a set of 504
independent test sequences. The approach employed is SHOSLIF
and thus the system is called SHOSLIF-M (SHOSLIF for motion
understanding).
- Y. Cui, , D. Swets, and J. Weng, "Learning-based hand sign
recognition using SHOSLIF-M," in Proceedings of the 5th
International Conference on Computer Vision, (Boston, MA),
June 1995.
- Y. Cui and J. Weng, "Learning-based hand sign recognition," in
Proceedings of the International Workshop on Automatic Face-
and Gesture-Recognition, (Zurich, Switzerland), June 1995.
- Y. Cui and J. Weng, "SHOSLIF-M: SHOSLIF for motion
understanding (phase I for hand sign recognition)," Tech. Rep.
CPS-94-68, Michigan State University, December 1994.
Parallel Algorithms for Computer Vision
Computer vision involves many processing algorithms that demand
an enormous amount of memory and computational resources. This
research project studies the exploitation of parallelism in computer
vision applications and the development of a parallel programming
environment for these applications.
We are studying parallel algorithms for computer vision using
coarse-grained approach on a workstation cluster using PVM, a
high level communication library. We have implemented distributed
algorithms for pattern clustering, motion, structure estimation from
image sequences, and edge detection and surface reconstruction
based on weak membrane models.
High performance custom computing platforms can be built easily
using field-programmable gate arrays (FPGAs) at an affordable
cost to achieve high performance index and fast prototyping. Splash
2 is a Xilinx 4010 FPGA-based array processor designed and
developed by Supercomputing Research Center. We are porting
many vision applications on Splash 2. A fingerprint matching
algorithm for rolled fingerprints has been successfully ported. For
many low-level vision applications such as smoothing, edge
detection, morphological operations, we have implemented a
generalized filter on Splash 2 with near-ASIC (application-specific
integrated circuit) level performance. Currently, we are focusing our
efforts to port two compute-intensive applications, namely, feature
extraction from fingerprint images and page layout segmentation on
Splash 2.
- N. K. Ratha, A. K. Jain, and D. Rover, "Fingerprint matching
on Splash 2," in Splash 2: FPGAs in a Custom Computing
Machine (D. Buell, J. Arnold, and W. Kleinfelder, Eds.), IEEE
Computer Society Press, 1995.
- N. K. Ratha, A. K. Jain, and D. Rover, "Convolution on Splash
2," in FCCM-95, 1995.
- S. Nadabar and A. K. Jain, "Bayesian approach to sensor fusion:
implementation on a Connection Machine (CM-2)," Pattern
Recognition, vol. 28, pp. 11-26, January 1995.
- D. Judd, N. K. Ratha, P. McKinley, J. J. Weng, and A. K. Jain,
"Parallel implementation of vision algorithms on workstation
clusters," in Proceedings of the 12th International Conference on
Pattern Recognition, (Jerusalem), October 1994.
- T. Newman, R. Enbody, and A. K. Jain, "3-D object
recognition: interpretation tree search on an MIMD machine," in
Proceedings of the 11th International Conference on Pattern
Recognition, (The Hague), pp. 337-340, August 1992.
- N. Ra ja, M. Tuceryan, and A. K. Jain, "Texture segmentation
on two high-performance computers," in Proceedings of the 10th
International Conference on Pattern Recognition, (Atlantic
City), pp. 601-605, June 1990.
Region Detection in Medical Images
The objective of this project is to develop a technique that is
reliable, adaptive, and versatile to solve the problem of region
detection in a relatively wide class of medical images. Learning is
essential in achieving this objective. Learning takes place in two
stages: learning for automatic selection of threshold values and
learning for automatic selection of the region of interest from
candidate regions in the attention map. The result from the second
stage is evaluated based on a learned cost measure and the outcome
is fed back to the first stage when necessary. This feedback enhances
the reliability of the entire system. Experiments have been
conducted to approximately locate the endocardium boundaries of
the left and right ventricles from gradient-echo MR images. Cardiac
CT images have also been used for testing.
- J. Weng, A. Singh, and M. Y. Chiu, "Learning-based ventricle
detection from cardiac MR and CT images," in Proceedings of
the IEEE Workshop on Biomedical Image Analysis, (Seattle,
WA), June 1994.
- J. Weng, A. Singh, and M. Y. Chiu, "Fully automatic ventricle
detection from cardiac MR images using machine learning," in
Proceedings of the SPIE conference on Medical Imaging,
(Newport Beach, CA), February 1994.
Statistical Pattern Recognition and Artificial Neural
Networks
We have investigated a number of problems dealing with classifier
design and exploratory pattern analysis. For example, we have
applied bootstrapping (a resampling technique) to the problems of
determining the number of clusters in a data set, calculating the
width of Parzen windows in density estimation, and classifier error
rate estimation. Other problems of interest include curse of
dimensionality, feature selection, decision tree design, tests for
randomness and multivariate normality, and cluster validity.
We are currently developing a systematic ANN design and training
methodology and its applications in pattern recognition and image
processing. The salient feature of our research is to draw upon
statistical pattern recognition techniques to solve some key aspects
of ANN design and learning procedures. We have studied the
relationship between the number of training samples and the
number of hidden nodes, designed an ANN to implement the
k-nearest neighbor decision rule, and have mapped several
multivariate data projection algorithms on appropriate networks.
- M. N. Murty and A. K. Jain, "Knowledge-based clustering
scheme for collection management and retrieval of library
books," Pattern Recognition, 1995. To appear.
- J. Mao and A. K. Jain, "Artificial neural networks for feature
extraction and multivariate data projection," IEEE
Transactions on Neural Networks, vol. 6, pp. 296-317, March
1995.
- J. Mao, K. Mohiuddin, and A. K. Jain, "Minimal network
design and feature selection through node pruning," in
Proceedings of the 12th International Conference on Pattern
Recognition, (Jerusalem), pp. 622-624, October 1994.
- A. K. Jain and K. Karu, "Automatic filter design for texture
discrimination," in Proceedings of the 12th International
Conference on Pattern Recognition, (Jerusalem), pp. 454-458,
October 1994.
- A. K. Jain and J. Mao, "Neural networks and pattern
recognition," in Computational Intelligence for Imitating Life
(Zurada, Marks, and Robinson, Eds.), pp. 196-212, IEEE
Press, 1994.
- I. Sethi and A. K. Jain, Eds., Neural Networks and Statistical
Pattern Recognition. North Holland, 1993.
- J. Mao and A. K. Jain, "Regularization techniques in artificial
neural networks," in Proceedings of the World Congress on
Neural Networks, (Portland, OR), pp. 75-79, July 1993.
- A. K. Jain and J. Mao, "A k-nearest neighbor artificial neural
network classifier," in Proceedings of the International Joint
Conference on Neural Networks, (Seattle, WA), June 1991.
- S. Raudys and A. K. Jain, "Small sample size effects in
statistical pattern recognition," IEEE Transactions on Pattern
Analysis and Machine Intelligence, vol. 13, pp. 252-264, 1991.
- A. K. Jain and R. C. Dubes, Algorithms for Clustering Data.
Prentice-Hall, 1988.
- S. P. Smith and A. K. Jain, "A test to determine the
multivariate normality of a data set," IEEE Transactions on
Pattern Analysis and Machine Intelligence, vol. 10,
pp. 757-761, 1988.
- A. K. Jain and M. D. Ramaswami, "Classifier design with
Parzen windows," in Pattern Recognition in Practice III (E. S.
Gelsema and L. N. Kanal, Eds.), pp. 211-228, North Holland,
1988.
- A. K. Jain, "Pattern recognition," in Encyclopedia of Robotics
(R. C. Dorf, Ed.), pp. 1052-1063, Wiley, 1988.
Texture Analysis
Texture analysis is an important and useful area of study in
machine vision. Even though the diversity of natural and artificial
textures makes it impossible to give a universal definition of texture,
most natural surfaces exhibit texture and a successful vision system
must be able to deal with the textured world surrounding it. We
have been involved in different areas of texture analysis: texture
segmentation, texture classification, and texture synthesis using
techniques such as multi-channel filtering, fractal analysis, and
Markov random fields.
- A. K. Jain and K. Karu, "Learning texture discrimination
masks," IEEE Transactions on Pattern Analysis and Machine
Intelligence, 1995. To appear.
- A. S. Solberg, A. K. Jain, and T. Taxt, "Texture analysis of
SAR images: A comparative study," IEEE Transactions on
Geoscience and Remote Sensing, 1995. Under review.
- M.-P. Dubuisson and R. C. Dubes, "Efficacy of fractal features
in segmenting images of natural textures," Pattern Recognition
Letters, vol. 15, pp. 419-431, 1994.
- M. Tuceryan and A. K. Jain, "Texture analysis," in Handbook
of Pattern Recognition and Computer Vision (C. H. C. et al, Ed.),
pp. 235-276, World Scientific, 1993.
- P. P. Ohanian and R. C. Dubes, "Performance evaluation for
four classes of textural features," Pattern Recognition, vol. 25,
pp. 819-833, 1992.
- J. Mao and A. K. Jain, "Texture classification and
segmentation using multiresolution simultaneous autoregressive
models," Pattern Recognition, vol. 25, no. 2, pp. 173-188, 1992.
- A. K. Jain and F. Farrokhnia, "Unsupervised texture
segmentation using Gabor filters," Pattern Recognition, vol. 24,
pp. 1167-1186, November 1991.
- F. Farrokhnia and A. K. Jain, "A multi-channel filtering
approach to texture segmentation," in Proceedings of the IEEE
conference on Computer Vision and Pattern Recognition,
(Maui, HI), pp. 364-370, June 1991.
- M. Tuceryan and A. K. Jain, "Texture segmentation using
Voronoi polygons," IEEE Transactions on Pattern Analysis
and Machine Intelligence, vol. 12, pp. 211-216, February 1990.
- R. C. Dubes and C. C. Chen, "Experiments in fitting discrete
Markov random fields to textures," in Proceedings of the IEEE
conference on Computer Vision and Pattern Recognition, (San
Diego, CA), June 1989.
Transitory Image Sequences and Their Integration
A transitory image sequence is one in which no scene element is
visible through the entire sequence. When a camera system scans a
scene that cannot be covered by a single view, then the image
sequence is called transitory. This project deals with some major
theoretical and algorithmic issues associated with the task of
estimating structure and motion from transitory image sequences. It
is shown that integration with a transitory sequence has properties
that are very different from those with a non-transitory one. Two
representations, world-centered (WC) and camera-centered (CC),
behave very differently with a transitory sequence. The asymptotic
error rates indicate that one representation is significantly superior
to the other, depending on whether one needs camera-centered or
world-centered estimates. Using Cramer-Rao lower error bound, we
show that these error rates are not only the rates obtained by the
proposed algorithm, but also the best rates possible. Based on the
error rate analysis, we introduce an efficient "cross-frame"
estimation technique for the CC representation. For the WC
representation, our analysis indicates that a good technique should
be based on camera global pose instead of interframe motions. In
addition to testing with synthetic data, rigorous experiments were
conducted with real-image sequences taken by a fully calibrated
camera system. A comparison of the experimental results with the
ground truth has demonstrated that reliable structure information
can be obtained from transitory image sequences.
- J. Weng and Y. Cui, "Transitory image sequences and their
integration," Tech. Rep. CPS-95-4, Michigan State University,
March 1995.
- J. Weng, Y. Cui, N. Ahuja, and A. Singh, "Integration of
transitory image sequences," in Proceedings of the IEEE
conference on Computer Vision and Pattern Recognition,
(Seattle, WA), June 1994.
Vision-Guided Robot Manipulators
The objective of this project is to develop a highly adaptive method
for vision-based object manipulation in an unstructured
environment without requiring a complete description of the world
and explicit camera calibration. This project uses SHOSLIF-R,
SHOSLIF for robot manipulators. Unlike conventional robot
systems that depend very much on the availability of accurate global
position of the manipulator and objects, the system under
development learns, through interactive visual feedback, the
unknown and nonlinear relationships among the sensed objects, the
hand, and the visual sensors. The robot manipulator is equipped
with a visual recognition system called Cresceptron that is used to
recognize the objects and the robot hand from images. In the
learning phase an adaptive hierarchical network is automatically
generated to learn the hand-eye coordination as well as the objects.
In the performance phase the network controls the manipulator to
perform some tasks, such as reaching and picking up a learned
object and moving the object to a desired position.
- S. Howden and J. Weng, "Hand-eye coordinated learning using
hierarchical space tessellation," Tech. Rep. CPS-94-29, Michigan
State University, April 1994.
Recent Theses
Recent Ph.D and M.S. theses completed by graduate students in the
PRIP lab.
- Jin-Long Chen. Recognition and Tracking of Curved Objects,
Ph.D. thesis, 1995 (expected).
- Chitra Dorai. Representation and Recognition of 3-D Objects
with Free-Form Surfaces, Ph.D. thesis, 1995 (expected).
- Sharathcha Pankanti. Integration of Vision Modules, Ph.D.
thesis, 1995 (expected).
- Marie-Pierre Dubuisson. Segmentation and Matching of Moving
Vehicles from Complex Outdoor Scenes, Ph.D. thesis, 1995.
- Jian-Chang Mao. Design and Analysis of Neural Networks for
Pattern Recognition, Ph.D. thesis, 1994.
- Qian Huang. Hierarchical Token Grouping in Extracting
Tubular Objects, Ph.D. thesis, 1994.
- Hansye Dulimarta. Client-Server Control Architecture for
Robot Navigation, Ph.D. thesis, 1994.
- Timothy Newman. Experiments in 3D CAD-based Inspection
Using Range Images, Ph.D. thesis, 1993.
- Jonathan Courtney. Mobile Robot Localization Using Pattern
Classification Techniques, M.S. thesis, 1993.
- Steve Walsh. Indoor Robot Navigation Using a Symbolic
Landmark Map, Ph.D. thesis, 1992.
- Greg Lee. Reconstruction of Line Drawing Graphs from Fused
Range and Intensity Imagery, Ph.D. thesis, 1992.
- Sateesha Nadabar. Markov Random Field Contextual Models in
Computer Vision, Ph.D. thesis, 1992.
- Narayan S. Raja. Obtaining Generic Parts from Range Data
Using a Multi-View Representation, Ph.D. thesis, 1992.
- Deborah Trytten. The Construction of Labeled Line Drawings
From Intensity Images, Ph.D. thesis, 1992.
- Marie-Pierre Dubuisson. Multisensor Fusion for Nondestructive
Inspection of Fiber Reinforced Composite Structures, M.S.
thesis, 1991.
- Farshid Farrokhnia. Multichannel Filtering Techniques for
Texture Segmentation and Surface Quality Inspection, Ph.D.
thesis, 1990.
- Patrick Flynn. CAD Based Computer Vision: Modeling and
Recognition Strategies, Ph.D. thesis, 1990.
- Joe Miller. On Sequences of Operations in CAPP, Ph.D. thesis,
1990.
- Sie-Wang Chen. 3-D Representation and Recognition Using
Object Wings, Ph.D. thesis, 1989.
- J. Ton. Knowledge-based Segmentation of Landsat TM Images,
Ph.D. thesis, 1989.
- Gongzhu Hu. Three-Dimensional Scene Representation Using
Structured Light, Ph.D. thesis, 1988.
- Chaur-Chin Chen. Markov Random Fields in Image Analysis,
Ph.D. thesis, 1988.
Photo Collage
The final sheet contains sequences of images illustrating seven
different research projects.
For More Information
For information about current research projects in the PRIP lab,
please contact:
Chitra Dorai/Karissa Miller, PRIP Lab Managers
Department of Computer Science
A714 Wells Hall
Michigan State University
East Lansing, Michigan USA 48824-1027
Email: manager@pixel.cps.msu.edu
WWW: http://web.cps.msu.edu/prip
Click here to send your queries by e-mail.