<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Computer Vision and Robotics Laboratory</title>
	<atom:link href="http://vision.ai.uiuc.edu/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://vision.ai.uiuc.edu</link>
	<description></description>
	<lastBuildDate>Fri, 18 Sep 2009 16:45:41 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Video Shot Detection</title>
		<link>http://vision.ai.uiuc.edu/?p=1084</link>
		<comments>http://vision.ai.uiuc.edu/?p=1084#comments</comments>
		<pubDate>Fri, 08 May 2009 17:04:41 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Detection of photometric distribution discontinuities in video to locate shot changes]]></category>
		<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1084</guid>
		<description><![CDATA[<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_6_1_icon.gif" border="2" alt="" width="160" height="160" /></p>
</h2>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a href="http://www.uiuc.edu/ph/www/dugad">R Dugad</a>, K  Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given a video sequence, identify those frames that represent changes in the parts of the scene being imaged<span id="more-1084"></span></p>
<p align="justify">R. Dugad, K. Ratakonda and N. Ahuja, Robust Video Shot Change Detection, IEEE Workshop on Multimedia Signal Processing, Redondo Beach,&#8230;</p>]]></description>
			<content:encoded><![CDATA[<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_6_1_icon.gif" border="2" alt="" width="160" height="160" /></p>
</h2>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a href="http://www.uiuc.edu/ph/www/dugad">R Dugad</a>, K  Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given a video sequence, identify those frames that represent changes in the parts of the scene being imaged<span id="more-1084"></span></p>
<p align="justify">R. Dugad, K. Ratakonda and N. Ahuja, Robust Video Shot Change Detection, IEEE Workshop on Multimedia Signal Processing, Redondo Beach, CA, December 1998, 276-278.  <a href="http://vision.ai.uiuc.edu/publications/10.1.1.53.65.pdf">Full Text</a></p>
<p align="justify">R. Dugad, K. Ratakonda and N. Ahuja, Robust Video Shot       Change Detection, Indian Conference       on Computer Vision, Graphics and Image Processing, New Delhi, India,       December 1998, 358-363. <span style="font-family: Book Antiqua,Times New Roman,Times;"><a href="../../abstracts/pub6_6_1_a1298dugad.htm"></a> </span></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1084</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Video Denoising</title>
		<link>http://vision.ai.uiuc.edu/?p=1077</link>
		<comments>http://vision.ai.uiuc.edu/?p=1077#comments</comments>
		<pubDate>Fri, 08 May 2009 17:01:45 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Efficient spatiotemporal filtering for video denoising]]></category>
		<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1077</guid>
		<description><![CDATA[<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_5_1_icon.jpg" border="2" alt="" width="160" height="131" /></p>
</h2>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a href="../../dugad/">R Dugad</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Combine spatial and temporal filtering to suppress noise in each frame of a video sequence</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify"><span id="more-1077"></span></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">R. Dugad and N. Ahuja, Video Denoising by Combining Kalman and Wiener Estimates, International Conference on Image Processing, Kobe, Japan, Oct. 1999, IV-152-156  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=819568">Abstract</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
&#8230;</p>]]></description>
			<content:encoded><![CDATA[<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_5_1_icon.jpg" border="2" alt="" width="160" height="131" /></p>
</h2>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a href="../../dugad/">R Dugad</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Combine spatial and temporal filtering to suppress noise in each frame of a video sequence</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify"><span id="more-1077"></span></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">R. Dugad and N. Ahuja, Video Denoising by Combining Kalman and Wiener Estimates, International Conference on Image Processing, Kobe, Japan, Oct. 1999, IV-152-156  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=819568">Abstract</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1077</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Linear Transforms over arbitrary supports</title>
		<link>http://vision.ai.uiuc.edu/?p=1072</link>
		<comments>http://vision.ai.uiuc.edu/?p=1072#comments</comments>
		<pubDate>Fri, 08 May 2009 16:55:59 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Computationally efficient transform domain approaches to image and video denoising, changing resolution and watermarking, both using uncompressed data and directly on the compressed data]]></category>
		<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1072</guid>
		<description><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"></p>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
</p><p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_4_3_icon.jpg" border="2" alt="" width="160" height="160" /></p>
<p><a href="../../dugad/">R Dugad</a>, K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">We present a novel iterative approach to define any multidimensional linear transform over an arbitrary shape given that we know its definition over a hypercube</p>
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
</p><p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
</p><p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center"><span id="more-1072"></span></p>
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
</p><p>LINEAR TRANSFORMS OVER ARBITRARY SUPPORTS</p>
<p>Krishna Ratakonda and Narendra Ahuja<br />
Beckman Institute for Advanced Science&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><!--mstheme--></p>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_4_3_icon.jpg" border="2" alt="" width="160" height="160" /></p>
<p><a href="../../dugad/">R Dugad</a>, K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">We present a novel iterative approach to define any multidimensional linear transform over an arbitrary shape given that we know its definition over a hypercube</p>
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center"><span id="more-1072"></span></p>
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
<p>LINEAR TRANSFORMS OVER ARBITRARY SUPPORTS<!--mstheme--><!--mstheme--></p>
<p>Krishna Ratakonda and Narendra Ahuja<br />
Beckman Institute for Advanced Science and Technology<br />
Department of Electrical and Computer Engineering<br />
University of Illinois at Urbana-Champaign</p>
<p><!--mstheme--> Problem Definition<!--mstheme--></p>
<blockquote><p>In order to apply a multidimensional linear transform over an arbitrarily shaped support, the usual practice is to fill out the support to a hypercube by zero padding. The problem that we tackle is: how do we redefine the transform over an arbitrary shaped region suited to a given application? We present a novel iterative approach to define any multidimensional linear transform over an arbitrary shape given that we know its definition over a hypercube.</p>
<p>The proposed solution is :</p>
<blockquote>
<blockquote><p><!--mstheme--><!--msthemelist--><!--mstheme--> <!--mstheme--></p>
<p><!--msthemelist--></p>
<table border="0" cellspacing="0" cellpadding="0" width="100%"><!--msthemelist--></p>
<tbody>
<tr>
<td width="42" valign="top"><img src="../../_themes/cvrl-expedition/expbul2a.gif" alt="" hspace="15" width="12" height="12" /></td>
<td width="100%" valign="top"><!--mstheme-->Extensible to all possible shapes of supports</p>
<p><!--mstheme--><!--msthemelist--></td>
</tr>
<p><!--msthemelist--></p>
<tr>
<td width="42" valign="baseline"><img src="../../_themes/cvrl-expedition/expbul2a.gif" alt="" hspace="15" width="12" height="12" /></td>
<td width="100%" valign="top"><!--mstheme-->Adaptable to the needs of a particular application</p>
<p><!--mstheme--><!--msthemelist--></td>
</tr>
</tbody>
</table>
<p><!--mstheme--></p></blockquote>
</blockquote>
</blockquote>
<p><!--mstheme--> Applications:<!--mstheme--></p>
<ol>
<li>Segmentation based Image Compression.</li>
<li>Shape based Video Encoding.</li>
<li>Region Merging.</li>
</ol>
<p><!--mstheme--> Details:<!--mstheme--></p>
<blockquote><p>Discrete linear transforms in two (or more) dimensions are in most cases defined over a rectangular (hypercubic) support. The usual practice when we want to apply the transform over an arbitrarily shaped support is to fill out the rest of the support with zeros to make up the rectangle (hypercube) and then use the natural definition of the transform over a rectangle (hypercube). This is an extension of the 1-D case where we fill out an arbitrary-length data set with zeros to form a data set of length  2neither to  increase the computational speed (through FFTs for Fourier transforms) or to satisfy the definition of the transform (in the case of dyadic wavelets). This, however, does not lead to a satisfactory definition of the linear transform in two or more dimensions for many applications. An example can be used to illustrate this point. The Fourier transform of a function that is constant on a circular support in 2-D is a Jinc. As can be seen from the figure, the magnitude of the Fourier coefficients does not have any relation to the smoothness of the function, which is a  constant within its support.</p>
<p><img src="../../6_4_3/lin1.jpg" alt="" width="534" height="318" /><br />
Fig 1 &#8211; Fourier Transform of a circle.</p>
<p>The above discussion leads us to the following question: What should be the values attributed to the sample points which lie within the rectangle but not within the support of the function? The answer is evidently not unique and depends upon the application. With each possible choice of the values for the pixels which lie outside the support but within the rectangular (hypercubic) region, we can associate a possible function-transform pair. The aim of the proposed scheme is to algorithmically constrain the choice of the possible function-transform pairs in such a way as to lead to the optimal choice of the function-transform pair for the particular application under consideration.</p>
<p>The applications that we consider assume that we have a smooth 2-D function defined on an arbitrarily shaped, connected support; we would like to define the free pixels (the pixels within the rectangular region but outside the support) so as to minimize the high frequency content in the Fourier domain. The problem is formulated in terms of a Projection onto Convex Sets formalism and incorporates a few more constraints such as the bounded variation of the free pixels . The variation in the free pixels is controlled by a parameter Z.</p></blockquote>
<p><!--mstheme--> Results:<!--mstheme--></p>
<p><img src="../../6_4_3/lin2a.jpg" alt="" width="255" height="256" /> <img src="../../6_4_3/lin2b.jpg" alt="" width="256" height="256" /> <img src="../../6_4_3/lin2c.jpg" alt="" width="256" height="256" /><br />
Fig 2a &#8211; Smooth Region                                    Fig 2b &#8211; Fourier transform with Z = 2               Fig 2b &#8211; Fourier transform with Z = 16</p>
<blockquote><p>Results for the Fourier transform as Z is varied are presented . The example function is a smooth region taken out of a natural image (Lena). As can be seen, an increase of Z allows more variation in the free pixels in the spatial domain, thus destroying the ad hoc shape information (the pixels within the support are unchanged in all images). Similar results maybe obtained if we replace the Fourier transform with either DCT or wavelets.</p></blockquote>
<p><!--mstheme--> Publications:<!--mstheme--></p>
<p><!--mstheme--><!--msthemelist--></p>
<blockquote>
<blockquote><p>Ratakonda, K.; Ahuja, N.</p>
<p>Acoustics, Speech, and Signal Processing, 1997. ICASSP-97.,</p>
<p>1997 IEEE International Conference on ,</p>
<p>Volume: 4 , 1997</p>
<p>Page(s): 3041 -3044 vol.4</p></blockquote>
</blockquote>
<p><!--mstheme--></p>
<p><!--more--></p>
<p>K. Ratakonda and N. Ahuja, Discrete Multi-Dimensional Linear Transforms over Connected Arbitrarily Shaped Supports, IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. 4, Munich, Germany, April 1997, 3041-3044. Processing, Geneva, Switzerland,  September 1996, 81-84. <a href="../../abstracts/pub6_4_1_a0497krishna.htm">Abstract </a></p>
<p>R. Dugad and N. Ahuja, A Fast Scheme for Downsampling and Unsampling in the DCT Domain, International Conference on Image Processing, Kobe, Japan, Oct. 1999, II-909-913. <a href="../../abstracts/pub6_4_1_a1099dugad.htm">Abstract </a></p>
<p>R. Dugad and N. Ahuja, A Fast Scheme for Altering Resolution in the Compressed Domain, EEE Conference on Computer Vision and Pattern Recognition, Ft. Collins, CO, June 1999, I-213-218. <a href="../../abstracts/pub6_4_1_a0699dugad.htm">Abstract </a></p>
<p>R. Dugad and N. Ahuja, A Fast Scheme for Image Size Change in the Compressed Domain, IEEE Trans. on Circuits and Systems for Video Technology, Vol. 11, No. 4, April 2001, 461-474. <a href="../../abstracts/pub6_4_1_a0401dugad.htm">Abstract </a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1072</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Transform-Domain Watermarking</title>
		<link>http://vision.ai.uiuc.edu/?p=1070</link>
		<comments>http://vision.ai.uiuc.edu/?p=1070#comments</comments>
		<pubDate>Fri, 08 May 2009 16:54:41 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Computationally efficient transform domain approaches to image and video denoising, changing resolution and watermarking, both using uncompressed data and directly on the compressed data]]></category>
		<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1070</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_4_2_icon.jpg" border="2" alt="" width="160" height="158" /></p>
<p> </p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"></h2>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a href="../../dugad/">R Dugad</a>, K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">To develop a watermarking method that does not use the original image for watermark detection and resilient to image quality degradation</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify"><span id="more-1070"></span>K. Ratakonda, R. Dugad and N. Ahuja, Digital Image Watermarking: Issues in Resolving Rightful Ownership, Proc. International&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_4_2_icon.jpg" border="2" alt="" width="160" height="158" /></p>
<p><!--mstheme--> <!--mstheme--></p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><!--mstheme--><!--mstheme--></h2>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a href="../../dugad/">R Dugad</a>, K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">To develop a watermarking method that does not use the original image for watermark detection and resilient to image quality degradation</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify"><span id="more-1070"></span>K. Ratakonda, R. Dugad and N. Ahuja, Digital Image Watermarking: Issues in Resolving Rightful Ownership, Proc. International Conference on Image Processing, Vol. 2, Chicago, IL, October 1998, 414-418.<a href="../../abstracts/pub6_4_2_a1098dugad.htm">Abstract </a></p>
<p>R. Dugad, K. Ratakonda and N. Ahuja, A New Wavelet Based Scheme for Digital Image Watermarking, Proc. International Conference on Image Processing, Vol. 2, Chicago, IL, October 1998, 419-423. <a href="../../abstracts/pub6_4_2_a1098dugad2.htm">Abstract </a></p>
<p>R. Dugad and N. Ahuja, A Scheme for Joint Watermarking and Compression of Video, IEEE International Conference on Image Processing, Vol. 2, Vancouver, BC, Canada, September 2000, 80-84. <a href="../../abstracts/pub6_4_2_a0900dugad.htm">Abstract </a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1070</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Transform Domain Magnification or Superresolution</title>
		<link>http://vision.ai.uiuc.edu/?p=1067</link>
		<comments>http://vision.ai.uiuc.edu/?p=1067#comments</comments>
		<pubDate>Fri, 08 May 2009 16:52:36 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Computationally efficient transform domain approaches to image and video denoising, changing resolution and watermarking, both using uncompressed data and directly on the compressed data]]></category>
		<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1067</guid>
		<description><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_4_1_icon.jpg" border="2" alt="" width="160" height="160" /></p>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a href="../../dugad/">R Dugad</a>, K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">To develop fast algorithms for magnification or demagnification of a compressed image by a given amount directly in the compressed image format</p>
<p><span id="more-1067"></span>K. Ratakonda and N. Ahuja, Discrete Multi-Dimensional Linear Transforms over Connected Arbitrarily Shaped Supports, IEEE International&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_4_1_icon.jpg" border="2" alt="" width="160" height="160" /></p>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a href="../../dugad/">R Dugad</a>, K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">To develop fast algorithms for magnification or demagnification of a compressed image by a given amount directly in the compressed image format</p>
<p><span id="more-1067"></span>K. Ratakonda and N. Ahuja, Discrete Multi-Dimensional Linear Transforms over Connected Arbitrarily Shaped Supports, IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. 4, Munich, Germany, April 1997, 3041-3044. Processing, Geneva, Switzerland,  September 1996, 81-84. <a href="../../abstracts/pub6_4_1_a0497krishna.htm">Abstract </a></p>
<p>R. Dugad and N. Ahuja, A Fast Scheme for Downsampling and Unsampling in the DCT Domain, International Conference on Image Processing, Kobe, Japan, Oct. 1999, II-909-913. <a href="../../abstracts/pub6_4_1_a1099dugad.htm">Abstract </a></p>
<p>R. Dugad and N. Ahuja, A Fast Scheme for Altering Resolution in the Compressed Domain, EEE Conference on Computer Vision and Pattern Recognition, Ft. Collins, CO, June 1999, I-213-218. <a href="../../abstracts/pub6_4_1_a0699dugad.htm">Abstract </a></p>
<p>R. Dugad and N. Ahuja, A Fast Scheme for Image Size Change in the Compressed Domain, IEEE Trans. on Circuits and Systems for Video Technology, Vol. 11, No. 4, April 2001, 461-474. <a href="../../abstracts/pub6_4_1_a0401dugad.htm">Abstract </a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1067</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Video Frame Interpolation</title>
		<link>http://vision.ai.uiuc.edu/?p=1063</link>
		<comments>http://vision.ai.uiuc.edu/?p=1063#comments</comments>
		<pubDate>Fri, 08 May 2009 16:49:03 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Block-based motion estimation for missing video frame interpolation, and spatially scalable (multiresolution) video coding]]></category>
		<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1063</guid>
		<description><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
</p><p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_3_1_icon.jpg" border="2" alt="" width="160" height="160" /></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify"><a href="http://www.uiuc.edu/ph/www/sc-yoon">S Yoon</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given a video sequence with missing frames, generate the missing frames by interpolating nearby available frames<span id="more-1063"></span></p>
<p><strong><span style="font-size: x-small;"> Motion Compensated Missing Frame Interpolation</span></strong></p>
<div><span style="font-size: xx-small;">Seung-Chul Yoon and Narendra Ahuja</span><br />
<span style="font-size: xx-small;"><a href="../../scyoon/research/proj_frame_interpolation.html#abstract">[</a> <a href="../../scyoon/research/icip01interpolation.ps">Download Paper]</a> </span>
<div>
<p>Video frames are often dropped during compression at very low bit rates.     At the decoder,&#8230;</p></div></div>]]></description>
			<content:encoded><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_3_1_icon.jpg" border="2" alt="" width="160" height="160" /></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify"><a href="http://www.uiuc.edu/ph/www/sc-yoon">S Yoon</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given a video sequence with missing frames, generate the missing frames by interpolating nearby available frames<span id="more-1063"></span></p>
<p><strong><span style="font-size: x-small;"> Motion Compensated Missing Frame Interpolation</span></strong></p>
<div><span style="font-size: xx-small;">Seung-Chul Yoon and Narendra Ahuja</span><br />
<span style="font-size: xx-small;"><a href="../../scyoon/research/proj_frame_interpolation.html#abstract">[</a> <a href="../../scyoon/research/icip01interpolation.ps">Download Paper]</a> </span></p>
<div>
<p>Video frames are often dropped during compression at very low bit rates.     At the decoder, a missing frame interpolation method synthesizes the missed    frames. We propose a two step motion estimation method for the interoplation.     More specifically, the coarse motion vector field is refined at the decoder    using mesh-based motion estimation instead of using computationally intensive    dense motion estimation. We propose a framework for detecting and utilizing    local motion boundaries in terms of an explicit model. Motion boundaries   are modeled using edge detection and Hough transform. The motion of the occluding   side is represented by affine mapping. The newly appearing region is also   detected. Each pixel is interpolated differently according to adaptive interpolation   based on the property of the pixel: moving, static, and disoccluded. The  resulting quality of interpolated images is constantly better than block-based   interpolation and is comparable to optical flow methods.</p></div>
</div>
<p><strong>Results</strong><br />
We implemented the proposed algorithm using various QCIF (176&#215;144)  video    sequences. Fig. 1(a)         shows a typical motion vector field resulting from the block matching    algorithm  (BMA). The block size for the BMA is set to 16&#215;16. We tried to   make the number  of triangles of a uniform mesh the same as the case of the  BMA. After fixing  the locations of node points, the algorithm obtains  one  motion vector per  node in order to estimate affine parameters per triangle.      Fig. 1(b)         shows the improved motion vector field. Fig. 2         shows comparisons among the block-based method, the pel-recursive method,    and the proposed method. The frame numbers in  Fig. 2         refer to the interpolated frames. It means that we do not include  the   PSNRs  of the transmitted frames in the graphs. The proposed algorithm  shows   a performance  close to the pel-recursive approach in terms of PSNR.      Fig.  3         shows a comparison between two interpolated images using the block-based     method and the proposed method, respectively.</p>
<p><img src="../../scyoon/research/mthr5_org.jpg" alt="" width="197" height="160" /><br />
original image</p>
<p><img src="../../scyoon/research/chg_detection.jpg" alt="" width="197" height="160" /> <img src="../../scyoon/research/invalid_tri.jpg" alt="" width="198" height="160" /><br />
change detection                  invalid triangles  (visible   pixels)</p>
<p><a name="fig1"></a> <img src="../../scyoon/research/block_mv_field.jpg" alt="" width="243" height="200" /> <img src="../../scyoon/research/MV_final.jpg" alt="" width="243" height="200" /><br />
Fig. 1(a): block-based motion field                 Fig. 1(b): improved    motion field<br />
<a name="fig2"></a> <img src="../../scyoon/research/carphone_compare.jpg" alt="" width="270" height="220" /> <img src="../../scyoon/research/foreman_compare.jpg" alt="" width="275" height="220" /><br />
Fig. 2: Comparison of PSNR: block-based method (`&#8211;&#8217;: dashed line),  pel-recursion    method (`-&#8217;: solid line), and proposed method (`-.-&#8217;: dashed  line with dots)</p>
<p><a name="fig3"></a> <img src="../../scyoon/research/itp_no_mesh.foreman2.0.0.jpg" alt="" width="196" height="160" /> <img src="../../scyoon/research/itp_model.foreman2.1.15.1.jpg" alt="" width="196" height="160" /><br />
block-based interpolation           improved interpolation<br />
Fig. 3</p>
<p><!--more--></p>
<p><strong>Downloadable Paper</strong><br />
Seung-Chul Yoon and Narendra Ahuja, &#8220;Frame Interpolation Using Transmitted     Block-Based Motion Vectors,&#8221; To appear in ICIP 2001. <a href="../../scyoon/research/icip01interpolation.ps"> [Download Paper]</a></p>
<p><span><span style="font-family: Book Antiqua,Times New Roman,Times;">S.-C. Yoon and N. Ahuja, Frame Interpolation using Transmitted Block-Based Motion Vectors, Proc. International Conference on Image Processing, Thessaloniki, Greece, October 2001</span></span></p>
<p><strong>Contact Information</strong><br />
Seung-Chul Yoon<br />
Address:<br />
1614 Beckman Institute<br />
405 N. Mathews Avenue, Urbana IL 61801, USA.<br />
Phone: (217) 333-1869 / 244-4392<br />
Email: scyoon@vision.ai.uiuc.edu<br />
Homepage: http://vision.ai.uiuc.edu/scyoon/ <!--more--></p>
<p><a name="abstract"></a> <strong>Abstract: </strong>The proposed technique is designed for interpolating video     frames often dropped during compression using standards, such as MPEG-4   and  H.263 at very low bit rates. The coarse motion vector field is refined   at  the receiving side using mesh-based motion estimation instead of using   computationally  intensive dense motion estimation. We propose a framework   for detecting and  utilizing local motion boundaries in terms of an explicit   model. Motion boundaries  are modeled using edge detection and Hough transform.   The motion of the occluding  side is represented by affine mapping. The newly  appearing region is also  detected. Each pixel is interpolated differently   according to adaptive interpolation  based on the property of the pixel:  moving, static, and disoccluded.</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1063</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Segmentation Based Video Coding</title>
		<link>http://vision.ai.uiuc.edu/?p=1056</link>
		<comments>http://vision.ai.uiuc.edu/?p=1056#comments</comments>
		<pubDate>Fri, 08 May 2009 16:46:16 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Multiscale structure based video compression, by estimating and coding region motion instead of pixel motion]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1056</guid>
		<description><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
</p><p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_2_1_icon.jpg" border="2" alt="" width="160" height="118" /></p>
<p>K Ratakonda , <a href="http://www.uiuc.edu/ph/www/sc-yoon">S Yoon</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given a video sequence and a hierarchical segmentation representing the natural multiscale spatiotemporal structure, identify the interframe redundancy for efficient video coding which is adaptive to the desired level of coded detail</p>
<p><span id="more-1056"></span></p>
<p>Multiscale Structure-based Video Compression<br />
Seung-Chul Yoon and&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_2_1_icon.jpg" border="2" alt="" width="160" height="118" /></p>
<p>K Ratakonda , <a href="http://www.uiuc.edu/ph/www/sc-yoon">S Yoon</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given a video sequence and a hierarchical segmentation representing the natural multiscale spatiotemporal structure, identify the interframe redundancy for efficient video coding which is adaptive to the desired level of coded detail</p>
<p><span id="more-1056"></span></p>
<p>Multiscale Structure-based Video Compression<br />
Seung-Chul Yoon and Krishna Ratakonda and Narendra AhujaWe developed a very low bit rate video compression algorithm using multiscale     image segmentation based hierarchical motion compensation and residual  coding.  The proposed algorithm outperforms the H.261-like coder by 3 dB and the H.263  version 2 by 1 dB. Such gains come from the use of image segmentation   and  reversed motion prediction. The proposed region based reversed motion   compensation  strategy regulates the size and number of regions used, by  pruning multiscale  segmentation of video frames. Since regions used for motion compensation are obtained by segmenting the previously decoded frame, the shape of the regions need not be transmitted to the decoder. Furthermore,  the hierarchical motion compensation strategy involves two stages: it refines  an initial, region level, coarse motion field to obtain a dense motion field  which provides pixel level motion vectors. The refinement procedure does not require any additional information to be transmitted. We also developed a residual coding technique for coding the displaced frame difference after  segmentation based motion compensation. Residual coding is performed using  a method which exploits the fact that the energy of the residual resulting  from motion compensation is concentrated in a priori predictable positions.  This residual coding technique  can also be extrapolated to improve the performance  of coders using a block  based motion compensation strategy.</p>
<p>Results<br />
We compare our coder with a generic block based coder as used in the  H.261   or the H.263 standards. All performance comparison is performed on  the luminance   (Y) component of the video frames. In order to make an objective  comparison,   we used the same quantization strategies to quantize DCT coefficients  for   both the coders. The Huffman codes for motion vectors and DCT coefficients    were the same for both the coders. The frame bit-rate was held (approximately)    fixed for both the coders at 1280 bits. This bit-rate corresponds to a bit-rate  of 9.6 kbps if every fourth frame is coded and a bit-rate of 38.4 kbps if  all the frames are coded.</p>
<p>* Click one of the images to see a video sequence.</p>
<p><a href="../../scyoon/research/susie.mpg"><img src="../../scyoon/research/susie_orig.jpg" alt="" width="207" height="170" /> </a> <a href="../../scyoon/research/susie_blk.mpg"><img src="../../scyoon/research/susie_blk.jpg" alt="" width="207" height="170" /> </a> <a href="../../scyoon/research/susie_reg.mpg"><img src="../../scyoon/research/susie_reg.jpg" alt="" width="207" height="170" /> </a><br />
Original frame                        Standard Approach                          Our approach</p>
<p><a href="../../scyoon/research/missa.mpg"><img src="../../scyoon/research/m_orig.jpg" alt="" width="207" height="170" /> </a> <a href="../../scyoon/research/missa_blk.mpg"><img src="../../scyoon/research/mskip_blk.jpg" alt="" width="207" height="170" /> </a> <a href="../../scyoon/research/missa_reg.mpg"><img src="../../scyoon/research/mskip_reg.jpg" alt="" width="207" height="170" /> </a><br />
Original frame                       Standard Approach                             Our approach</p>
<p>We also present results comparing our residual coding scheme with the usual block DCT based coding scheme.   The overhead of 1 bit per coded block will be transmitted. Such a coder always performs better than the baseline  block DCT scheme. The following figure shows the improvement (in dB PSNR)  over the generic coder, when the quantization step size of AC coefficients  is 16 and 32.</p>
<p><img src="../../scyoon/research/pocsq16.jpg" alt="" width="490" height="419" /> <img src="../../scyoon/research/pocsq32.jpg" alt="" width="490" height="416" /></p>
<p>Downloadable Papers<br />
1. Seung Chul Yoon, Krishna Ratakonda and Narendra Ahuja, &#8220;Low Bit-Rate    Video Coding with Implicit Multiscale Segmentation,&#8221; IEEE Trans. on Circuits    and Systems for Video Technology, Vol. 9, No. 7, pp. 1115-1129, October  1999.  <a href="../../scyoon/research/proj_video_compression.html#csvt99">[Abstract</a> ]<a href="../../scyoon/research/csvt99.ps">[Download Paper]</a></p>
<p>2. Seung Chul Yoon, Krishna Ratakonda and Narendra Ahuja, &#8220;Region based     Video Coding using a Multiscale Image Segmentation,&#8221; Proc. IEEE Int. Conf.     on Image Proc. (ICIP&#8217;97), vol. 2, pp. 510-513, Santa Barbara, 1997. <a href="../../scyoon/research/proj_video_compression.html#icip97_region"> [Abstract</a> ]<a href="../../scyoon/research/icip97coding.ps">[Download Paper]</a></p>
<p>3. Krishna Ratakonda, Seung Chul Yoon and Narendra Ahuja, &#8220;Video Compression:     Coding the Displaced Frame Difference,&#8221; Proc. IEEE Int. Conf. on Image  Proc.  (ICIP&#8217;97), vol. 1, pp. 353-356, Santa Barbara, 1997. <a href="../../scyoon/research/proj_video_compression.html#dfd"> [Abstract</a> ]<a href="../../scyoon/research/icip97dfd.ps">[Download Paper]</a></p>
<p>Contact Information<br />
Seung-Chul Yoon<br />
Address:<br />
1614 Beckman Institute<br />
405 N. Mathews Avenue, Urbana IL 61801, USA.<br />
Phone: (217) 333-1869 / 244-4392<br />
Email: scyoon@vision.ai.uiuc.edu<br />
Homepage: http://vision.ai.uiuc.edu/scyoon/</p>
<p><a name="csvt99"></a> Title: Low Bit-Rate Video Coding with Implicit Multiscale Segmentation<br />
Abstract: Discusses a multiscale segmentation based video compression     algorithm aimed at very low bit-rate applications such as video teleconferencing     and video phones. We introduce novel techniques for multiscale segmentation     based motion compensation and residual coding. Our region based forward   motion compensation strategy (in terms of direction of motion vector, which   is from the previous frame to the current frame) regulates the size and number  of regions used, by pruning a multiscale segmentation of video frames. Since   regions used for motion compensation are obtained by segmenting the previously   decoded frame, the shape of the regions need not be transmitted to the decoder.   Furthermore, our hierarchical motion compensation strategy refines an initial   region level, coarse motion field to obtain a dense motion field which provides   pixel level motion vectors. The refinement procedure does not require any   additional information to be transmitted. This motion compensation technique   effectively addresses the problem of dealing with &#8220;holes&#8221; and &#8220;overlapping   regions&#8221; which are inherent to forward motion compensation. Residual coding   is performed using a novel method which exploits the fact that the energy   of the residual resulting from motion compensation is concentrated in a priori  predictable positions. We show that this residual coding technique can also  be extrapolated to improve the performance of coders using a block based motion compensation strategy. A fusion of these concepts leads to a gain of 2-3 dB in peak signal-to-noise ratio, apart from significant perceptual  improvement, over a generic video coding algorithm using a block based motion  compensation strategy (such as H.261 or H.263) for a variety of test sequences.</p>
<p><a name="icip97_region"></a> Title: Region based Video Coding using a Multiscale Image Segmentation<br />
Abstract: This paper proposes a novel region-based video coding technique    using a multiscale image segmentation method thus obtaining better quality    at the same bit rate. In most of the previous region-based video coding  techniques,  occlusion caused degradation in terms of both PSNR and perceptual  video quality.We  propose a new motion estimation and compensation algorithm  which solves occlusion  related problems effectively. The proposed motion  estimation and compensation  is a two stage procedure: the first stage uses  a coarse motion model while  the second stage uses a dense motion model. The coarse motion model generates  region level motion vectors which are then fine tuned by the dense motion  model which produces pixel level motion vectors. A fusion of these concepts  leads to a gain of 2~3 dB in PSNR over the block-based algorithm for a variety  of test sequences using a fully functional video coder.</p>
<p><a name="dfd"></a> Title: Video Compression: Coding the Displaced Frame Difference<br />
Abstract: Popular techniques employed to code the displaced frame difference (DFD) treat it no differently from an ordinary image for coding purposes. Since the DFD is generated by the process of motion compensation,  such methods do not fully exploit the underlying redundancies. This paper  proposes a DFD coding method which exploits such redundancies while incurring  negligible information overhead. The key idea is to predict locations of high DFD concentration which occupy small portions of the image and use this predicted information (which is also available to the decoder without additional information transmission) to improve the quality of the decoded image. Two key features of the proposed approach are its compatibility with any transform based DFD coding scheme and negligible information overhead. Tests with a fully functional video coder show the efficacy of the proposed approach.</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1056</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Structure Based Image Denoising</title>
		<link>http://vision.ai.uiuc.edu/?p=1048</link>
		<comments>http://vision.ai.uiuc.edu/?p=1048#comments</comments>
		<pubDate>Fri, 08 May 2009 16:36:48 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Multiscale structure based image representation using a set of regions]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1048</guid>
		<description><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
</p><p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_1_3_icon.gif" border="2" alt="" width="160" height="81" /></p>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">P Ishwar, P Moulin, K Ratakonda, <a href="http://www.uiuc.edu/ph/www/msingh">M Singh</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given an image and its hierarchical segmentation representing its natural multiscale structure, use the knowledge of the image regions to smooth out the noise in the region interiors without blurring borders</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify"><span id="more-1048"></span></p>
<p>Image Denoising Using&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_1_3_icon.gif" border="2" alt="" width="160" height="81" /></p>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">P Ishwar, P Moulin, K Ratakonda, <a href="http://www.uiuc.edu/ph/www/msingh">M Singh</a></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given an image and its hierarchical segmentation representing its natural multiscale structure, use the knowledge of the image regions to smooth out the noise in the region interiors without blurring borders</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify"><span id="more-1048"></span></p>
<p>Image Denoising Using Projection on Convex Sets</p>
<p align="center">Krishna Ratakonda</p>
<p style="text-align: center;">and Narendra Ahuja<br />
Beckman Institute for Advanced  Science and Technology<br />
Department of Electrical and Computer Engineering<br />
University of Illinois at Urbana-Champaign</p>
<p style="text-align: left;">This work addresses the problem of denoising of images corrupted by AWGN. The Wiener filter is optimum in minimizing the mean-square-error under suitable assumptions of stationarity of the signal statistics. Locally, such assumptions are reasonable, as in the adaptive realization of theWiener filter whose performance is among the best known till date. Over the last few years, there has been much interest in threshold based denoising schemes. In this paper we present a novel framework for denoising signals from their compact representation in multiple domains. Each domain captures, uniquely, certain signal characteristics better than others. We define confidence sets around data in each domain and find sparse estimates that lie in the intersection of these sets, using a POCS algorithm. Simulations demonstrate the superior nature of the reconstruction (both in terms of mean-square error and perceptual quality) in comparison to the adaptive Wiener filter.</p>
<p>Results</p>
<p>The following images compare the performance of our scheme to that of Donoho and Johnstone&#8217;s.</p>
<p align="center"><img src="../../krishna/fig3a.jpg" border="0" alt="" width="313" height="326" align="center" /> <img src="../../krishna/fig3b.jpg" border="0" alt="" width="321" height="327" align="center" /></p>
<p align="center">(a)                                                                                             (b)</p>
<p align="center"><img src="../../krishna/fig4a.jpg" alt="http://vision.ai.uiuc.edu/krishna/fig4a.jpg" width="310" height="325" /><img src="../../krishna/fig4b.jpg" alt="http://vision.ai.uiuc.edu/krishna/fig4b.jpg" /></p>
<p align="center">(c)                                                                                             (d)</p>
<p>(a) Lena processed with the Donoho-Johnstone scheme (best singleDaubechies filter). PSNR: 33.25 dB. (b) Lena processed with 1-3-4 wavelet filters. PSNR: 35.01 dB. Original noisy image (not shown) PSNR: 31.21 dB. (c) Goldhill processed with the Donoho-Johnstone scheme (best single Daubechies filter). PSNR: 30.59 dB. (d) Goldhill processed with 1-3-5 wavelet filters. PSNR: 33.51 dB. Original noisy image (not shown) PSNR: 29.04 dB.</p>
<p><img src="file:///C:/DOCUME~1/MYRANA~1/LOCALS~1/Temp/moz-screenshot-1.jpg" alt="" /><img src="../../krishna/lena.jpg" alt="http://vision.ai.uiuc.edu/krishna/lena.jpg" /></p>
<p>PSNR as a function of the number of vanishing moments of the wavelet used.</p>
<p>Papers<img src="file:///C:/DOCUME%7E1/MYRANA%7E1/LOCALS%7E1/Temp/moz-screenshot-3.jpg" alt="" /></p>
<ul>
<li>Segmentation based denoising using       multiple compaction domainsSingh, M.; Ishwar, P.; Ratakonda, K.;       Ahuja, N. Image Processing, 1999.       ICIP 99. Proceedings. 1999 International Conference on , Volume:       1 , 1999 Page(s):       372 -375 vol.1</li>
<li>Image denoising using multiple       compaction domainsIshwar, P.; Ratakonda, K.; Moulin, P.;       Ahuja, N. Acoustics, Speech and       Signal Processing, 1998. Proceedings of the 1998 IEEE International       Conference on , Volume: 3 ,       1998 Page(s): 1889 -1892       vol.3</li>
</ul>
<p><!--more--></p>
<p>Publications</p>
<p>K. Ratakonda and N. Ahuja, Restoring Image Quality Through Structure Preserving De-Noising, 3rd Asian Conference on Computer Vision, Hong Kong, January 8-11, 1998, 33-40.  <a href="http://www.springerlink.com/content/1033754783504047/">Abstract</a></p>
<p>P. Ishwar, K. Ratakonda, P. Moulin and N. Ahuja, Image De-noising Using Multiple Compaction Domains, International Conference on Acoustics, Speech, and Signal Processing, Vol. III, Seattle, WA, May 12-15, 1998, Invited, 1889-1892.  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=681833">Abstract</a></p>
<p>M. Singh, P. Ishwar, K. Ratakonda and N. Ahuja, Segmentation Based Denoising Using Multiple Compaction Domains, International Conference on Image Processing, Kobe, Japan, Oct. 1999, I-372-375.  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=821633">Abstract</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1048</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Structure Based  Image Magnification or Superresolution</title>
		<link>http://vision.ai.uiuc.edu/?p=1045</link>
		<comments>http://vision.ai.uiuc.edu/?p=1045#comments</comments>
		<pubDate>Fri, 08 May 2009 16:32:49 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Multiscale structure based image representation using a set of regions]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1045</guid>
		<description><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
</p><p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_1_2_icon.jpg" border="2" alt="" width="160" height="161" /></p>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given an image and its hierarchical segmentation representing its natural multiscale structure, use the explicit, known geometry for image scaling, e.g., image expansion for superresolution</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">Details:</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p style="margin: 5px 30px; text-indent: 30px;" align="justify">
</p><p><span id="more-1045"></span></p>
<p style="text-align: center;">POCS BASED ADAPTIVE IMAGE MAGNIFICATION</p>
<p style="text-align: center;">Krishna Ratakonda and Narendra Ahuja<br />
Beckman Institute for Advanced Science and Technology<br />
Department&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_1_2_icon.jpg" border="2" alt="" width="160" height="161" /></p>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given an image and its hierarchical segmentation representing its natural multiscale structure, use the explicit, known geometry for image scaling, e.g., image expansion for superresolution</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Details:</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p><span id="more-1045"></span></p>
<p style="text-align: center;">POCS BASED ADAPTIVE IMAGE MAGNIFICATION</p>
<p style="text-align: center;">Krishna Ratakonda and Narendra Ahuja<br />
Beckman Institute for Advanced Science and Technology<br />
Department of Electrical and Computer Engineering<br />
University of Illinois at Urbana-Champaign</p>
<p>Problem Definition</p>
<blockquote><dl>
<dt> Resolution enhancement involves the problem of magnifying a small image to several times its size while avoiding blurring, ringing and other artifacts.  We tackle the problem of magnifying an image without incurring the edge enhancement effects and other structural distortions characteristic of classical  image magnification techniques. We propose an iterative algorithm based on a Projections onto Convex Sets (POCS) formalization.</dt>
</dl>
</blockquote>
<p>Applications:</p>
<ol>
<li>Image Magnification.</li>
<li>Phase Retrieval.</li>
</ol>
<p>Details:</p>
<blockquote><p>Classical image magnification  methods include bilinear, bicubic and FIR interpolation schemes followed by a sharpening method like unsharp masking. Such interpolation schemes tend to blur the images when applied indiscriminately. Unsharp masking, which involves subtracting a properly scaled Laplacian of the image from itself, produces artifacts and increases noise. More sophisticated schemes involving wavelet- or fractal-based techniques have also been proposed. Such methods perform extrapolation of the signal in either the wavelet or fractal domain, which leads to objectionable artifacts when the assumptions behind such extrapolation are violated. It may also be noted that such extrapolatory assumptions predict and actively enhance the high-frequency content within the image, thus increasing any noise present in the unmagnified image.</p>
<p>The proposed method starts with an initial magnified image obtained through selective interpolation followed by an iterative procedure which aims to avoid edge-related artifacts while retaining and enhancing sharpness. The initial image is a composite image formed from a base interpolation scheme in the smooth areas of the image and from a selective interpolation mechanism in the non-smooth (or edge) areas. The proposed iterative algorithm aims to find a magnified image satisfying two constraints: one of the constraints is derived from sampling theory while the other constraint reflects the confidence that we place on the initial iterate. Both the constraints are convex sets; thus, we seek a solution which is at the intersection of these two convex sets and  can be obtained using the projection on convex sets (POCS) method.  Starting with the initial iterate, we project alternately on the two constraints. Convergence is guaranteed since we operate within the POCS formalism.</p>
<p>The proposed algorithm consists of three basic steps: Finding edges<br />
Edge locations are found using a multiscale segmentation algorithm. Obtaining the initial image<br />
This is done using a bilinear interpolation scheme in the smooth regions and a selective interpolation mechanism in the non-smooth areas POCS based iterative algorithm<br />
The first constraint set arises from sampling theory which suggests that the unmagnified image can be viewed as being  obtained by subsampling the magnified image without aliasing. The second constraint set arises by constraining the non-edge and edge location values to vary within limits from their initial estimate.</p></blockquote>
<p>Results:</p>
<blockquote><p><img src="../../6_1_2/mag1a.jpg" alt="" width="257" height="257" /> <img src="../../6_1_2/mag1b.jpg" alt="" width="261" height="259" /></p>
<p>Fig 1a &#8211; Lena, 4x magnification using bilinear interpolation                      Fig 1b -  Segmentation of  Fig 1a</p>
<p><img src="../../6_1_2/mag1c.jpg" alt="" width="257" height="257" /> <img src="../../6_1_2/mag1d.jpg" alt="" width="257" height="257" /></p>
<p>Fig 1c &#8211; Proposed method  with coarse segmentation                              Fig 1d &#8211; Proposed method with fine segmentation</p></blockquote>
<p><img src="../../6_1_2/mag2a.jpg" alt="" width="257" height="257" /> <img src="../../6_1_2/mag2b.jpg" alt="" width="257" height="257" /></p>
<p>Fig 2a &#8211; Goldhill, 4x magnification, Bilinear interpolation              Fig 2b &#8211; Proposed scheme (note the crisper bars).</p>
<p>Publications:</p>
<ul>
<li>POCS based adaptive image magnification
<ul>
<li>Ratakonda, K.; Ahuja, N.</li>
</ul>
</li>
</ul>
<p>Image Processing, 1998. ICIP 98. Proceedings. 1998 International Conference on ,  1998</p>
<p>Page(s): 203 -207 vol.3</p>
<p><!--more--></p>
<p align="justify">K. Ratakonda and N. Ahuja, POCS-Based Adaptive Image Magnification, Proc. International Conference on Image Processing, Vol. 3, Chicago, IL, October 1998, 203-207.  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=727167">Abstract</a></p>
<p align="justify">K. Ratakonda and N. Ahuja, Super Resolution with Region       Sensitive Interpolation, Proc. Image       Understanding Workshop, New Orleans, LA, May 1997, 537-540.<a href="../../abstracts/pub6_1_2_a0597krishna.htm"></a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1045</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Structure Based Image Compression</title>
		<link>http://vision.ai.uiuc.edu/?p=1042</link>
		<comments>http://vision.ai.uiuc.edu/?p=1042#comments</comments>
		<pubDate>Fri, 08 May 2009 16:31:18 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Multiscale structure based image representation using a set of regions]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1042</guid>
		<description><![CDATA[<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_1_1_icon.jpg" border="2" alt="" width="160" height="159" /></p>
</h2>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given an image and its hierarchical segmentation representing its natural multiscale structure, use the compactness of the structural description for best image compression</p>
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
</p><p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
</p><p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
</p><p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
</p><p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center"><span id="more-1042"></span></p>
<p>Structure Based Image Compression<br />
Krishna Ratakonda and Narendra Ahuja</p>
<div>Our novel reversible image compression method employs multiscale segmentation    within&#8230;</div>]]></description>
			<content:encoded><![CDATA[<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj6_1_1_icon.jpg" border="2" alt="" width="160" height="159" /></p>
</h2>
<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;">K Ratakonda</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given an image and its hierarchical segmentation representing its natural multiscale structure, use the compactness of the structural description for best image compression</p>
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center">
<p style="margin-left: 0pt; margin-top: 6px; margin-bottom: 6px;" align="center"><span id="more-1042"></span></p>
<p>Structure Based Image Compression<br />
Krishna Ratakonda and Narendra Ahuja</p>
<div>Our novel reversible image compression method employs multiscale segmentation    within a computationally efficient optimization framework to obtain consistently    good performance over a wide variety of images. We present new edge models    that deal effectively with two issues that make such models normally unsuitable    for compression applications: local applicability and large number of parameters    needed for representation. Segmentation information is provided by a recent    transform (1993), which we found to possess qualities making it especially    suitable for compression. The final residual image is obtained using autocorrelation-based    2-D linear prediction. Different implementations providing lossless compression    are presented along with results over a number of common test images. Results    show that the proposed approach can be used to yield robust lossless compression,    while providing consistently and significantly better results than the best  possible JPEG lossless coder.</div>
<div>
<p>Results</p></div>
<div>Results show a consistent 15-20% improvement over the best possible JPEG    lossless standard (see the table below). The results are invariant to the    amount of detail and noise in the image. It is also found that the typical    probability distribution of the residual image values is not Laplacian which   do not use explicit edge modelling. It&#8217;s more Gaussian in shape, thus suggesting   that the residual is mostly random noise. In conclusion, we have proposed   a theoretically sound lossless compression method, which makes no crude approximations  to the structure in the image. We have also proposed ways to represent edge  models, which makes coding them compression wise a variable proposition.</div>
<div>The following tables are the results applying the various implementations.    All results are in bits per pixel. In (a)-(c), interior (of a region) residual    entropy and edge residual entropy are provided in addition to total entropy.    Total entropy in all cases includes the overhead storage space.</div>
<table border="1" width="100%">
<tbody>
<tr>
<td></td>
<td>Airplane</td>
<td>Baboon</td>
<td>Lena</td>
<td>Sailboat</td>
<td>Tiffany</td>
</tr>
<tr>
<td>interior entropy</td>
<td>3.40</td>
<td>4.40</td>
<td>3.91</td>
<td>4.10</td>
<td>3.78</td>
</tr>
<tr>
<td>edge entropy</td>
<td>4.35</td>
<td>4.31</td>
<td>4.48</td>
<td>4.39</td>
<td>4.51</td>
</tr>
<tr>
<td>total entropy</td>
<td>3.94</td>
<td>5.10</td>
<td>4.37</td>
<td>4.49</td>
<td>4.08</td>
</tr>
</tbody>
<p>(a) piecewise constant model</table>
<table border="1" width="100%">
<tbody>
<tr>
<td></td>
<td>Airplane</td>
<td>Baboon</td>
<td>Lena</td>
<td>Sailboat</td>
<td>Tiffany</td>
</tr>
<tr>
<td>interior entropy</td>
<td>3.43</td>
<td>4.47</td>
<td>3.96</td>
<td>4.15</td>
<td>3.81</td>
</tr>
<tr>
<td>edge entropy</td>
<td>4.25</td>
<td>4.21</td>
<td>4.33</td>
<td>4.29</td>
<td>4.32</td>
</tr>
<tr>
<td>total entropy</td>
<td>3.89</td>
<td>5.07</td>
<td>4.27</td>
<td>4.40</td>
<td>3.97</td>
</tr>
</tbody>
<p>(b) piecewise linear model</table>
<table border="1" width="100%">
<tbody>
<tr>
<td></td>
<td>Airplane</td>
<td>Baboon</td>
<td>Lena</td>
<td>Sailboat</td>
<td>Tiffany</td>
</tr>
<tr>
<td>interior entropy</td>
<td>3.47</td>
<td>4.52</td>
<td>4.03</td>
<td>4.23</td>
<td>3.86</td>
</tr>
<tr>
<td>edge entropy</td>
<td>4.01</td>
<td>4.13</td>
<td>4.17</td>
<td>4.17</td>
<td>4.18</td>
</tr>
<tr>
<td>total entropy</td>
<td>3.75</td>
<td>4.95</td>
<td>4.11</td>
<td>4.29</td>
<td>3.89</td>
</tr>
</tbody>
<p>(c) linear prediction model</table>
<table border="1" width="100%">
<tbody>
<tr>
<td>Airplane</td>
<td>Baboon</td>
<td>Lena</td>
<td>Sailboat</td>
<td>Tiffany</td>
</tr>
<tr>
<td>4.12</td>
<td>6.45</td>
<td>4.61</td>
<td>5.25</td>
<td>4.32</td>
</tr>
</tbody>
<p>(d) best JPEG implementation</table>
<div>
<p>Publications<br />
Segmentation Based Reversible Image Compression. Krishna Ratakonda and   Narendra  Ahuja. IEEE Conference on IP, 1996.</p></div>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1042</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Video Encoding using Coset Codes</title>
		<link>http://vision.ai.uiuc.edu/?p=1027</link>
		<comments>http://vision.ai.uiuc.edu/?p=1027#comments</comments>
		<pubDate>Fri, 08 May 2009 16:08:02 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>
		<category><![CDATA[Synthesis, Rendering and Visualization]]></category>
		<category><![CDATA[Video Compression using Wyner-Ziv Codes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1027</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/coset_code.jpg" border="2" alt="" width="160" height="115" /></p>
<p style="margin-top: 0px; margin-bottom: 0px; word-spacing: 0px; text-indent: 4px;">A. Sehgal, A. Jagmohan, N. Ahuja</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">This project deals with scalable coding and robust Internet streaming of predictively encoded media. We frame the problem of predictive coding as a variant of the Wyner-Ziv problem in Information theory. Subsequently, LDPC based coset&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/coset_code.jpg" border="2" alt="" width="160" height="115" /></p>
<p style="margin-top: 0px; margin-bottom: 0px; word-spacing: 0px; text-indent: 4px;">A. Sehgal, A. Jagmohan, N. Ahuja</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">This project deals with scalable coding and robust Internet streaming of predictively encoded media. We frame the problem of predictive coding as a variant of the Wyner-Ziv problem in Information theory. Subsequently, LDPC based coset code constructions are used to compress the media in a scalable, error-resilient manner.<span id="more-1027"></span></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<ol>
<li>A. Sehgal, A. Jagmohan, N. Ahuja &#8220;Wyner-Ziv Coding of Video: Applications to Error Resilience,&#8221; IEEE Trans. Multimedia, April 2004, pages 249 &#8211; 258. <a href="../../publications/stmm.pdf">Full Text</a></li>
<li>A. Sehgal, A. Jagmohan, N. Ahuja &#8220;A State-free Causal Video Encoding Paradigm, &#8221; Invited Paper, Proc. IEEE Int. Conf. Image Processing, 2003, pp. I-605-608 <a href="../../publications/sicip.pdf">Full Text</a></li>
<li>A. Sehgal, A. Jagmohan, N. Ahuja &#8220;Scalable Predictive Coding and the Wyner-Ziv Problem, &#8221; Proc. IEEE Int. Conf. Comm. Systems, 2002. <a href="../../publications/siccs2.pdf">Full Text</a></li>
<li>A. Sehgal, N. Ahuja, &#8220;Robust predictive coding and the Wyner-Ziv problem,&#8221; Data Compression Conference, Snowbird, Utah, 2002. pp. 103 <a href="../../publications/sdcc.pdf">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1027</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Compression of Image-based Rendering Data</title>
		<link>http://vision.ai.uiuc.edu/?p=1025</link>
		<comments>http://vision.ai.uiuc.edu/?p=1025#comments</comments>
		<pubDate>Fri, 08 May 2009 16:04:45 +0000</pubDate>
		<dc:creator>myra</dc:creator>
				<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Video Compression using Wyner-Ziv Codes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1025</guid>
		<description><![CDATA[<p><img src="file:///C:/DOCUME~1/MYRANA~1/LOCALS~1/Temp/moz-screenshot.jpg" alt="" /></p>
<p align="center"><img class="alignleft" src="../../project_new/compimgbased.jpg" alt="" width="160" height="132" /></p>
<p style="margin-top: 0px; margin-bottom: 0px; word-spacing: 0px; text-indent: 4px;">A. Jagmohan, A. Sehgal, N. Ahuja</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">The design of compression techniques for streaming of image-based rendering data to remote viewers. A compression algorithm based on the use of Wyner-Ziv codes is proposed, which satisfies the key constraints for IBR streaming, namely&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><img src="file:///C:/DOCUME~1/MYRANA~1/LOCALS~1/Temp/moz-screenshot.jpg" alt="" /></p>
<p align="center"><img class="alignleft" src="../../project_new/compimgbased.jpg" alt="" width="160" height="132" /></p>
<p style="margin-top: 0px; margin-bottom: 0px; word-spacing: 0px; text-indent: 4px;">A. Jagmohan, A. Sehgal, N. Ahuja</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">The design of compression techniques for streaming of image-based rendering data to remote viewers. A compression algorithm based on the use of Wyner-Ziv codes is proposed, which satisfies the key constraints for IBR streaming, namely those of random access for interactivity, and precompression.<span id="more-1025"></span></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">A. Jagmohan, A. Sehgal, N. Ahuja &#8220;Compression of Light-field Rendering Data using Coset Codes , &#8221; Invited Paper, Proc. Asilomar Conf. on Sig., Syst., and Comp., 2003. <a href="../../publications/sasilomar.pdf">Full Text</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1025</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Multiscale Texture Element Detection</title>
		<link>http://vision.ai.uiuc.edu/?p=1002</link>
		<comments>http://vision.ai.uiuc.edu/?p=1002#comments</comments>
		<pubDate>Fri, 08 May 2009 02:43:23 +0000</pubDate>
		<dc:creator>avinash</dc:creator>
				<category><![CDATA[3D Computer Vision: Scene Reconstruction]]></category>
		<category><![CDATA[3D Surface Orientation from Texture Gradient]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=1002</guid>
		<description><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/icons2/proj1_1_1_icon.jpg" alt="" width="160" height="115" />In an image containing texture elements at a range of scales, detect all elements, their relative locations and mutual containment relationships.</p>
<p><span id="more-1002"></span></p>
<p class="MsoNormal" style="line-height: 100%;" align="justify">OBJECTIVE<br />
Given a slanted view of a planar, homogeneously textured surface, estimate the surface slant from the image texture gradient.</p>
<p>APPROACH<br />
(1) Identification&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/icons2/proj1_1_1_icon.jpg" alt="" width="160" height="115" />In an image containing texture elements at a range of scales, detect all elements, their relative locations and mutual containment relationships.</p>
<p><span id="more-1002"></span></p>
<p class="MsoNormal" style="line-height: 100%;" align="justify">OBJECTIVE<br />
Given a slanted view of a planar, homogeneously textured surface, estimate the surface slant from the image texture gradient.</p>
<p>APPROACH<br />
(1) Identification of image texture elements (texels) that correspond to surface texture elements is itself a significant problem since the scale at which surface detail is captured varies continuously with the three-dimensional distance, and therefore across the image texture. The image texels may exhibit a systematic variation in a priori unknown properties, e. g., size, density or contrast. All regions are potential texels. Consequently, all regions, of all sizes and contrasts, are detected at each location and treated as candidate texels.</p>
<p>(2) The estimation of surface slope (slant and tilt) is integrated with the process of selecting texels from among the large number of detected regions. For any given slant and tilt, only those regions across the image are interpreted as texels whose properties, e. g., area distribution, match the spatial distribution predicted by the hypothesized slant and tilt, and which occupy the largest fraction of the image. The image area is used as a measure of  the extent of support for the particular slant-tilt pair.</p>
<p style="line-height: 100%;" align="justify">(3) All possible slant-tilt values are considered as hypotheses, and a search is conducted to find the hypothesis with most support. This is the estimated surface orientation.</p>
<p>RESULTS<br />
Several real life texture images are shown below. For each, estimated values of the slant-tilt angle pairs are depicted graphically by showing how a hypothetical surface containing fixed size disks would look when viewed at the estimated slant and tilt. Thus, a visual comparison of the original texture image and its graphical depiction can be made to obtain a quick assessment of the quality of the derived estimates.</p>
<p>Each of the following figures consists of four parts:</p>
<ol type="a">
<li>
<p style="line-height: 100%;" align="justify">The original texture image,</p>
</li>
<li>
<p style="line-height: 100%;" align="justify">A set of bright regions detected at multiple scales and as candidate texels,</p>
</li>
<li>
<p style="line-height: 100%;" align="justify">The estimated surface slant and tilt values shown graphically, and</p>
</li>
<li>
<p style="line-height: 100%;" align="justify">The subset of bright regions supporting the estimates surface orientation, i.e., the estimated image texels, shown superposed on the original image.</p>
</li>
</ol>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<table style="height: 2425px;" border="1" width="493">
<tbody>
<tr>
<td width="25%" height="19" align="center">a. Original</td>
<td width="25%" height="19" align="center">b. All regions</td>
<td width="27%" height="19" align="center">c. Slant tilt                     estimates</td>
<td width="23%" height="19" align="center">d. Detected texels</td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_2.gif" border="0" alt="" width="250" height="184" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_1.gif" border="0" alt="" width="250" height="183" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_4.gif" border="0" alt="" width="250" height="186" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_3.gif" border="0" alt="" width="250" height="185" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_6.gif" border="0" alt="" width="250" height="176" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_5.gif" border="0" alt="" width="250" height="179" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_8.gif" border="0" alt="" width="250" height="184" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_7.gif" border="0" alt="" width="250" height="181" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_10.gif" border="0" alt="" width="250" height="182" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_9.gif" border="0" alt="" width="250" height="182" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_12.gif" border="0" alt="" width="250" height="180" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_11.gif" border="0" alt="" width="250" height="186" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="15"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_14.gif" border="0" alt="" width="250" height="177" /></td>
<td colspan="2" width="50%" height="15"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_13.gif" border="0" alt="" width="250" height="182" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_16.gif" border="0" alt="" width="250" height="178" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_15.gif" border="0" alt="" width="250" height="180" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_18.gif" border="0" alt="" width="250" height="184" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_17.gif" border="0" alt="" width="250" height="182" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_20.gif" border="0" alt="" width="250" height="137" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_19.gif" border="0" alt="" width="250" height="133" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_22.gif" border="0" alt="" width="250" height="180" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_21.gif" border="0" alt="" width="250" height="180" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_25.gif" border="0" alt="" width="250" height="176" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_23.gif" border="0" alt="" width="250" height="180" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_28.gif" border="0" alt="" width="250" height="177" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_26.gif" border="0" alt="" width="250" height="179" /></td>
</tr>
</tbody>
</table>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">PUBLICATIONS</p>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p align="justify">1. D. Blostein and N. Ahuja, A Multiscale Region Detector, Computer Vision, Graphics and Image Processing, January 1989, 22-41.       <a href="../../abstracts/pub1_1_1_a0189blostein.htm">Abstract</a></p>
<p>2. S. Jackson and N. Ahuja, Elliptical Gaussian Filters, Proceedings International Conference on Pattern Recognition, Vienna, Austria, August 26-29, 1996, 775-779. <a href="../../abstracts/pub1_1_1_a0896jackson.htm">Abstract</a></p>
<p align="justify">3. N. Ahuja and S. A. Jackson, Multiscale Region Detection,       Proceedings       Image Understanding Workshop, Palm Springs, CA, February 12-15, 1996,       961-967.       <a href="../../abstracts/pub1_1_1_a0296jackson.htm">Abstract</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=1002</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Shape from Texture</title>
		<link>http://vision.ai.uiuc.edu/?p=997</link>
		<comments>http://vision.ai.uiuc.edu/?p=997#comments</comments>
		<pubDate>Fri, 08 May 2009 02:34:26 +0000</pubDate>
		<dc:creator>avinash</dc:creator>
				<category><![CDATA[3D Computer Vision: Scene Reconstruction]]></category>
		<category><![CDATA[3D Surface Orientation from Texture Gradient]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=997</guid>
		<description><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/icons2/proj1_1_2_icon.jpg" alt="" width="160" height="116" />Given the image of a homogeneously textured planar surface at unknown orientation relative to the camera, and the output of a multiscale image region detector, estimate the surface orientation.</p>
<p><span id="more-997"></span></p>
<p class="MsoNormal" style="line-height: 100%;" align="justify">OBJECTIVE<br />
Given a slanted view of a planar, homogeneously textured surface, estimate the&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/icons2/proj1_1_2_icon.jpg" alt="" width="160" height="116" />Given the image of a homogeneously textured planar surface at unknown orientation relative to the camera, and the output of a multiscale image region detector, estimate the surface orientation.</p>
<p><span id="more-997"></span></p>
<p class="MsoNormal" style="line-height: 100%;" align="justify">OBJECTIVE<br />
Given a slanted view of a planar, homogeneously textured surface, estimate the surface slant from the image texture gradient.</p>
<p>APPROACH<br />
(1) Identification of image texture elements (texels) that correspond to surface texture elements is itself a significant problem since the scale at which surface detail is captured varies continuously with the three-dimensional distance, and therefore across the image texture. The image texels may exhibit a systematic variation in a priori unknown properties, e. g., size, density or contrast. All regions are potential texels. Consequently, all regions, of all sizes and contrasts, are detected at each location and treated as candidate texels.</p>
<p>(2) The estimation of surface slope (slant and tilt) is integrated with the         process of selecting texels from among the large number of detected regions. For any         given slant and tilt, only those regions across the image are interpreted as texels whose properties,         e. g., area distribution, match the spatial distribution predicted by         the hypothesized slant and tilt, and which occupy the largest fraction of the image.         The image area is used as a measure of  the         extent of support for the particular slant-tilt pair.</p>
<p style="line-height: 100%;" align="justify">(3) All possible slant-tilt values are considered as hypotheses, and         a search is conducted to find the hypothesis with most support. This is the estimated surface orientation.</p>
<p>RESULTS<br />
Several real life texture images are shown below. For each, estimated values of the         slant-tilt angle pairs are depicted graphically by         showing how a hypothetical surface containing fixed size disks would look         when viewed at the estimated slant and tilt. Thus, a visual comparison of         the original texture image and its graphical depiction can be made to obtain a quick assessment         of the quality of the         derived estimates.</p>
<p>Each of the following figures consists of four parts:</p>
<ol type="a">
<li>
<p style="line-height: 100%;" align="justify">The original texture image,</p>
</li>
<li>
<p style="line-height: 100%;" align="justify">A set of bright regions detected at multiple scales and as candidate texels,</p>
</li>
<li>
<p style="line-height: 100%;" align="justify">The estimated surface slant and tilt values shown graphically, and</p>
</li>
<li>
<p style="line-height: 100%;" align="justify">The subset of bright regions supporting the estimates surface orientation,             i.e., the estimated image texels, shown superposed on the original image.</p>
</li>
</ol>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<table style="height: 2425px;" border="1" width="493">
<tbody>
<tr>
<td width="25%" height="19" align="center">a. Original</td>
<td width="25%" height="19" align="center">b. All regions</td>
<td width="27%" height="19" align="center">c. Slant tilt                     estimates</td>
<td width="23%" height="19" align="center">d. Detected texels</td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_2.gif" border="0" alt="" width="250" height="184" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_1.gif" border="0" alt="" width="250" height="183" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_4.gif" border="0" alt="" width="250" height="186" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_3.gif" border="0" alt="" width="250" height="185" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_6.gif" border="0" alt="" width="250" height="176" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_5.gif" border="0" alt="" width="250" height="179" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_8.gif" border="0" alt="" width="250" height="184" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_7.gif" border="0" alt="" width="250" height="181" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_10.gif" border="0" alt="" width="250" height="182" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_9.gif" border="0" alt="" width="250" height="182" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_12.gif" border="0" alt="" width="250" height="180" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_11.gif" border="0" alt="" width="250" height="186" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="15"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_14.gif" border="0" alt="" width="250" height="177" /></td>
<td colspan="2" width="50%" height="15"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_13.gif" border="0" alt="" width="250" height="182" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_16.gif" border="0" alt="" width="250" height="178" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_15.gif" border="0" alt="" width="250" height="180" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_18.gif" border="0" alt="" width="250" height="184" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_17.gif" border="0" alt="" width="250" height="182" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_20.gif" border="0" alt="" width="250" height="137" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_19.gif" border="0" alt="" width="250" height="133" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_22.gif" border="0" alt="" width="250" height="180" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_21.gif" border="0" alt="" width="250" height="180" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_25.gif" border="0" alt="" width="250" height="176" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_23.gif" border="0" alt="" width="250" height="180" /></td>
</tr>
<tr>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_28.gif" border="0" alt="" width="250" height="177" /></td>
<td colspan="2" width="50%" height="19"><img src="../../labhomepage/public_html/projects/1_1_1/slides/slides_26.gif" border="0" alt="" width="250" height="179" /></td>
</tr>
</tbody>
</table>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">PUBLICATIONS</p>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">1. N. Ahuja and T. Huang, IU at UI: An Overview and an       Example on Shape from Texture, Proc.       DARPA Image Understanding Workshop, Boston, April 6-8, 1988, 222-253.</p>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">2. D. Blostein and N.       Ahuja, Representation and       Three-dimensional Interpretation of Image Texture: An Integrated Approach,       First International Conference on       Computer Vision, London, England, June 8-11, 1987, 444-449.       <a href="../../abstracts/pub1_1_2_a0687blostein.htm">Abstract</a> <a href="../../abstracts/pub1_1_2_a0687blostein.htm"> </a></p>
<p align="justify">3. D. Blostein and N. Ahuja, Shape from Texture: Integrating Texture Element Extraction and Surface Estimation, IEEE Trans. Pattern Analysis and Machine Intelligence, December 1989, 1233-1251. <a href="../../abstracts/pub1_1_2_a1289blostein.htm">Abstract</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=997</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Non-Lambertian Surface Reconstruction and Reflectance Modeling</title>
		<link>http://vision.ai.uiuc.edu/?p=991</link>
		<comments>http://vision.ai.uiuc.edu/?p=991#comments</comments>
		<pubDate>Fri, 08 May 2009 02:30:03 +0000</pubDate>
		<dc:creator>avinash</dc:creator>
				<category><![CDATA[3D Computer Vision: Scene Reconstruction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reflectance and Illumination]]></category>
		<category><![CDATA[Reseach Themes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=991</guid>
		<description><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/project_new/reflect_head.jpg" alt="" width="128" height="147" />Non-lambertian surfaces causes difficulties for many stereo systems. We describe methods to recover both 3D surface shape and reflectance models of an object from multiple views. We use an iterative method, based on multi-view shape from shading, to estimate shape&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/project_new/reflect_head.jpg" alt="" width="128" height="147" />Non-lambertian surfaces causes difficulties for many stereo systems. We describe methods to recover both 3D surface shape and reflectance models of an object from multiple views. We use an iterative method, based on multi-view shape from shading, to estimate shape and reflectance models. The estimated models can be used to generate objects in new views and under new lighting conditions using computer graphics techniques.</p>
<p><span id="more-991"></span></p>
<ol>
<li>Tianli Yu, Ning Xu and Narendra Ahuja, Recovering Shape and Reflectance Model of Non-Lambertian Objects from Multiple Views, 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR&#8217;04) Volume 2, pp. 226-233, 06 27 &#8211; 07 02, 2004, Washington, D.C., USA <a href="../../publications/CVPR2004Tianli.pdf">Full Text</a></li>
<li>Tianli Yu, Ning Xu and Narendra Ahuja, Shape and View Independent Reflectance Map from Multiple Views, ECCV 2004, LNCS 3024, pp. 602-616, May 11-14, Prague. <a href="../../publications/ECCV2004Tianli.pdf">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=991</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>3D Surfaces and Illumination from Stereo and Shading</title>
		<link>http://vision.ai.uiuc.edu/?p=985</link>
		<comments>http://vision.ai.uiuc.edu/?p=985#comments</comments>
		<pubDate>Fri, 08 May 2009 02:27:25 +0000</pubDate>
		<dc:creator>avinash</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[Reflectance and Illumination]]></category>
		<category><![CDATA[Synthesis, Rendering and Visualization]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=985</guid>
		<description><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/icons2/proj1_4_1_icon.jpg" alt="" width="160" height="158" />Given multiple images of a scene, estimate the scene surfaces, illumination and/or reflectance map.</p>
<p><span id="more-985"></span></p>
<p align="justify">1. D. Hougen and N. Ahuja, Integration of Stereo and Shape from Shading using Color, Proc. Second International Conf. on Automation, Robotics and Computer Vision, Vol 1,&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/icons2/proj1_4_1_icon.jpg" alt="" width="160" height="158" />Given multiple images of a scene, estimate the scene surfaces, illumination and/or reflectance map.</p>
<p><span id="more-985"></span></p>
<p align="justify">1. D. Hougen and N. Ahuja, Integration of Stereo and Shape from Shading using Color, Proc. Second International Conf. on Automation, Robotics and Computer Vision, Vol 1, Singapore, September 15-18 1992, pp. CV-6.6.1 &#8211; CV-6.6.5.</p>
<p>2. D. Hougen and N. Ahuja, Estimation of the Light Source Distribution and its Use in Shape Recovery from Stereo and Shading, 4th Int. Conf. on Computer Vision, Berlin, Germany, May 11-14, 1993, 148-155.</p>
<p>3. Darrell R. Hougen and Narendra Ahuja, Adaptive Polynomial Modelling of the Reflectance Map for Shape Estimation from Stereo and Shading, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, Seattle, WA, June 1994, 991-994.</p>
<p>4. D. Hougen and N. Ahuja, Shape from Appearance: A Statistical Approach to Surface Shape Estimation, Proceedings European Conference on Computer Vision, Cambridge, England, April 1996, 421-429.</p>
<p align="justify">5. M. Singh and N. Ahuja, Estimating Light Sources, Indian Conference on Computer Vision, Graphics and Image Processing, New Delhi, India, December 1998, 76-81.<a href="../../abstracts/pub1_4_1_a1298singhmk.htm">Abstract and Full Text</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=985</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>3D surface from multiple views</title>
		<link>http://vision.ai.uiuc.edu/?p=972</link>
		<comments>http://vision.ai.uiuc.edu/?p=972#comments</comments>
		<pubDate>Fri, 08 May 2009 02:07:42 +0000</pubDate>
		<dc:creator>avinash</dc:creator>
				<category><![CDATA[3D Object Modeling]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=972</guid>
		<description><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/project_new/head_animated.gif" alt="" width="160" height="120" />Given multiple calibrated pictures of a real world object captured from different viewpoints, reconstruct a three-dimensional model of the object.</p>
<p><span id="more-972"></span></p>
<ol>
<li>Tianli Yu, Ning Xu and Narendra Ahuja, Reconstructing a Dynamic Surface from Video Sequences Using Graph Cuts in 4D Space-Time, Pattern&#8230;</li></ol>]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/project_new/head_animated.gif" alt="" width="160" height="120" />Given multiple calibrated pictures of a real world object captured from different viewpoints, reconstruct a three-dimensional model of the object.</p>
<p><span id="more-972"></span></p>
<ol>
<li>Tianli Yu, Ning Xu and Narendra Ahuja, Reconstructing a Dynamic Surface from Video Sequences Using Graph Cuts in 4D Space-Time, Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 2, pp. 245-248 08, 2004 Cambridge UK <a href="../../publications/ICPR2004Tianli.pdf">Full Text</a></li>
<li>Ning Xu, Tianli Yu and Narendra Ahuja. Shape from color consistency using node cut. In Proceedings of Asian Conference on Computer Vision, Jeju Island, Korea. January 2004. <a href="../../%7Eningxu/publications.html">Abstract and Full Text</a></li>
<li>Ning Xu and Narendra Ahuja. A Three-view Matching Algorithm Considering Foreshortening Effects. In Proceedings of International Conference on Computer Vision, Pattern Recognition and Image Processing, pp. 635-638, Cary, NC. September 2003. <a href="../../%7Eningxu/publications.html">Abstract and Full Text </a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=972</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dense Stereo Maping Using Kernel Maximum Likelihood Estimation</title>
		<link>http://vision.ai.uiuc.edu/?p=964</link>
		<comments>http://vision.ai.uiuc.edu/?p=964#comments</comments>
		<pubDate>Fri, 08 May 2009 01:58:24 +0000</pubDate>
		<dc:creator>avinash</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[Surfaces from Spatial Stereo]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=964</guid>
		<description><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/project_new/densestereo.jpg" alt="" width="125" height="125" />A robust stereo matching algorithm using kernel representation of the probability density functions (pdf&#8217;s) of the sources that generate the stereoscopic images. Matching is done using either a Maximum Likelihood framework or using correlation in the pdf domain and an&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" src="http://vision.ai.uiuc.edu/project_new/densestereo.jpg" alt="" width="125" height="125" />A robust stereo matching algorithm using kernel representation of the probability density functions (pdf&#8217;s) of the sources that generate the stereoscopic images. Matching is done using either a Maximum Likelihood framework or using correlation in the pdf domain and an MRF prior to model the disparity function.</p>
<p><span id="more-964"></span></p>
<ol>
<li>A. Jagmohan, M. Singh, and N. Ahuja, Dense Two View Stereo Matching Using Kernel Maximum Likelihood Estimation, Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 3, pp. 28-31, 08, 2004 Cambridge UK <a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=1334461&amp;isnumber=29387&amp;punumber=9258&amp;k2dockey=1334461@ieeecnfs&amp;query=%28%09dense+stereo+matching+using+kernel+maximum+likelihood+estimation%3Cin%3Emetadata%29&amp;pos=0">IEEE Reference</a></li>
<li>M. Singh, H. Arora and N. Ahuja, Robust Registration and Tracking Using Kernel Density Correlation, 2004 Conference on Computer Vision and Pattern Recognition Workshop on Image and Video Registration, CVPRW&#8217;04 Volume 11, p. 174, 06 27 &#8211; 07 02, 2004, Washington, D.C., USA <a href="../../publications/cvpr2004_msingh_arora_ahuja_kdc.pdf">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=964</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Surfaces from Binocular Spatial Stereo</title>
		<link>http://vision.ai.uiuc.edu/?p=946</link>
		<comments>http://vision.ai.uiuc.edu/?p=946#comments</comments>
		<pubDate>Fri, 08 May 2009 00:58:42 +0000</pubDate>
		<dc:creator>avinash</dc:creator>
				<category><![CDATA[3D Computer Vision: Scene Reconstruction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Surfaces from Spatial Stereo]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=946</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj1_3_3_icon.jpg" border="2" alt="" width="128" height="143" /></p>
<p>Given multiple images of a scene, taken from multiple cameras and different viewpoints, find the 3D depth map and surfaces</p>
<p><span id="more-946"></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">W. Hoff and N. Ahuja,   Surfaces from Stereo, Proc. DARPA Image Understanding Workshop, Miami,   December 9-10, 1985, 98-106. <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=16709">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">
</p><p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">W. Hoff&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj1_3_3_icon.jpg" border="2" alt="" width="128" height="143" /></p>
<p>Given multiple images of a scene, taken from multiple cameras and different viewpoints, find the 3D depth map and surfaces</p>
<p><span id="more-946"></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">W. Hoff and N. Ahuja,   Surfaces from Stereo, Proc. DARPA Image Understanding Workshop, Miami,   December 9-10, 1985, 98-106. <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=16709">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">W. Hoff and N. Ahuja,   Surfaces from Stereo, 8th International Conference on Pattern Recognition, Paris, France, October 28-31, 1986, 516-518.</p>
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">J. Weng, N. Ahuja and T. Huang, Two-View Matching, Second   International Conference on Computer Vision, Tarpon Springs, December 5-8,   1988, 64-73. <a href="../../abstracts/pub1_3a_1_a1288wah.htm">Abstract   and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">W. Hoff and N. Ahuja,   Extracting Surfaces from Stereo Images: An Integrated Approach, First   International Conference on Computer Vision, London, England, June 8-11,   1987, 284-294.</p>
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">W. Hoff and N. Ahuja,   Depth from Stereo, IEEE Conf. on Computer Vision and Pattern Recognition, San   Francisco, June 9-13, 1985.</p>
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">
<p class="MsoNormal" style="margin-right: 7.5pt; text-align: justify;">W. Hoff and N. Ahuja,   Generating Range Map from Stereo Images, 4th Scandinavian Conf. on Image   Analysis, Trondheim, Norway, June 18-20, 1985,   761-768.</p>
<p>W. Hoff and N. Ahuja, Surfaces from Stereo: Integrating   Feature Matching, Disparity Estimation and Contour Detection, IEEE Trans.   Pattern Analysis and Machine Intelligence, February 1989, 121-136. <a href="../../abstracts/pub1_3a_1_a0289ha.htm">Abstract and   Full Text</a></p>
<p>E. Altman and N. Ahuja, A Dynamical Systems   Approach to Integration in Stereo, Proc. DARPA Image Understanding Workshop,   Pittsburgh, Sep 11-13, 1990, 423-427.</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=946</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bandwidth Selection for Kernel Density Estimators</title>
		<link>http://vision.ai.uiuc.edu/?p=937</link>
		<comments>http://vision.ai.uiuc.edu/?p=937#comments</comments>
		<pubDate>Thu, 07 May 2009 23:20:07 +0000</pubDate>
		<dc:creator>Tim</dc:creator>
				<category><![CDATA[Image Structure Detection and Representation]]></category>
		<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Statistical Models]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=937</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" src="../../project_new/bandwidth.jpg" alt="" width="128" height="160" /></p>
<p>A regression-based model which admits a realistic framework for automatically choosing bandwidth parameters which minimizes a global error criterion. This is used for automatic segmentation of images at any input resolution scale (for e.g., the wavelet decomposition scale).<span id="more-937"></span></p>
<ol>
<li>M. Singh and&#8230;</li></ol>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" src="../../project_new/bandwidth.jpg" alt="" width="128" height="160" /></p>
<p>A regression-based model which admits a realistic framework for automatically choosing bandwidth parameters which minimizes a global error criterion. This is used for automatic segmentation of images at any input resolution scale (for e.g., the wavelet decomposition scale).<span id="more-937"></span></p>
<ol>
<li>M. Singh and N. Ahuja, &#8220;Regression based Bandwidth Selection for Segmentation using Parzen Windows&#8221;, in Ninth IEEE International Conference in Computer Vision, Proceedings, vol. 1, pp. 2-9, Oct. 2003, Nice, France. <a href="../../publications/iccv2003_msingh_ahuja_regression_segmentation.pdf">Full Text</a></li>
<li>Singh and N. Ahuja, Mean-Shift Segmentation with Wavelet-based Bandwidth Selection, IEEE Workshop on Applications in Computer Vision, pp. 43-50, Dec. 3-4, 2002, Florida. <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1182154">Full Text</a></li>
</ol>
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" /><!--Session data--><br />
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=937</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Estimation and Segmentation of Images Using Parametric Image Models</title>
		<link>http://vision.ai.uiuc.edu/?p=924</link>
		<comments>http://vision.ai.uiuc.edu/?p=924#comments</comments>
		<pubDate>Thu, 07 May 2009 22:27:50 +0000</pubDate>
		<dc:creator>Tim</dc:creator>
				<category><![CDATA[Image Structure Detection and Representation]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Statistical Models]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=924</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" src="../../project_new/estimationsegment.jpg" alt="" width="128" height="152" /></p>
<p>Models of spatial variation in images are central to a large number of low-level computer vision problems including  egmentation, registration, and 3D structure detection. Often, images are represented using parametric models to characterize (noise-free) image variation, and, additive noise. However,&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" src="../../project_new/estimationsegment.jpg" alt="" width="128" height="152" /></p>
<p>Models of spatial variation in images are central to a large number of low-level computer vision problems including  egmentation, registration, and 3D structure detection. Often, images are represented using parametric models to characterize (noise-free) image variation, and, additive noise. However, the noise model may be unknown and parametric models may only be valid on individual segments of the image. Consequently, we model noise using a nonparametric kernel density estimation framework and use a locally or globally linear parametric model to represent the noise-free image pattern. This results in a  ovel, robust, redescending, M- parameter estimator for the above image model which we call the Kernel Maximum Likelihood estimator (KML). We also provide a provably convergent, iterative algorithm for the resultant optimization problem. The estimation framework is empirically validated on synthetic data and applied to the task of range image segmentation.</p>
<p><span id="more-924"></span></p>
<ol>
<li>M. Singh, H. Arora and N. Ahuja, &#8220;A Robust Probabilistic Estimation Framework for Parametric Image Models&#8221;, European Conference on Computer Vision, LNCS 3021, pp. 508-522, 2004. <a href="http://vision.ai.uiuc.edu/~harora1/research/robust_eccv04_paper.pdf">Full Text</a></li>
</ol>
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--><br />
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden"><!--Session data--></input>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=924</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tele-collaboration in interactive augmented environments</title>
		<link>http://vision.ai.uiuc.edu/?p=883</link>
		<comments>http://vision.ai.uiuc.edu/?p=883#comments</comments>
		<pubDate>Thu, 07 May 2009 21:44:21 +0000</pubDate>
		<dc:creator>xianbiao</dc:creator>
				<category><![CDATA[Head-mounted projective display technology]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Synthesis, Rendering and Visualization]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=883</guid>
		<description><![CDATA[<p><img class="alignnone" title="proj9_2_1_icon.jpg" src="http://vision.ai.uiuc.edu/icons2/proj9_2_1_icon.jpg" alt="" width="183" height="134" />To demonstrate the featured capabilities of the HMPD technology, and explore its application for distance collaboration in interactive augmented environments</p>
<p><span id="more-883"></span></p>
<p>&#124; Demo &#124; <a href="../../hong/go.htm">Details</a> &#124; <a href="../../pubs/pub9_2_1.htm">Publications</a> &#124; <a href="../../themes/synthesis.rendering.visualization.htm">Back to Theme</a> &#124;</p>
]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone" title="proj9_2_1_icon.jpg" src="http://vision.ai.uiuc.edu/icons2/proj9_2_1_icon.jpg" alt="" width="183" height="134" />To demonstrate the featured capabilities of the HMPD technology, and explore its application for distance collaboration in interactive augmented environments</p>
<p><span id="more-883"></span></p>
<p>| Demo | <a href="../../hong/go.htm">Details</a> | <a href="../../pubs/pub9_2_1.htm">Publications</a> | <a href="../../themes/synthesis.rendering.visualization.htm">Back to Theme</a> |</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=883</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Head-mounted projective display technology</title>
		<link>http://vision.ai.uiuc.edu/?p=871</link>
		<comments>http://vision.ai.uiuc.edu/?p=871#comments</comments>
		<pubDate>Thu, 07 May 2009 21:40:33 +0000</pubDate>
		<dc:creator>xianbiao</dc:creator>
				<category><![CDATA[Head-mounted projective display technology]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Synthesis, Rendering and Visualization]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=871</guid>
		<description><![CDATA[<p><img class="alignnone" title="proj9_1_1_icon.jpg" src="http://vision.ai.uiuc.edu/icons2/proj9_1_1_icon.jpg" alt="" width="160" height="169" />To optimize a novel visualization device referred to as head-mounted projective display (HMPD), and develop a multi-user interactive workbench with tele-presence capability<span id="more-871"></span></p>
<p>&#124; Demo &#124; <a href="../../hong/hmpd.htm">Details</a> &#124; <a href="../../pubs/pub9_1_1.htm">Publications</a> &#124; <a href="../../themes/synthesis.rendering.visualization.htm">Back to Theme</a> &#124;</p>
]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone" title="proj9_1_1_icon.jpg" src="http://vision.ai.uiuc.edu/icons2/proj9_1_1_icon.jpg" alt="" width="160" height="169" />To optimize a novel visualization device referred to as head-mounted projective display (HMPD), and develop a multi-user interactive workbench with tele-presence capability<span id="more-871"></span></p>
<p>| Demo | <a href="../../hong/hmpd.htm">Details</a> | <a href="../../pubs/pub9_1_1.htm">Publications</a> | <a href="../../themes/synthesis.rendering.visualization.htm">Back to Theme</a> |</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=871</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Object Category Modeling using Interest Points</title>
		<link>http://vision.ai.uiuc.edu/?p=857</link>
		<comments>http://vision.ai.uiuc.edu/?p=857#comments</comments>
		<pubDate>Thu, 07 May 2009 21:34:23 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[object categorization]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=857</guid>
		<description><![CDATA[<table style="height: 99px;" border="0" width="774">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-872" title="5_6_2-22" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/5_6_2-22.jpg" alt="5_6_2-22" width="174" height="114" /></td>
<td style="text-align: justify;" valign="top">
<p>An automatic object detection, localization and segmentation system is proposed for object categories. Object categories are modelled as templates of patches around interest points, encoding both location and appearance information. The automatic segmentation algorithm integrates the localization information with the&#8230;</p></td></tr></tbody></table>]]></description>
			<content:encoded><![CDATA[<table style="height: 99px;" border="0" width="774">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-872" title="5_6_2-22" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/5_6_2-22.jpg" alt="5_6_2-22" width="174" height="114" /></td>
<td style="text-align: justify;" valign="top">
<p>An automatic object detection, localization and segmentation system is proposed for object categories. Object categories are modelled as templates of patches around interest points, encoding both location and appearance information. The automatic segmentation algorithm integrates the localization information with the edge information in the image.</td>
</tr>
</tbody>
</table>
<p><span id="more-857"></span></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=857</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Extracting subimages of an unknown category from a set of images</title>
		<link>http://vision.ai.uiuc.edu/?p=823</link>
		<comments>http://vision.ai.uiuc.edu/?p=823#comments</comments>
		<pubDate>Thu, 07 May 2009 21:21:58 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[object categorization]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=823</guid>
		<description><![CDATA[<table style="height: 52px;" border="0" width="774">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-837" title="sinisa_icon1" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/sinisa_icon1.jpg" alt="sinisa_icon1" width="174" height="82" /></td>
<td style="text-align: justify;" valign="top">Given a set of images, possibly containing objects from an unknown category, determine if a category is present. If a category is present, learn spatial and photometric model of the category. Given an unseen image, segment all occurrences of the&#8230;</td></tr></tbody></table>]]></description>
			<content:encoded><![CDATA[<table style="height: 52px;" border="0" width="774">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-837" title="sinisa_icon1" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/sinisa_icon1.jpg" alt="sinisa_icon1" width="174" height="82" /></td>
<td style="text-align: justify;" valign="top">Given a set of images, possibly containing objects from an unknown category, determine if a category is present. If a category is present, learn spatial and photometric model of the category. Given an unseen image, segment all occurrences of the category.</td>
</tr>
</tbody>
</table>
<p><span id="more-823"></span></p>
<h3>Publications</h3>
<ol>
<li>S. Todorovic and N. Ahuja, Extracting subimages of an unknown category from a set of images,  in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2006), vol. 1, pp. 927-934, New York, NY, 2006 <a href="http://vision.ai.uiuc.edu/papers/TodorovicAhuja_CategoryModeling_CVPR06.pdf" target="_self">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=823</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sparse Lumigraph Relighting by Illumination and Reflectance Estimation from Multi-View Images</title>
		<link>http://vision.ai.uiuc.edu/?p=797</link>
		<comments>http://vision.ai.uiuc.edu/?p=797#comments</comments>
		<pubDate>Thu, 07 May 2009 21:14:10 +0000</pubDate>
		<dc:creator>xianbiao</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[Sparse Lumigraph Relighting by Illumination and Reflectance Estimation from Multi-View Images]]></category>
		<category><![CDATA[Synthesis, Rendering and Visualization]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=797</guid>
		<description><![CDATA[<p><span style="font-family: Times New Roman;"><img class="alignnone" title="relighting.jpg" src="http://vision.ai.uiuc.edu/project_new/relighting.jpg" alt="" width="198" height="153" />A</span> novel relighting approach that does not assume that the illumination is known or controllable<span style="font-family: Times New Roman;">. </span>Instead, we estimate the illumination and texture from multi-view images captured under a single illumination setting, given the object shape.</p>
<p><span id="more-797"></span></p>
<ol>
<li> Tianli Yu, Hongcheng Wang, Narendra Ahuja, Wei-Chao&#8230;</li></ol>]]></description>
			<content:encoded><![CDATA[<p><span style="font-family: Times New Roman;"><img class="alignnone" title="relighting.jpg" src="http://vision.ai.uiuc.edu/project_new/relighting.jpg" alt="" width="198" height="153" />A</span> novel relighting approach that does not assume that the illumination is known or controllable<span style="font-family: Times New Roman;">. </span>Instead, we estimate the illumination and texture from multi-view images captured under a single illumination setting, given the object shape.</p>
<p><span id="more-797"></span></p>
<ol>
<li> Tianli Yu, Hongcheng Wang, Narendra Ahuja, Wei-Chao Chen, Sparse Lumigraph Relight by Illumination and Reflectance Estimation from Multi-View Images, Eurographics Symposium on Rendering (EGSR), 2006 <a href="../../%7Ewanghc/papers/EGSR06_SceneFactor.pdf"> Full Text</a></li>
<li> Tianli Yu, Hongcheng Wang, Narendra Ahuja, Wei-Chao Chen, Sparse Lumigraph Relight by Illumination and Reflectance Estimation from Multi-View Images, Technical Sketch, SIGGRAPH, 2006 <a href="../../%7Ewanghc/papers/siggraph06_sketch.pdf">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=797</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Six-legged Robot Design</title>
		<link>http://vision.ai.uiuc.edu/?p=786</link>
		<comments>http://vision.ai.uiuc.edu/?p=786#comments</comments>
		<pubDate>Thu, 07 May 2009 21:09:38 +0000</pubDate>
		<dc:creator>qingxiong</dc:creator>
				<category><![CDATA[Parallel Processing and Robotics]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[walking robot]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=786</guid>
		<description><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><img class="alignleft" src="../../icons2/proj8_2_1_icon.jpg" alt="http://vision.ai.uiuc.edu/icons2/proj8_2_1_icon.jpg" width="160" height="136" /></p>
<p>J Cocatre-Zilgien, F Delcomyn, Z Ding, J Hart, G Kremesec, L Lu, M Nelson, J Reichler, K Tan, Design and implementation of a pneumatic, six-legged robot with the geometry, number of leg joints, and joint functionality modeled after the American&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><img class="alignleft" src="../../icons2/proj8_2_1_icon.jpg" alt="http://vision.ai.uiuc.edu/icons2/proj8_2_1_icon.jpg" width="160" height="136" /></p>
<p>J Cocatre-Zilgien, F Delcomyn, Z Ding, J Hart, G Kremesec, L Lu, M Nelson, J Reichler, K Tan, Design and implementation of a pneumatic, six-legged robot with the geometry, number of leg joints, and joint functionality modeled after the American Cockroach</p>
<p><span id="more-786"></span>|<a href="../../jmh/Hexapod-BEAC99.mpg">Demo</a>|<a href="../../jmh/marks-hex-tst.htm">Details</a>|<a href="../../jmh/ITG-Forum/hexapod-ppt.htm">Presentation</a>|<a href="../../jmh/poster.html">Poster</a>|<a href="../../pubs/pub8_2_1.htm">Publications</a>|</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=786</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Videoshop: A New Framework for Video Editing in Gradient Domain</title>
		<link>http://vision.ai.uiuc.edu/?p=771</link>
		<comments>http://vision.ai.uiuc.edu/?p=771#comments</comments>
		<pubDate>Thu, 07 May 2009 21:06:38 +0000</pubDate>
		<dc:creator>xianbiao</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[Synthesis, Rendering and Visualization]]></category>
		<category><![CDATA[VideoShop: A New Framework for Video Editing]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=771</guid>
		<description><![CDATA[<p><img class="alignnone" title="face1.jpg" src="http://vision.ai.uiuc.edu/project_new/face1.jpg" mce_src="http://vision.ai.uiuc.edu/project_new/face1.jpg" alt="" width="159" height="132"/>A new framework for seamless video editing in gradient domain with the objective of replacing video segments in one video sequence from those in another, composing video sequences by juxtaposing multiple other video sequences, etc.</p>
<p><img src="http://vision.ai.uiuc.edu/wordpress/wp-includes/js/tinymce/plugins/wordpress/img/trans.gif" mce_src="http://vision.ai.uiuc.edu/wordpress/wp-includes/js/tinymce/plugins/wordpress/img/trans.gif" alt="" class="mceWPmore mceItemNoResize" title="More..."/></p>
<p>1.Hongcheng Wang, Ning Xu, Ramesh Raskar&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone" title="face1.jpg" src="http://vision.ai.uiuc.edu/project_new/face1.jpg" mce_src="http://vision.ai.uiuc.edu/project_new/face1.jpg" alt="" width="159" height="132">A new framework for seamless video editing in gradient domain with the objective of replacing video segments in one video sequence from those in another, composing video sequences by juxtaposing multiple other video sequences, etc.</p>
<p><img src="http://vision.ai.uiuc.edu/wordpress/wp-includes/js/tinymce/plugins/wordpress/img/trans.gif" mce_src="http://vision.ai.uiuc.edu/wordpress/wp-includes/js/tinymce/plugins/wordpress/img/trans.gif" alt="" class="mceWPmore mceItemNoResize" title="More..."></p>
<p>1.Hongcheng Wang, Ning Xu, Ramesh Raskar and Narendra Ahuja, Videoshop: A New Framework for Video Editing in Spatio-Temporal Gradient Domain, IEEE, Video Proceedings, International Conference on Computer Vision and Pattern Recognition, 2005&nbsp; <a mce_href="http://vision.ai.uiuc.edu/~wanghc/papers/editing-graphical_model.pdf" href="http://vision.ai.uiuc.edu/%7Ewanghc/papers/editing-graphical_model.pdf">Full Text</a><br />
2. Hongcheng Wang, Ramesh Raskar and Narendra Ahuja, Seamless Video Editing, Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on, Aug. 2004, pp. III-858-861&nbsp; <a mce_href="http://vision.ai.uiuc.edu/~wanghc/papers/icpr04_editing.pdf" href="http://vision.ai.uiuc.edu/%7Ewanghc/papers/icpr04_editing.pdf">Full Text</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=771</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Path Planning</title>
		<link>http://vision.ai.uiuc.edu/?p=774</link>
		<comments>http://vision.ai.uiuc.edu/?p=774#comments</comments>
		<pubDate>Thu, 07 May 2009 21:05:09 +0000</pubDate>
		<dc:creator>qingxiong</dc:creator>
				<category><![CDATA[Atonomous navigation]]></category>
		<category><![CDATA[Parallel Processing and Robotics]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=774</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj8_1_2_icon.jpg" border="2" alt="" width="160" height="120" /></p>
<p> </p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"></h2>
<p>J Chuang, Y Hwang, R Ruff</p>
<p>Given a mobile object required to move from a source location/orientation to a destination location/orientation, compute a path that it can follow and the orientation and velocity values it must assume along the path to&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj8_1_2_icon.jpg" border="2" alt="" width="160" height="120" /></p>
<p><!--mstheme--> <!--mstheme--></p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><!--mstheme--><!--mstheme--></h2>
<p>J Chuang, Y Hwang, R Ruff</p>
<p>Given a mobile object required to move from a source location/orientation to a destination location/orientation, compute a path that it can follow and the orientation and velocity values it must assume along the path to efficiently and smoothly move from the source to the destination.<span id="more-774"></span></p>
<p>| <a href="../../labhomepage/public_html/projects/8_1_2/pathplanning.htm#demo"> Demo</a> | <a href="../../labhomepage/public_html/projects/8_1_2/pathplanning.htm">Details </a> | <a href="../../pubs/pub8_1_2.htm">Publications</a>|</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=774</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New publications</title>
		<link>http://vision.ai.uiuc.edu/?p=765</link>
		<comments>http://vision.ai.uiuc.edu/?p=765#comments</comments>
		<pubDate>Thu, 07 May 2009 20:58:53 +0000</pubDate>
		<dc:creator>emre</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=765</guid>
		<description><![CDATA[<ul>
<li>Qingxiong Yang, Kar-Han Tan and Narendra Ahuja, Real-time O(1) Bilateral Filtering,  IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2009.</li>
<li>C. Gao, H. Hua and N. Ahuja, A Hemispherical Imaging Camera, Computer Vision and Image Understanding, 2009, in press. <a href="http://www.sciencedirect.com/science?_ob=ArticleURL&#38;_udi=B6WCX-4VV2NR1-1&#38;_user=10&#38;_rdoc=1&#38;_fmt=&#38;_orig=search&#38;_sort=d&#38;view=c&#38;_acct=C000050221&#38;_version=1&#38;_urlVersion=0&#38;_userid=10&#38;md5=9dc7d44177b67c361853fc57d76b1682">Abstract</a> <a href="../publications/Gao-CVIU09.pdf">Full&#8230;</a></li></ul>]]></description>
			<content:encoded><![CDATA[<ul>
<li>Qingxiong Yang, Kar-Han Tan and Narendra Ahuja, Real-time O(1) Bilateral Filtering,  IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2009.</li>
<li>C. Gao, H. Hua and N. Ahuja, A Hemispherical Imaging Camera, Computer Vision and Image Understanding, 2009, in press. <a href="http://www.sciencedirect.com/science?_ob=ArticleURL&amp;_udi=B6WCX-4VV2NR1-1&amp;_user=10&amp;_rdoc=1&amp;_fmt=&amp;_orig=search&amp;_sort=d&amp;view=c&amp;_acct=C000050221&amp;_version=1&amp;_urlVersion=0&amp;_userid=10&amp;md5=9dc7d44177b67c361853fc57d76b1682">Abstract</a> <a href="../publications/Gao-CVIU09.pdf">Full Text</a></li>
<li>S. Todorovic and N. Ahuja, Texel-based Texture Segmentation, International Conference on Computer Vision (ICCV), Kyoto, Japan, September-October 2009.</li>
<li>E. Akbas and N. Ahuja, From ramp discontinuities to segmentation tree, 9th Asian Conference on Computer Vision (ACCV), Xi’an, China, September 2009. <a href="http://vision.ai.uiuc.edu/publications/accv2009_akbas_ahuja.pdf" target="_blank">Full text</a></li>
<li>Q.Kong, A.Kumar, N.Ahuja and Y.Liu, Robust Segmentation of Freight Containers in Train Monitoring Videos, Workshop on Applications of  Computer Vision(WACV), Snowbird, Utah, December 2009.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=765</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sketch-Based Object Selection in Images</title>
		<link>http://vision.ai.uiuc.edu/?p=754</link>
		<comments>http://vision.ai.uiuc.edu/?p=754#comments</comments>
		<pubDate>Thu, 07 May 2009 20:54:03 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[HCI]]></category>
		<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=754</guid>
		<description><![CDATA[<table style="height: 142px;" border="0" width="774">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-772" title="proj5_3_1_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_3_1_icon.jpg" alt="proj5_3_1_icon" width="160" height="116" /></td>
<td style="text-align: justify;" valign="top">To assist humans in referring to specific parts of an image, and performing desired operations on these parts, through natural-like interpersonal communication, e.g. by freely drawing sketches over the image which mean specific editorial operations such as move, expand and&#8230;</td></tr></tbody></table>]]></description>
			<content:encoded><![CDATA[<table style="height: 142px;" border="0" width="774">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-772" title="proj5_3_1_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_3_1_icon.jpg" alt="proj5_3_1_icon" width="160" height="116" /></td>
<td style="text-align: justify;" valign="top">To assist humans in referring to specific parts of an image, and performing desired operations on these parts, through natural-like interpersonal communication, e.g. by freely drawing sketches over the image which mean specific editorial operations such as move, expand and delete.</td>
</tr>
</tbody>
</table>
<p><span id="more-754"></span></p>
<table border="0" cellspacing="0" cellpadding="0" width="100%">
<tbody>
<tr>
<td>
<h2><span style="color: #003399;">Applications</span></h2>
</td>
<td valign="top"></td>
</tr>
</tbody>
</table>
<p>A sketching interface is an appropriate choice for image editing because the pen and pencil are ubiquitous tools for creating graphics, and sketch-based interaction would be very natural for use in handheld personal digital assistants and the emerging class of �tablet� computers. A typical example of the kind of sketch allowed by the selection tool we will describe is shown in figure 1(a). With the sketch shown, our selection tool was able to extract the head of the crane, as shown in figure 1(b). Despite the fairly convoluted profile of the crane�s head, which consists of both sharp boundaries along the beak and complicated boundaries around the crane�s crown, our selection tool needed only the simple sketch shown, which is natural and can be drawn quickly, without much skill and dexterity. We believe that this tool will enable sophisticated and high-fidelity graphical selection and manipulation operations in situations where the user may not have much time, such as during a live sports broadcast, or may not have much dextrous control, such as on a mobile handheld device, or may want to avoid the monotony and tedium of selection, without compromising the quality of the selection.</p>
<table border="0" cellspacing="0" cellpadding="0" width="640">
<tbody>
<tr>
<td>
<h2><a name="Algorithm"></a><span style="color: #003399;">Algorithm</span></h2>
</td>
<td valign="top"></td>
</tr>
<tr>
<td valign="top">
<table style="height: 100%;" border="0" width="100%">
<tbody>
<tr>
<td>
<h3><span>Step 1: Draw Sketch</span></h3>
<p>User draws sketch over the image. Users can make three types of sketch strokes:</p>
<ul>
<li> Points</li>
<li> Lines</li>
<li> Regions (closed loops)</li>
</ul>
</td>
</tr>
<tr>
<td>
<h3><span>Step 2: Image Segmentation</span></h3>
<p>Compute a global segmentation of the image. We can use any existing segmentation method. The segment map illustrated is computed by recursive binary-split vector quantization in the (x,y,rgb) space.</td>
</tr>
<tr>
<td>
<h3><span>Step 3: Sketch Processing</span></h3>
<p>Use sketch to select a number of segments from the image that form the foreground object.</td>
</tr>
<tr>
<td>
<h3><span>Step 4: Simplicial Decomposition</span></h3>
<p>We want to decompose the image into small triangles so they can be processed individually. We can use:</p>
<ul>
<li> Delaunay Triangulation of the segment centroids, [1] or</li>
<li> Use the segmentation to compute a triangulation [2].</li>
</ul>
</td>
</tr>
<tr>
<td>
<h3><span>Step 5: Object Extraction</span></h3>
<p>Each triangle is processed separately as follows:</p>
<ul>
<li> Find a linear approximation to the local object boundary.</li>
<li> Re-segment the image within the boundary using the same segmentation algorithm as in step 2.</li>
<li> Classify the new segments into foreground/background using the linear boundary.</li>
<li> Pick a window of pixels around each of the new segment centroids and use them as samples of foreground/background colors</li>
<li> Estimate alpha channel</li>
</ul>
</td>
</tr>
<tr>
<td>
<h3><span>Step 6: Complete Selection</span></h3>
<p>Composite the result from each triangle back into a complete selection.</td>
</tr>
</tbody>
</table>
</td>
<td valign="top"><img src="../../%7Etankh/Selection/fig03.jpg" border="0" alt="" width="436" height="889" /></p>
<p>Figure 2: Steps in selection algorithm.</td>
</tr>
</tbody>
</table>
<table border="0" cellspacing="0" cellpadding="0" width="640">
<tbody>
<tr>
<td valign="top">
<h2><a name="Results"></a><span style="color: #003399;">Results</span></h2>
</td>
<td valign="bottom"></td>
</tr>
<tr>
<td valign="bottom"><img src="../../%7Etankh/Selection/fig07b.jpg" border="0" alt="" width="320" height="230" /></td>
<td valign="bottom"><img src="../../%7Etankh/Selection/fig07c.jpg" border="0" alt="" width="320" height="233" /></td>
</tr>
<tr>
<td>
<h3>Sketch</h3>
<p><img src="../../%7Etankh/Selection/crane_1.jpg" border="0" alt="" width="320" height="350" /></td>
<td>
<h3>Resulting Selection</h3>
<p><img src="../../%7Etankh/Selection/crane_5.jpg" border="0" alt="" width="320" height="350" /></td>
</tr>
</tbody>
</table>
<p>Figure 3: Results computed with the Delaunay Triangulation decomposition method.</p>
<h3>Publications</h3>
<ol>
<li>Kar-Han Tan and Narendra Ahuja. Selecting Objects with Freehand Sketches, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 337-344.  <a href="http://vision.ai.uiuc.edu/~tankh/Selection/selection-iccv2001.pdf">Full Text</a></li>
<li> Kar-Han Tan and Narendra Ahuja. A Representation for Image Structure and its Application in Object Selection with Freehand Sketches. In Proceedings IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2001. <a href="http://vision.ai.uiuc.edu/~tankh/Selection/selection-cvpr2001.pdf" target="_self">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=754</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Videoshop: A New Framework for Video Editing in Gradient Domain</title>
		<link>http://vision.ai.uiuc.edu/?p=753</link>
		<comments>http://vision.ai.uiuc.edu/?p=753#comments</comments>
		<pubDate>Thu, 07 May 2009 20:53:30 +0000</pubDate>
		<dc:creator>xianbiao</dc:creator>
				<category><![CDATA[Reseach Themes]]></category>
		<category><![CDATA[Synthesis, Rendering and Visualization]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=753</guid>
		<description><![CDATA[<p><img class="alignnone" title="face1.jpg" src="http://vision.ai.uiuc.edu/project_new/face1.jpg" alt="" width="159" height="132" /></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">A new framework for seamless video editing in gradient domain with the objective of replacing video segments in one video sequence from those in another, composing video sequences by juxtaposing multiple other video sequences, etc.</p>
<p><span id="more-753"></span></p>
<ol>
<li>
<p align="justify"><a href="../../%7Ewanghc">Hongcheng Wang</a>, <a href="../../%7Eningxu">Ning Xu</a>, <a href="http://www.merl.com/people/raskar/">Ramesh Raskar</a> and&#8230;</p></li></ol>]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone" title="face1.jpg" src="http://vision.ai.uiuc.edu/project_new/face1.jpg" alt="" width="159" height="132" /></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">A new framework for seamless video editing in gradient domain with the objective of replacing video segments in one video sequence from those in another, composing video sequences by juxtaposing multiple other video sequences, etc.</p>
<p><span id="more-753"></span></p>
<ol>
<li>
<p align="justify"><a href="../../%7Ewanghc">Hongcheng Wang</a>, <a href="../../%7Eningxu">Ning Xu</a>, <a href="http://www.merl.com/people/raskar/">Ramesh Raskar</a> and <a href="../../ahuja.html">Narendra Ahuja</a>, Videoshop: A New Framework for Video Editing in Spatio-Temporal Gradient Domain, IEEE, Video Proceedings, International Conference on Computer Vision and Pattern Recognition, 2005</p>
</li>
<li>
<p align="justify"><a href="../../%7Ewanghc">Hongcheng Wang</a>, <a href="http://www.merl.com/people/raskar/"> Ramesh Raskar</a> and <a href="../../ahuja.html">Narendra Ahuja</a>, Seamless Video Editing, Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on, Aug. 2004, pp. III-858-861</p>
</li>
</ol>
<p>Related projects:  <p><a href='http://vision.ai.uiuc.edu/?p=771'><h3>Videoshop: A New Framework for Video Editing in Gradient Domain</h3></a><br style='clear:both'><a href='http://vision.ai.uiuc.edu/?p=771'>Read more...</a><br><br><br></p></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=753</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Interview with Dr. Ahuja</title>
		<link>http://vision.ai.uiuc.edu/?p=742</link>
		<comments>http://vision.ai.uiuc.edu/?p=742#comments</comments>
		<pubDate>Thu, 07 May 2009 20:46:36 +0000</pubDate>
		<dc:creator>emre</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=742</guid>
		<description><![CDATA[<p>An interview with Prof. Ahuja appered in the &#8220;Leading Expert&#8221; series on the <a href="http://www.ece.uiuc.edu" target="_blank">ECE</a> department&#8217;s web page.</p>
<p><span id="more-742"></span></p>
<p>Narendra Ahuja,  Donald Biggar Willet Professor of Engineering<img class="alignright" src="http://www.ece.illinois.edu/images/photos/140/n-ahuja.jpg" alt="Narendra Ahuja" width="140" height="200" /></p>
<p>Leading Expert in Computer Vision</p>
<p><em><strong>Q: </strong>What is your area of expertise? </em><br />
<strong>A: </strong> My area is computer vision, a&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p>An interview with Prof. Ahuja appered in the &#8220;Leading Expert&#8221; series on the <a href="http://www.ece.uiuc.edu" target="_blank">ECE</a> department&#8217;s web page.</p>
<p><span id="more-742"></span></p>
<p>Narendra Ahuja,  Donald Biggar Willet Professor of Engineering<img class="alignright" src="http://www.ece.illinois.edu/images/photos/140/n-ahuja.jpg" alt="Narendra Ahuja" width="140" height="200" /></p>
<p>Leading Expert in Computer Vision</p>
<p><em><strong>Q: </strong>What is your area of expertise? </em><br />
<strong>A: </strong> My area is computer vision, a field related to image processing, pattern recognition, sensing, and robotics. The main goal is to automatically understand the contents of images and video&#8211;like human vision does routinely&#8211;which is why the field is also called “image understanding.” Specific tasks range from very simple and general ones like estimating whether an orange on the tree is ripe to eat or is it within reach, to very complex ones like assigning semantic and possibly subjective labels such as “an attractive outfit” or “that serene view of the skies.”</p>
<p><em><strong>Q: </strong>Give me a brief synopsis of your education and career. </em><br />
<strong>A: </strong> My bachelor’s is in electronics engineering, my master’s is in electrical communication engineering, and my PhD is in computer science. My bachelor’s is from the Birla Institute of Technology and Science at Pilani, India, my master’s is from the Indian Institute of Science in Bangalore, India, and my PhD is from the University of Maryland, College Park. I grew up not far from the Taj Mahal, about 30 or 40 miles away. After my PhD, I came to Illinois.</p>
<p><em><strong>Q: </strong>You have been at Illinois since 1979. What do you enjoy most about being here? </em><br />
<strong>A: </strong> The longest I have been at any one place is here at Illinois! I never thought I would be here this long. But it’s been one pleasant journey. I like the supportive environment UIUC provides. Urbana-Champaign offers a lifestyle conducive to research. You don’t have to spend an hour and half each way to work. Collegial department, great colleagues, and wonderful students.</p>
<p><em><strong>Q: </strong>Why did you become an engineer? </em><br />
<strong>A: </strong> When I was growing up, engineering was one of the most desired professions. If you were doing well in school, you basically went into engineering or medicine. I knew definitely that I didn’t want to become a doctor. I didn’t have any interest in drugging a frog unconscious and dissecting it, which the biology students did regularly. And I don’t like the sight of misery and suffering, although the thought of being able to do something about it was very enticing. Engineering appealed to me on the other hand. I always liked physics when I was in school. Chemistry never really appealed to me as much, nor were my friends passionate about it! I basically made a choice between the wet sciences and the dry sciences, in favor of the dry.</p>
<p><em><strong>Q: </strong>How did you become interested in researching this particular field? </em><br />
<strong>A: </strong> When I was thinking about the PhD, we had a professor from Maryland, Azriel Rosenfeld, visit my institute in Bangalore. He was a founder of the field of computer vision. He had come to give a talk, and I thought, this is as good a field as anything I have considered so far. I was also doing my master’s thesis in speech, a reasonably related area. I wrote to him and came to Maryland, and from then on it has been just a tunnel.</p>
<p><em><strong>Q: </strong> What keeps you interested in the field? </em><br />
<strong>A: </strong> The challenge posed by the fact that vision is indeed possible. Without a living proof of it in biology, the feasibility of vision would probably have been a controversial notion. It is a formidable challenge to engineering. It will take several generations of many, many people to get even close to the stage where we may duplicate it. If you don’t pay close attention to our ability to see, it seems like an easy task. However, it’s not only difficult by itself, and by that I mean the sensing of light and converting it into something meaningful. But it’s also difficult because it involves lots of bits of knowledge that come from other modalities of sensing and knowledge acquired in the past. So there is this top-down element that regulates how we interpret the sensory signals that are coming in. It keeps me interested because it keeps me busy trying to solve the next problem.</p>
<p><em><strong>Q: </strong>Tell me about a research accomplishment you’re proud of? </em><br />
<strong>A: </strong> I have had different periods of emphases on different groups of problems. I have worked on the integration of diverse information sources. We have different cues coming in from the three dimensional world, from texture to stereo to motion to color. So the engineering design question that has run through my work is how to analyze them individually and fuse them together into a holistic construct. In the early ’90s, I had sort of an intellectual accident, and we wound up developing a camera called the omnifocus camera. The whole field of new camera designs has since been an active research area for many researchers across the world. The camera we developed was able to produce a 3D panoramic image with every object in focus along with estimates of depth. We have since developed about eight more cameras. One of these cameras acquires a 360 × 170 degree image, which records a whole hemisphere of an image. You can come back to the recorded video and watch any specific part of it at your leisure. You can break it down by sections of the hemisphere and replay an event that may have occurred. One of these cameras is installed in the main branch of the Busey Bank in Urbana and another one is at work in Missouri. More recently, we have found a way of discovering visual themes present in large sets of images that help unify diverse appearances in terms of a simple explanation.</p>
<p><em><strong>Q: </strong>What do you enjoy most about teaching? </em><br />
<strong>A: </strong> Teaching keeps giving you new ideas, in addition to the opportunities to enjoy the moments when you see that the students are learning and understanding, and then using and extending that knowledge. I’ve had roughly 45 PhD students. Often they start with very little, and they falter and run into blind alleys and then suddenly take off. And then they go and do their own things. It’s fun to see them spread their wings and fly. They teach you, too, when you are teaching them. Research comes with teaching, the two are interdependent. Research has a major impact on the contents of the courses and classroom teaching.</p>
<p><em><strong>Q: </strong> What role do students play in your research? </em><br />
<strong>A: </strong> They are the central pillars of my research. I enjoy my meetings with them, discussing issues, raising questions, planning investigations, and of course their coming back and reporting the results. It’s a weekly cycle. We look at what happened last week, what this week’s results showed us, and then plan how to move forward.</p>
<p><em><strong>Q: </strong>What award have you won that is the most meaningful? </em><br />
<strong>A: </strong> I was probably most influenced by the very first national level award I won in the final year of my high school. I actually received two different awards that year, both from the president of India. For one, I got a scholarship for subsequent six years of my education. The other one was for being a President’s Scout&#8211;I think it is called Eagle Scout in the US. I got to shake hands with the president, which is etched in my mind because he was an eminent philosopher besides being the president. The other awards that mean a lot to me are those that come from peers and students because it means that those who are affected by it the most feel that I am doing something worthwhile.</p>
<p><em><strong>Q: </strong> What does the future hold? </em><br />
<strong>A: </strong> Computer vision has come an incredibly long way. The theory and the applications are both flourishing. A lot has been accomplished. And of course, cameras are everywhere now. However, compared to what still needs to be done, we’ve only scratched the surface so far. It’s just an extremely large problem. So we can expect a lot of excitement to come our way in this field for a very long time to come.</p>
<p><em><strong>Q: </strong> What else do you hope to accomplish with your research? </em><br />
<strong>A: </strong> There are so many problems within apparent reach that I hardly know where to begin the list. One such problem is to improve the content of the raw data acquired by the cameras I mentioned before. I would certainly like to push that up, to a level where they don’t miss anything left or right, near or far, dark or light, coarse or detailed. Another major problem I wish to help solve is extraction and precise characterization of natural structure in images and video. Overall, I want to enhance our understanding of the vision process, and apply it to everyday life in ways that themselves need research, and many of the most creative of which, I am sure, are yet to be discovered.</p>
<p>Resource: <a href="http://www.ece.illinois.edu/news/experts/fe-ahuja.html" target="_blank">http://www.ece.illinois.edu/news/experts/fe-ahuja.html</a><em><br />
</em></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=742</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Gesture Recognition</title>
		<link>http://vision.ai.uiuc.edu/?p=725</link>
		<comments>http://vision.ai.uiuc.edu/?p=725#comments</comments>
		<pubDate>Thu, 07 May 2009 20:41:10 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[faces and gestures]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=725</guid>
		<description><![CDATA[<table style="height: 142px;" border="0" width="774">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-736" title="proj5_2_3_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_2_3_icon.gif" alt="proj5_2_3_icon" width="160" height="120" /></td>
<td style="text-align: justify;" valign="top"><span style="color: #000000;">GIST (Gesture Interpretation using Spatio-Temporal analysis) project is an attempt to recognize and interpret sign gestures of American Sign Language from a video sequence based on an integrated method of motion segmentation, shape, size and color. A multi-scale motion segmentation&#8230;</span></td></tr></tbody></table>]]></description>
			<content:encoded><![CDATA[<table style="height: 142px;" border="0" width="774">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-736" title="proj5_2_3_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_2_3_icon.gif" alt="proj5_2_3_icon" width="160" height="120" /></td>
<td style="text-align: justify;" valign="top"><span style="color: #000000;">GIST (Gesture Interpretation using Spatio-Temporal analysis) project is an attempt to recognize and interpret sign gestures of American Sign Language from a video sequence based on an integrated method of motion segmentation, shape, size and color. A multi-scale motion segmentation based on Ahuja&#8217;s New Transform is applied to a video sequence to get motion regions and their correspondence across frames. Regions of interest, such as fingertip, palm and elbow, are extracted from motion segmented images by formulating and solving a constraint satisfaction problem. From these joints, pixel trajectories are extracted. A spatio-temporal analysis based on time-delay neural network is applied to classify these patterns. The ultimate goal of GIST is to allow content-based video retrieval based on video clips and better understanding of motion segmentation.<br />
</span></td>
</tr>
</tbody>
</table>
<p><span id="more-725"></span></p>
<h3>Publications</h3>
<ol>
<li>E. Altman and N. Ahuja and F. Kishino,<strong> </strong>Hand Trajectory Recognition Using Dynamical Systems, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 321-324.</li>
<li>M.-H. Yang and N. Ahuja, Extracting Gestural Motion Trajectory, 1998 IEEE International Conference on Automatic Face and Gesture Recognition, Nara, Japan, April 1998, 10-15.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.55.4880">Abstract</a></li>
<li>M.-H. Yang and N. Ahuja, Extraction and Classification of Motion Patterns for Hand Gesture Recognition, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Santa Barbara, CA, June 1998, 892-897.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.57.4334">Abstract</a></li>
<li>M.H. Yang and N. Ahuja, Gaussian Mixture Modeling of Human Skin Color and Its Applications in Image and Video Databases, Proc. of the SPIE: Storage and Retrieval for Image and Video Databases VI, Vol. 3656, San Jose, CA, Jan. 1999, 458-466.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.56.8637">Abstract</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=725</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Face Detection</title>
		<link>http://vision.ai.uiuc.edu/?p=700</link>
		<comments>http://vision.ai.uiuc.edu/?p=700#comments</comments>
		<pubDate>Thu, 07 May 2009 20:36:34 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[faces and gestures]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=700</guid>
		<description><![CDATA[<table style="height: 142px;" border="0" width="687">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-707" title="proj5_2_1_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_2_1_icon.jpg" alt="proj5_2_1_icon" width="132" height="132" /></td>
<td style="text-align: justify;" valign="top">We present a probabilistic method  to detect human faces using a mixture of factor analyzers.  One characteristic of this mixture model is that it concurrently  performs clustering and, within each cluster,  local dimensionality reduction.   A wide range of face images&#8230;</td></tr></tbody></table>]]></description>
			<content:encoded><![CDATA[<table style="height: 142px;" border="0" width="687">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-707" title="proj5_2_1_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_2_1_icon.jpg" alt="proj5_2_1_icon" width="132" height="132" /></td>
<td style="text-align: justify;" valign="top">We present a probabilistic method  to detect human faces using a mixture of factor analyzers.  One characteristic of this mixture model is that it concurrently  performs clustering and, within each cluster,  local dimensionality reduction.   A wide range of face images that consists of faces in different poses, faces in different expressions and faces under different lighting conditions is used as the training set to capture the variations of human faces.  In order to fit the mixture model to the sample face images,  the parameters are estimated using an EM algorithm. Experimental results show that faces in different poses, with facial expressions, and under different lighting conditions  are detected by our method.</td>
</tr>
</tbody>
</table>
<p><span id="more-700"></span></p>
<h3>Publications</h3>
<ol>
<li>M.-H. Yang, D. Kriegman and N. Ahuja, Detecting Faces in Images: A Survey, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001.  <a href="http://vision.ai.uiuc.edu/mhyang/papers/pami02a.pdf">Full Text</a></li>
<li> M.-H. Yang, D. Kriegman and N. Ahuja, Face Detection Using Multimodal and Density Modes, Computer Vision and Image Understanding, 2001.  <a href="http://vision.ucsd.edu/kriegman-grp/papers/cviu01.pdf">Full Text</a></li>
<li> M.H. Yang and N. Ahuja, Detecting Human Faces in Color Images, IEEE Conference on Computer Vision and Pattern Recognition, Ft. Collins, CO, June 1999, I-466-472.  <a href="https://eprints.kfupm.edu.sa/34642/1/34642.pdf">Full Text</a></li>
<li>M.-H. Yang, N. Ahuja, D. Kriegman, Face Detection using a       Mixture of Factor Analyzers, International       Conference on Image Processing, Kobe, Japan, Oct. 1999, III-612-616.  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=817188">Abstract</a></li>
<li>M.-H.       Yang, N. Ahuja and D. Kriegman, Face Detection Using Mixtures of Linear       Subspaces, Fourth IEEE Int. Conference on Automatic Face and Gesture       Recognition (FG 2000), Grenoble, France, March 2000, 70-76.  <a href="http://vision.ai.uiuc.edu/mhyang/papers/fg2000-paper2.pdf">Full Text</a></li>
<li> D. Roth, M.-H. Yang and N. Ahuja, A SNoW-Based Face Detector, Proc. Advances in Neural Information Processing Systems (NIPS&#8217;99), Denver, CO, December 1999, 862-868.  <a href="http://eprints.kfupm.edu.sa/21084/1/21084.pdf">Full Text</a></li>
<li>M.-H. Yang and N. Ahuja, Detecting Human Faces in Color Images, IEEE International Conference on Image Processing, Vol. 1, Chicago, IL, October 1998, 127-130.</li>
<li>M.-H. Yang, D. Roth and N. Ahuja, Face Detection Using Large Margin Classifiers, Proc. International Conference on Image Processing, Thessaloniki, Greece, October 2001.  <a href="http://eprints.kfupm.edu.sa/21084/1/21084.pdf">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=700</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Collision Detection</title>
		<link>http://vision.ai.uiuc.edu/?p=705</link>
		<comments>http://vision.ai.uiuc.edu/?p=705#comments</comments>
		<pubDate>Thu, 07 May 2009 20:35:27 +0000</pubDate>
		<dc:creator>qingxiong</dc:creator>
				<category><![CDATA[Atonomous navigation]]></category>
		<category><![CDATA[Parallel Processing and Robotics]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=705</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj8_1_1_icon.jpg" border="2" alt="" width="160" height="123" /></p>
<p> </p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a name="8_1_1"></a></h2>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">N Bridwell, J Chuang, F Kishino, Y Kitamura, H Takemura, R Yen, R Chien</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given a set of objects moving in a known fashion and a set of still obstacles, detect or predict collisions between specific pairs of objects.</p>
<p><span id="more-705"></span></p>
<p align="justify">
<h3>Publications</h3>
</p><p align="justify">N. Ahuja, R.&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj8_1_1_icon.jpg" border="2" alt="" width="160" height="123" /></p>
<p><!--mstheme--> <!--mstheme--></p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><!--mstheme--><a name="8_1_1"></a><!--mstheme--></h2>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">N Bridwell, J Chuang, F Kishino, Y Kitamura, H Takemura, R Yen, R Chien</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Given a set of objects moving in a known fashion and a set of still obstacles, detect or predict collisions between specific pairs of objects.</p>
<p><span id="more-705"></span></p>
<p align="justify">
<h3>Publications</h3>
<p align="justify">N. Ahuja, R. T. Chien, R. Yen and N. Bridwell, Interference Detection and Collision Avoidance Among Three Dimensional Objects, Proc. 1st National Conf. on Artificial Intelligence, Stanford University, August 19-21, 1980, 44-48. <a href="../../abstracts/pub8_1_1_a0880ahuja.htm">Abstract</a></p>
<p align="justify">Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino,       Efficient Interference Detection Among Objects Using Octree and Polyhedral       Shape Representations, First Asian Conference on Computer Vision, November       23-25, 1993, Osaka, Japan, 775-779. <a href="../../abstracts/pub8_1_1_a1193kitamura.htm">Abstract</a></p>
<p align="justify">J.-H. Chuang and N. Ahuja, An Analyticaly Tractable Potential Field Model of Free Space and Its Application in Obstacle Avoidance, IEEE Trans. on Systems, Man, and Cybernetics, Vol. 28, No. 5, October 1998, 729-736. <a href="../../abstracts/pub8_1_1_a1098chuang.htm">Abstract</a></p>
<p>Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino, Colliding Face Detection among 3-D Objects using Octree and Polyhedral Shape Representation, Journal of the Robotics Society of Japan, Vol. 14, No. 5, 1996, 733-742. <a href="../../abstracts/pub8_1_1_a0096kitamura.htm">Abstract</a></p>
<p align="justify">Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino, A Study of Interference Detection in Virtual Cooperative Workspace for an Operator Assistant, Workshop on Human Interface, Tokyo, May 24, 1993, vol. 8, no. 2, 247-254. (Part in English, part in Japanese.) Also published as a Technical Report of Technical meeting on Human Communication Engineering of IEICE, HC93-17 (in Japanese). <a href="../../abstracts/pub8_1_1_a0593kitamura.htm">Abstract</a></p>
<p align="justify">Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino, Interference Detection Among Objects for Operator Assistance in Virtual Cooperative Workspace, International Workshop on Robot and Human Communication, Tokyo, Japan, pp. 442-447. IEEE, November 1993.. <a href="../../abstracts/pub8_1_1_a1193kitamura1.htm">Abstract</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=705</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Efficient Algorithms and Architectures</title>
		<link>http://vision.ai.uiuc.edu/?p=702</link>
		<comments>http://vision.ai.uiuc.edu/?p=702#comments</comments>
		<pubDate>Thu, 07 May 2009 20:34:20 +0000</pubDate>
		<dc:creator>qingxiong</dc:creator>
				<category><![CDATA[Efficient hardware layout of tree structures]]></category>
		<category><![CDATA[Parallel Processing and Robotics]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=702</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj7_2_1_icon.jpg" border="2" alt="" width="160" height="129" /></p>
<p> </p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><a name="7_2_1"></a></h2>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">A Choudhary, S Das, C Debrunner, J Patel, M Sharma, S Swamy</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">To develop computationally efficient, e.g., divide-and-conquer or DSP chip oriented, algorithms for different classes of computer vision algorithms, and to define special purpose, e.g., parallel multiprocessor, architectures that efficiently&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj7_2_1_icon.jpg" border="2" alt="" width="160" height="129" /></p>
<p><!--mstheme--> <!--mstheme--></p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><!--mstheme--><a name="7_2_1"></a><!--mstheme--></h2>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">A Choudhary, S Das, C Debrunner, J Patel, M Sharma, S Swamy</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">To develop computationally efficient, e.g., divide-and-conquer or DSP chip oriented, algorithms for different classes of computer vision algorithms, and to define special purpose, e.g., parallel multiprocessor, architectures that efficiently execute the algorithms</p>
<h3><span id="more-702"></span>Publications</h3>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">N. Ahuja and S.       Swamy, Multiprocessor Pyramids for       Bottom-up Image Analysis, Proc. IEEE       Conf. on Pattern Recognition and Image Processing, Las Vegas, June       13-17, 1982, 380-385.       <a href="../../abstracts/pub7_2_1_a0682Ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">N. Ahuja and S.       Swamy, Interleaved Pyramid Architectures       for Bottom-Up Image Analysis, Proc.       6th Int. Conf. on Pattern Recognition, Munich, Germany, October 19-22,       1982, 388-391.               <a href="../../abstracts/pub7_2_1_a1082Ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">
<p class="MsoNormal" style="margin-right: 10px; margin-top: -5px; margin-bottom: -5px;" align="justify">N.       Ahuja, Efficient Planar Embedding of Trees for VLSI       Layouts, IEEE Int. Conf. on Systems,       Man and Cybernetics, Bombay/New Delhi, India, December 30, 1983 &#8211;       January 7, 1984, Proc. 28-32.       <a href="../../abstracts/pub7_2_1_a1283Ahuja.htm">Abstract</a></p>
<p align="justify">N. Ahuja and S. Swamy, Multiprocessor Pyramid Architectures for Bottom-Up Analysis, IEEE Trans. Pattern Analysis and Machine Intelligence, July 1984, 463-475. <a href="../../abstracts/pub7_2_1_a0784Ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-bottom: -5px; margin-right: 10px; margin-top: -5px;" align="justify">N.       Ahuja, Efficient Planar Embedding of Trees for VLSI layouts, Proc.       7th Int. Conf. on Pattern Recognition, Montreal, Canada, July 30 &#8211; Aug       2, 1984, 460-464.       <a href="../../abstracts/pub7_2_1_a0884Ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-bottom: -5px; margin-right: 10px; margin-top: -5px;" align="justify">
N. Ahuja and S. Swamy, Multiprocessor Pyramid Architectures for Bottom-up Image Analysis,       Test and Measurement World, November 1985, 66-76. <a href="../../abstracts/pub7_2_1_a1185Ahuja.htm">Abstract</a></p>
<p align="justify">M. Sharma, J. H. Patel and N. Ahuja, NETRA: A Multiprocessor Computer Architecture for Image Understanding, IEEE Workshop on Computer Architecture for Pattern Analysis and Image Database Management, Miami, November 18-20, 1985, 92-98. <a href="../../abstracts/pub7_2_1_a1185sharma.htm">Abstract</a></p>
<p>N. Ahuja, Efficient Planar Embedding of Trees for VLSI Layouts, Computer Vision, Graphics and Image Processing, May 1986, 189-203.<a href="../../abstracts/pub7_2_1_a0586Ahuja.htm">Abstract</a></p>
<p>A. N. Choudhary, S. Das, N. Ahuja, and J. H. Patel, A Reconfigurable and Hierarchical Parallel Processing Architecture: Performance Results for Stereo Vision, 10th International Conference on Pattern Recognition &#8211; Computer Vision, Vol. 2, Atlantic City, June 1990, 389-393. <a href="../../abstracts/pub7_2_1_a0690Choudhary.htm">Abstract</a></p>
<p>A. N. Choudhary, J. H. Patel, and N. Ahuja, NETRA: A Hierarchical and Partitionable Architecture for Computer Vision Systems, IEEE Trans. Parallel and Distributed Systems, Vol. 4, No. 10, October 1993, 1092-1104. <a href="../../abstracts/pub7_2_1_a1093Choudhary.htm">Abstract</a></p>
<p align="justify">C. H. Debrunner and N. Ahuja, A Bottom-up Minimum Spanning Tree Algorithm for Multiprocessor Pyramid Architectures, Proc. Workshop on Algorithm-Guided Parallel Architectures for Automatic Target Recognition, Leesburg, VA, July 16-18, 1984, 51-78.<a href="../../abstracts/pub7_2_1_a0784Debrunner.htm">Abstract</a></p>
<p align="justify">A. N. Choudhary, S. Das, N. Ahuja and J. Patel, Surface Reconstruction from Stereo Images: An Implementation on a Hypercube Multiprocessor, Fourth Conference on Hypercube Concurrent Computers and Applications, Monterey, CA, March 1989, 1045-1052.<a href="../../abstracts/pub7_2_1_a0389Choudhary.htm">Abstract</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=702</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DSP Algorithms</title>
		<link>http://vision.ai.uiuc.edu/?p=687</link>
		<comments>http://vision.ai.uiuc.edu/?p=687#comments</comments>
		<pubDate>Thu, 07 May 2009 20:28:18 +0000</pubDate>
		<dc:creator>qingxiong</dc:creator>
				<category><![CDATA[Efficient hardware layout of tree structures]]></category>
		<category><![CDATA[Parallel Processing and Robotics]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=687</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj7_1_1_icon.jpg" border="2" alt="" width="160" height="132" /></p>
<p> </p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><span style="color: #f7be84;"></span></h2>
<p style="margin: 5px 0pt; text-indent: 30px;" align="justify">M Aggarwal</p>
<p style="margin: 5px 0pt; text-indent: 30px;" align="justify">To improve the Throughput of Flexible-Precision DSPs via Algorithm Transformation</p>
<p><span id="more-687"></span></p>
<h3>Publications</h3>
<p>M. Aggarwal, N. Shanbhag and N. Ahuja, Improving the Throughput of Flexible-Precision DSPs via Algorithm Transformation, International Conference on Acoustics, Speech, and Signal Processing, Vol. V, Seattle, WA, May 12-15,&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj7_1_1_icon.jpg" border="2" alt="" width="160" height="132" /></p>
<p><!--mstheme--> <!--mstheme--></p>
<h2 style="text-indent: 4px; word-spacing: 0pt; margin-top: 0pt; margin-bottom: 0pt;"><!--mstheme--><span style="color: #f7be84;"><!--mstheme--></span></h2>
<p style="margin: 5px 0pt; text-indent: 30px;" align="justify">M Aggarwal</p>
<p style="margin: 5px 0pt; text-indent: 30px;" align="justify">To improve the Throughput of Flexible-Precision DSPs via Algorithm Transformation</p>
<p><span id="more-687"></span></p>
<h3>Publications</h3>
<p>M. Aggarwal, N. Shanbhag and N. Ahuja, Improving the Throughput of Flexible-Precision DSPs via Algorithm Transformation, International Conference on Acoustics, Speech, and Signal Processing, Vol. V, Seattle, WA, May 12-15, 1998, 3069-3072. <a href="../../abstracts/pub7_1_1_a0598Manoj.htm">Abstract</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=687</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Learning to Recognize 3D Objects</title>
		<link>http://vision.ai.uiuc.edu/?p=644</link>
		<comments>http://vision.ai.uiuc.edu/?p=644#comments</comments>
		<pubDate>Thu, 07 May 2009 20:11:33 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[learning for object recognition]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=644</guid>
		<description><![CDATA[<table style="height: 275px;" border="0" width="816">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-670" title="mismatch1" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/mismatch1.gif" alt="mismatch1" width="265" height="280" /></td>
<td style="text-align: justify;" valign="top">A learning account for the problem of object recognition is developed within the PAC (Probably Approximately Correct) model of learnability. The key assumption underlying this work is that objects can be recognized (or, discriminated) using simple representations in terms of&#8230;</td></tr></tbody></table>]]></description>
			<content:encoded><![CDATA[<table style="height: 275px;" border="0" width="816">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-670" title="mismatch1" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/mismatch1.gif" alt="mismatch1" width="265" height="280" /></td>
<td style="text-align: justify;" valign="top">A learning account for the problem of object recognition is developed within the PAC (Probably Approximately Correct) model of learnability. The key assumption underlying this work is that objects can be recognized (or, discriminated) using simple representations in terms of  &#8220;syntactically&#8221; simple relations over the raw image. Although the potential number of these simple relations could be huge, only a few of them are actually present in each observed image and a fairly small number of those observed is relevant to discriminating an object. We show that these properties can be exploited to yield an efficient learning approach in terms of sample and computational complexity, within the PAC model. No assumptions are needed on the distribution of the observed objects and the learning performance is quantified relative to its past experience. Most importantly, the success of learning an object representation is naturally tied to the ability to represent it as a function of some intermediate representations extracted from the image.We evaluate this approach in a large scale experimental study in which the SNoW learning architecture is used to learn representations for the 100 objects in the Columbia Object Image Database (COIL-100). Experimental results exhibit very good generalization and robustness properties of the SNoW-based method  elative to other approaches. SNoW&#8217;s recognition rate degrades more gracefully when the training data contains fewer views and it shows similar behavior also in some preliminary experiments with partially occluded objects.</td>
</tr>
</tbody>
</table>
<p><span id="more-644"></span></p>
<ol style="text-align: justify;">
<h2 style="text-align: justify;">1. Approach</h2>
<p style="text-align: justify;">Statistical learning theory has had an influence on many applications ranging from classification and object recognition, grouping and segmentation, illumination modeling, scene reconstruction and others. The rising role of learning methods, made possible by significant improvements in computing power and storage, is largely motivated by the realization that explicit modeling of complex phenomena in a messy world cannot be done without a significant role of learning. Learning is required for model and knowledge acquisition as well  as to support generalization and avoid brittleness. However, many statistical and probabilistic learning models require making explicit assumptions, e.g., on the distribution that governs the occurrences of instances in the world. For visual inference problems such as recognition, categorization and detection, making these assumptions seems unrealistic.This work develops a distribution free learning theory account to an archetypical visual recognition problem: object recognition. The problem is viewed as that of learning a representation of an object that, given a new image, is used to identify the target object in it. The learning account is developed within the PAC (Probably Approximately Correct) model of learnability. This framework allows us to (1) quantify success relative to the experience with previously observed objects, without making assumptions on the distribution, and (2) study the theoretical limits of what can be learned from images in terms of the expressivity of the intermediate representation used by the learning process. That is, learnability guarantees that objects sampled from the same distribution as the one that governs the experience of the learner are very likely to be recognized correctly. In addition, the framework gives guideline to developing practical algorithmic solutions to the problem.</p>
<p style="text-align: justify;">Earlier works have discussed the possibility of identifying the theoretical limits of what can be learned from images and found that learning in terms of the raw representation of the images is computationally intractable. Attempts to explain this focused the dependence of learnability on the representation of the object but failed to provide a satisfactory explanation for it, or a practical solution.</p>
<p style="text-align: justify;">The approach developed here builds on suggestions made in and relies heavily on the development of a feature efficient learning approach. This is a learning process capable of learning quickly and efficiently in domains in which the number of potential features is a very large but any concept of interest actually depends on a fairly small number of them. At the heart of the learning approach are two assumptions that we abstract as follows:</p>
<h3 style="text-align: justify;">Representational:</h3>
<p style="text-align: justify;">There exists a (possibly infinite) collection <em>M</em> of &#8220;explanations&#8221; such that an object <em>o</em> can be represented as a simple function of elements in <em>M</em>.</p>
<h3 style="text-align: justify;">Procedural:</h3>
<p style="text-align: justify;">There exists a process that, given an image in which the target object <em>o </em>occurs, generates efficiently &#8220;explanations&#8221; in <em>M</em> that are present in the image and  such that, with high probability, at least one of them is in the representation of<em> o</em>.<br />
Under these assumptions we prove that there exists an efficient algorithm that, given a collection of images labeled as positive or negative examples of the target object, can learn a good representation of the object. That is, it can learn a representation that, with high probability, would make correct predictions on future images that contain (or do not contain) the object. Furthermore, we show that under these conditions, the learned representations are robust under realistic noise conditions.  A significant non-assumption of our approach is that it has no prior knowledge on the distribution of images nor it is trying to estimate it. The learned representation is guaranteed to perform well when tested on images sampled from the distribution that governed the data observed in its learning experience.</p>
<p style="text-align: justify;">The framework developed here is very general.  The <em>explanations </em>alluded to above can represent a variety of computational processes and information sources that  operate on the image. They can depend on local properties of the image, the relative positions of primitives in the image, and even external information sources or context variables. Thus, the theoretical support given here applies also to an intermediate learning stage in a hierarchical process. In order to generate the explanations efficiently, this work assumes that they are syntactically simple in terms of the raw image. However, the explanation might as well be syntactically simple in terms of previously learned or inferred predicates, giving rise to hierarchical representation.</p>
<h2 style="text-align: justify;">2. Results</h2>
<p style="text-align: justify;">We use the Columbia Object Image Library (COIL-100) database in all the experiments below (COIL is available at http://www.cs.columbia.edu/CAVE). The COIL-100 dataset consists of color images of 100 objects where the images of the objects that were taken at pose intervals of 5 degrees, i.e., 72 poses per object. The images were also normalized such that the larger of the two object dimensions (height and width) fits the image size off 128 x 128 pixels.  Figure 1 shows the images of the 100 objects taken in frontal view, i.e., zero pose angle. The 32 highlighted objects in 1 are considered more difficult to recognize by other researchers; we use all 100 objects including these in our experiments. Each color image is converted to a gray-scale image of 32 x 32 pixels for 128 x 128 pixels.  Figure 1 shows the images of the 100 objects taken in frontal view, i.e., zero pose angle. The 32 highlighted objects in Figure 1 are considered more difficult to recognize by other researchers; we use all 100 objects including these in our experiments. Each color image is converted to a gray-scale image of 32 x 32 pixels for our experiments.</p>
<p style="text-align: justify;"><img src="../../mhyang/images/select.gif" alt="" width="720" height="180" /></p>
<p style="text-align: justify;">Figure 1. Columbia Object Image Library (COIL-100) consists of 100 objects of varying poses (5 degrees apart). The objects are shown in row order where the highlighted ones are those considered more difficult to recognize by other researchers.</p>
<h3 style="text-align: justify;">2.1 Ground Truth of the COIL-100 Dataset</h3>
<p style="text-align: justify;">At first glance, it seems difficult to recognize the objects in the COIL dataset because it consists of a large number of objects with varying pose, texture, shape and  size. Since each object has 72 images of different poses (5 degrees apart), many view-based recognition methods use 36 (10 degrees apart) of them for training and the remaining images for testing. However, it turns out that under these dense sampling conditions the recognition problem is not difficult (even when only grey-level images are used). Namely, in this case, instances that belong to the same object are very close to each other in the image space (where each data point represents an image of an object in a certain pose). We verified this by experimenting with a simple nearest neighbor classifier (using the Euclidean distance), resulting in an average recognition rate of 98.50% (54 errors out of 3,600 tests).  Figure 2 shows some of the objects misclassified by nearest neighbor method.In principle, one may want to avoid using the nearest neighbor method since it requires a lot of memory for storing templates and its recognition time complexity is high. The goal here was simply to show that this simple method is comparable to the complex SVM approaches for the case of dense sampling. Therefore, the abovementioned recognition problem is not appropriat for comparison among different methods.</p>
<p style="text-align: justify;"><img src="../../mhyang/images/mismatch.gif" alt="" width="528" height="134" /></p>
<p style="text-align: justify;">Figure 2. Mismatched objects using the nearest neighbor method. (<em>x</em>:<em>a</em>,y:<em>b</em>) means that object <em>x</em> with view angle <em>a</em> is recognized as object <em>y</em> with view angle <em>b.</em> It shows some of the 54 errors (out of 3,600 test samples) made by the nearest neighbor classifier when there are 36 views per object in the training set. (See paper for datils).</p>
<p style="text-align: justify;">Table 1. Recognition rates of nearest neighbor classifier</p>
<table style="text-align: justify;" border="1" width="80%">
<tbody>
<tr>
<td></td>
<td>30 objects randomly selected<br />
from COIL dataset</td>
<td>32 objects shown seletected by Pointil and Verri</td>
<td>The whole 100 objects in COIL<br />
dataset</td>
</tr>
<tr>
<td>Errors/Tests</td>
<td>14/1080</td>
<td>46/1152</td>
<td>54/3600</td>
</tr>
<tr>
<td>Recognition Rate</td>
<td>98.70%</td>
<td>96.00%</td>
<td>98.50%</td>
</tr>
</tbody>
</table>
<p style="text-align: justify;">It is interesting to see that the pairs of the objects on which the nearest neighbor method misclassified have similar geometric configurations and similar poses.  A close inspection shows that most of the recognition errors are made between the three packs of chewing gums, bottles and cars. Other dense sampling cases are easier for this method. Consequently, the set of selected objects in an experiment has direct effects on the recognition rate.  This needs to be taken into account when evaluating results that use only a subset of the 100 objects (typically 20 to 30) from the COIL dataset for experiments. Table 1 shows the recognition rates of nearest neighbor classifiers in several experiments in which 36 poses of each object are used for templates and the remaining 36 poses are used for tests.</p>
<p style="text-align: justify;">Given this baseline experiment we have decided to perform our experimental comparisons in cases in which the number of views of objects available in training is limited. Some of our preliminary results were presented in companion papers.</p>
<h3 style="text-align: justify;">2.2 Experiment Setups</h3>
<p style="text-align: justify;">Applying SNoW to 3D object recognition requires specifying the architecture used and the representation chosen for the input images. As described above, to perform object recognition we associate a target unit with each target object. This target learns a definition of the object in terms of the input features extracted from the image. We could either define a single SNoW unit which contains target subnetworks for all the 100 different target objects, or we may define different units, each with  several  (e.g., two) competing target objects. Statistically, this approach is advantageous although, clearly, it requires a  lot more computation. The architecture selected affects both the training time, where learning a definition for object <em>a </em>makes use of negative examples of other objects that are part of the same unit but, more importantly, it makes a difference in testing; rather that two competing objects for a decision, there may be a hundred. The chances for a spurious mistake caused by an incidental view point are clearly much higher. On the other hand, it has significant advantages in terms of space complexity and the appeal of the evaluation mode.</p>
<p style="text-align: justify;">SVMs are two-class classifiers which, for an <em>c</em>-class pattern recognition problem, need to train c*(c-1)/2 binary classifiers. Since we compare the performance of the proposed SNoW-based method with SVMs, in order to maintain a fair comparison we have to perform it in the {\em one-against-one} scheme. That is, we use SNoW units of size two. To classify a test instance, tournament-like pair-wise competition between all the machines is performed and the winner determines<br />
the label of the test instance.  The recognition rates of the SVM and SNoW based methods shown in Table 2 were performed using the one-against-one scheme. That is, we trained 4,950 classifiers for each method and evaluated 99 (=50+25+12+6+3+2+1) classifiers on each test instance.</p>
<h3 style="text-align: justify;">2.3 Results Using Pixel-Based Representation</h3>
<p style="text-align: justify;">Table 2. shows the recognition rates of the SNoW-based method, the SVM-based method (using linear dot product for the kernel function), and the nearest neighbor classifier using the COIL-100 dataset. The important parameter here is that we vary the number of views of an object (<em>v</em>) during training and use the rest of the views (72-<em>v)</em> of an object for testing.Table 2. Experimental results of three classifiers using the 100 objects in the COIL-100 dataset</p>
<table style="text-align: justify;" border="1" width="80%">
<tbody>
<tr>
<td></td>
<td colspan="4" align="center"># of View/Test</td>
</tr>
<tr>
<td></td>
<td>36/3600 tests</td>
<td>18/5400 tests</td>
<td>8/6400 tests</td>
<td>4/6800 tests</td>
</tr>
<tr>
<td>SNoW</td>
<td>95.81%</td>
<td>92.31%</td>
<td>85/13%</td>
<td>81.46%</td>
</tr>
<tr>
<td>Linear SVM</td>
<td>96.03%</td>
<td>91.30%</td>
<td>84.80%</td>
<td>78.50%</td>
</tr>
<tr>
<td>Nearest Neighbor</td>
<td>98.50%</td>
<td>87.54%</td>
<td>79.52%</td>
<td>74.63%</td>
</tr>
</tbody>
</table>
<p style="text-align: justify;">The experimental results show that the SNoW-based method performs as well as the SVM-based method when many views of the objects are present during training and outperforms SVM-based method when the numbers of views is limited. Although it is not surprising to see that the recognition rate decreases as the number of views available during training decreases, it is worth noticing that both SNoW and SVM are capable of recognizing 3D objects in the COIL-100 dataset with satisfactory performance if enough views (e.g., &gt;18) are provided. Also they seems to be fairly robust even if only a limited number of views (e.g., 8 and<br />
4) are used for training; the performance of both methods degrades gracefully.</p>
<p style="text-align: justify;">To provide some more insight into these methods, we note that in the SVM-based methods, only 27.78% (20 out of 72) of the input vectors serves as support vectors. For SNoW, out of 262,144 potential features in the pixel-based representation, only 13,805 were active in the dense case (i.e., 36 views). This shows the advantage gained from using the sparse  rchitecture. However, only a small number of those may be relevant to the representation of each target, as a more careful look as the SNoW output hypothesis reveals.</p>
<p style="text-align: justify;">An additional potential advantage of the SNoW architecture is that it does not learn discriminators, but rather can learn a representation for each object, which can then be used for prediction in the one-against-all scheme or to build hierarchical representations. However, as is shown in Table~\ref{chap6-table4}, this implies a significant degradation is the performance.  Finding a way to make better predictions in the one-against-all scheme is one of the important issues for future investigation, to better exploit the advantages of this approach.</p>
<p style="text-align: justify;">Table 3. Recognition rates of SNoW using two learning paradigms</p>
<table style="text-align: justify;" border="1" width="80%">
<tbody>
<tr>
<td></td>
<td colspan="4" align="center"># of View</td>
</tr>
<tr>
<td>SNoW</td>
<td>36</td>
<td>18</td>
<td>8</td>
<td>4</td>
</tr>
<tr>
<td>One-against-one</td>
<td>95.81%</td>
<td>92.31%</td>
<td>85.13%</td>
<td>81.46%</td>
</tr>
<tr>
<td>Linear SVM</td>
<td>90.52%</td>
<td>85.50%</td>
<td>81.85%</td>
<td>76.00%</td>
</tr>
</tbody>
</table>
<h3 style="text-align: justify;">2.4  Results Using Edge-Based Representation</h3>
<p style="text-align: justify;">For each 32 x 32 edge map, we extract horizontal and vertical edges (of length at least 3 pixels) and then encode as our features conjunctions of two of these edges. The number of potential features of this sort is 2,096,128. However, only an average of 1,822 of these is active for objects in the COIL-100 dataset.  To reduce the computational cost the feature vectors were further pruned and only the 512 most frequently occurring features were retained in each image.Table 3 shows the performance of the SNoW-based method when conjunctions of edges are used to represent objects.  As before, we vary the number of views of an object <em>n </em>during training and use the rest of the views<em> 72-n</em> of an object for testing.  The results indicate that conjunctions of edges provide useful information for object recognition and that SNoW is able to learn very good object representations using these features.  The experimental results also<br />
exhibit the relative advantage of this representation increases when the number of views per object is limited.</p>
<p style="text-align: justify;">Table 4. Experimental results of four classifiers using the 100 objects in the COIL-100 dataset</p>
<table style="text-align: justify;" border="1" width="80%">
<tbody>
<tr>
<td></td>
<td colspan="4" align="center"># of View/Test</td>
</tr>
<tr>
<td></td>
<td>36/3600 tests</td>
<td>18/5400 tests</td>
<td>8/6400 tests</td>
<td>4/6800 tests</td>
</tr>
<tr>
<td>SNoW w/ conjunction of edges</td>
<td>96.25%</td>
<td>94.13%</td>
<td>89.23%</td>
<td>88.28%</td>
</tr>
<tr>
<td>SNoW w/ intensity values</td>
<td>95.81%</td>
<td>92.31%</td>
<td>85.13%</td>
<td>81.46%</td>
</tr>
<tr>
<td>Linear Support Vector Machine</td>
<td>96.03%</td>
<td>91.30%</td>
<td>84.80%</td>
<td>78.50%</td>
</tr>
<tr>
<td>Nearest Neighbor</td>
<td>98.50%</td>
<td>87.54%</td>
<td>79.52%</td>
<td>74.63%</td>
</tr>
</tbody>
</table>
<h2 style="text-align: justify;">3. Conclusion</h2>
<p style="text-align: justify;">The main contribution of this work we view in proposing a learning framework for visual learning and exhibiting its feasibility. In this approach learnability can be rigorously studied without making assumptions on the distribution of the observed objects but, via the PAC model, the learned hypothesis&#8217; performance naturally depends on its prior experience. An important feature of the approach is that learning is not studied directly in terms of the raw data but rather with respect to intermediate representations extracted from it and can thus be quantified in terms of the ability to generate expressive intermediate representations. In particular, it makes explicit the requirements from these representations to allow learnability. We believe that research in vision should concentrate on the study of these intermediate representations.We evaluated the approach and demonstrated its feasibility in a large scale experiment in the context of learning for object recognition. Our experiments allowed us also to perform a fair comparison between two successful and related  learning methods and study them in the context of object recognition. We have illustrated our approach in a large scale experimental study in which we use the SNoW learning architecture to learn representations for the 100 objects in COIL-100.  Although it is<br />
clear that object recognition in isolation is not the ultimate goal, this study shows the potential of this computational approach as a basis for studying and supporting more realistic visual inferences.</p>
<p style="text-align: justify;">We note that for a fair comparison among different methods, we have used pixel-based presentation in the experiments. It is clear, however, that the edge-based representation used is even more effective and robust and should be the starting point for future research. There is no question that the RGFs used in this<br />
work are not general enough to support more challenging recognition problems; the intention was merely to exhibit the general approach. We believe that pursing the direction of using complex intermediate representations will benefit future work on recognition and, in particular, robust recognition under various types of noise.</ol>
<h3>Publications</h3>
<ol style="text-align: justify;">
<li> Dan Roth, Ming-Hsuan Yang and Narendra Ahuja, Learning to Recognize Objects, to appear in Neural Computation, 2001.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.43.1689">Abstract</a></li>
<li> Ming-Hsuan Yang, Dan Roth and Narendra Ahuja, Learning to Recognize 3D Objects With SNoW<strong>,</strong> podium presentation, in Proceedings of the Sixth European Conference on Computer Vision (ECCV 2000) , pp. 439-454, vol. 1, Dublin, June, 2000.  <a href="http://l2r.cs.uiuc.edu/~danr/Papers/YangRoAh00a.pdf">Full Text</a></li>
<li> Dan Roth, Ming-Hsuan Yang and Narendra Ahuja, Learning to Recognize Objects, in Proceedings of the 2000 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2000), pp. 724-731, vol. 1, Hilton Head, June, 2000.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.43.1689">Abstract</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=644</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Out-of-Core Tensor Approximation of Multidimensional Matrices of Visual Data</title>
		<link>http://vision.ai.uiuc.edu/?p=633</link>
		<comments>http://vision.ai.uiuc.edu/?p=633#comments</comments>
		<pubDate>Thu, 07 May 2009 20:07:23 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Out-of-Core Tensor Approximation of Multi-Dimensional Matrices of Visual Data]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Synthesis, Rendering and Visualization]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=633</guid>
		<description><![CDATA[<p><img class="alignleft" style="border: 0pt none;" src="../../project_new/flower.gif" border="0" alt="" width="145" height="110" />An algorithm for memory (core) efficient tensor approximation that obtains a compact representation of multidimensional visual data for efficient image-based rendering. The algorithm manages with a small memory size. We apply it to 6D Bidirectional Texture Functions (BTFs), 7D Dynamic&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" style="border: 0pt none;" src="../../project_new/flower.gif" border="0" alt="" width="145" height="110" />An algorithm for memory (core) efficient tensor approximation that obtains a compact representation of multidimensional visual data for efficient image-based rendering. The algorithm manages with a small memory size. We apply it to 6D Bidirectional Texture Functions (BTFs), 7D Dynamic BTFs and 4D temporal volume sequences.</p>
<p><span id="more-633"></span></p>
<ol>
<li><span style="font-family: Wingdings-Regular;"> Hongcheng Wang, Qing Wu, Lin Shi, Yizhou Yu and Narendra Ahuja, Out-of-Core Tensor Approximation of Multi-Dimensional Matrices of Visual Data,  in ACM SIGGRAPH 2005</span><span style="font-family: Wingdings-Regular; font-size: x-small;">.</span></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=633</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Face Recognition</title>
		<link>http://vision.ai.uiuc.edu/?p=634</link>
		<comments>http://vision.ai.uiuc.edu/?p=634#comments</comments>
		<pubDate>Thu, 07 May 2009 20:06:44 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[faces and gestures]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=634</guid>
		<description><![CDATA[<table style="height: 133px;" border="0" width="835">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-688" title="proj5_2_2_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_2_2_icon.gif" alt="proj5_2_2_icon" width="137" height="122" /></td>
<td valign="top">To develop methods to tell the identity of a person from a frontal image and evaluate its performance with state-of-the-art methods</td>
</tr>
</tbody>
</table>
<p><span id="more-634"></span></p>
<h3>Publications</h3>
<ol style="text-align: justify;">
<li>Y. Kitamura, J. Ohya, N. Ahuja and F. Kishino, Computational Taxonomy and Recognition of Facial Expressions, First Asian Conference on&#8230;</li></ol>]]></description>
			<content:encoded><![CDATA[<table style="height: 133px;" border="0" width="835">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-688" title="proj5_2_2_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_2_2_icon.gif" alt="proj5_2_2_icon" width="137" height="122" /></td>
<td valign="top">To develop methods to tell the identity of a person from a frontal image and evaluate its performance with state-of-the-art methods</td>
</tr>
</tbody>
</table>
<p><span id="more-634"></span></p>
<h3>Publications</h3>
<ol style="text-align: justify;">
<li>Y. Kitamura, J. Ohya, N. Ahuja and F. Kishino, Computational Taxonomy and Recognition of Facial Expressions, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 434-437.</li>
<li>J. Ma, N. Ahuja, C. Neti and A. Senior, Recovering Frontal-Pose Image from a Single Profile Image<strong>,</strong> IEEE International Conference on Image Processing, Vol. 2, Vancouver, BC, Canada, September 2000, 243-247.  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=899288">Abstract</a></li>
<li>M.-H. Yang, N. Ahuja and D. Kriegman, Face Recognition Using Kernel Eigenfaces, IEEE Int. Conference on Image Processing, Vol. 1, Vancouver, BC, Canada, Sept. 2000, 37-41.  <a href="http://eprints.kfupm.edu.sa/40383/1/40383.pdf">Full Text</a></li>
<li>M.H. Yang and N. Ahuja, A Geometric Approach to Train Support Vector Machines, IEEE International Conference on Computer Vision and Pattern Recognition, Vol. I, Hilton Head, SC, June 2000, 430-437.  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=855851">Abstract</a></li>
<li>M.H. Yang and N. Ahuja, Face Detection and Hand Gesture Recognition for Vision-Based Human Computer Interaction<strong>,</strong> Kluwer Academic Publishers, 2001.  <a href="http://www.springer.com/computer/computer+imaging/book/978-0-7923-7409-1">Abstract</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=634</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Predictive Multiple Description Coding using Wyner-Ziv Codes</title>
		<link>http://vision.ai.uiuc.edu/?p=619</link>
		<comments>http://vision.ai.uiuc.edu/?p=619#comments</comments>
		<pubDate>Thu, 07 May 2009 20:01:47 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Image Processing and Communications]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Video Compression using Wyner-Ziv Codes]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=619</guid>
		<description><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/wz_codes.jpg" border="2" alt="" width="160" height="103" /></span></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Two-channel predictive multiple description coding is posed as a variant of the Wyner-Ziv coding problem. Practical code constructions are proposed within this framework, and the performance of the proposed codes is compared with conventional approaches, for communication of a first-order&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/wz_codes.jpg" border="2" alt="" width="160" height="103" /></span></p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">Two-channel predictive multiple description coding is posed as a variant of the Wyner-Ziv coding problem. Practical code constructions are proposed within this framework, and the performance of the proposed codes is compared with conventional approaches, for communication of a first-order Gauss-Markov source over erasure channels with independent failure probabilities.</p>
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify">
<p style="margin: 5px 30px; text-indent: 30px;" align="justify"><span id="more-619"></span></p>
<ol>
<li>A. Jagmohan, A. Sehgal, N. Ahuja, &#8220;WYZE-PMD based Multiple Description Video Codec,&#8221; Proc. IEEE Int. Conf. Multimedia Expo, 2003, pp. I-569-572 <a href="../../publications/sicme.pdf">Full Text</a></li>
<li>A. Jagmohan, N. Ahuja, &#8220;Wyner-Ziv Encoded Predictive Multiple Descriptions,&#8221; Proc. Data Compression Conference, p. 213 2003. <a href="../../publications/jdcc.pdf">Full Text</a></li>
<li>A. Jagmohan, A. Sehgal, N. Ahuja, &#8220;Predictive Encoding using Coset Codes, &#8221; Invited Paper, Proc. IEEE Int. Conf. Image Processing, pp. I-29-32, 2002. <a href="../../publications/sicip2.pdf">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=619</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Learning for Object Recognition</title>
		<link>http://vision.ai.uiuc.edu/?p=607</link>
		<comments>http://vision.ai.uiuc.edu/?p=607#comments</comments>
		<pubDate>Thu, 07 May 2009 19:55:56 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[learning for object recognition]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=607</guid>
		<description><![CDATA[<table style="height: 86px;" border="0" width="762">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-614" title="proj5_1_2_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_1_2_icon.gif" alt="proj5_1_2_icon" width="103" height="103" /></td>
<td style="text-align: justify;" valign="top">A learning algorithm accounting for the problem of object recognition is developed within the PAC (Probably Approximately Correct) model of learnability. We evaluate this apporach using the COIL-100 database and exhibit its advantages over conventional methods.</td>
</tr>
</tbody>
</table>
<p><span id="more-607"></span></p>
<h3>Publications</h3>
<ol>
<li>M.H. Yang, N. Ahuja and&#8230;</li></ol>]]></description>
			<content:encoded><![CDATA[<table style="height: 86px;" border="0" width="762">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-614" title="proj5_1_2_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_1_2_icon.gif" alt="proj5_1_2_icon" width="103" height="103" /></td>
<td style="text-align: justify;" valign="top">A learning algorithm accounting for the problem of object recognition is developed within the PAC (Probably Approximately Correct) model of learnability. We evaluate this apporach using the COIL-100 database and exhibit its advantages over conventional methods.</td>
</tr>
</tbody>
</table>
<p><span id="more-607"></span></p>
<h3>Publications</h3>
<ol>
<li>M.H. Yang, N. Ahuja and D. Roth, View-Based 3D Object Recognition Using SNoW, Proc. of the Fourth Asian Conference on Computer Vision, Vol. 2, Taipei, January 2000, 830-835.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.41.8348">Abstract</a></li>
<li>D. Roth, M.-H. Yang and N. Ahuja, Learning to Recognize Objects, IEEE International Conference on Computer Vision and Pattern Recognition, Vol. I, Hilton Head, SC, June 2000, 724-731.</li>
<li>J. Weng, N. Ahuja and T. Huang,<strong> </strong>Learning, Recognition and Segmentation of 3-D Objects from 2-D Images, 4th Int. Conf. on Computer Vision, Berlin, Germany, May 11-14, 1993, 121-128.</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=607</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Learning of Low-level Spatiotemporal Structural Patterns</title>
		<link>http://vision.ai.uiuc.edu/?p=572</link>
		<comments>http://vision.ai.uiuc.edu/?p=572#comments</comments>
		<pubDate>Thu, 07 May 2009 18:11:32 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[learning for object recognition]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=572</guid>
		<description><![CDATA[<table style="height: 145px;" border="0" width="780">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-599" title="proj5_1_1_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_1_1_icon.jpg" alt="proj5_1_1_icon" width="160" height="135" /></td>
<td style="text-align: justify;" valign="top">Given an image or a video sequence, a prespecified set of low level, spatial and/or temporal descriptors of the image/video structure, and a higher level interpretation of the structure, use computational learning methods to derive a succinct relationship between the&#8230;</td></tr></tbody></table>]]></description>
			<content:encoded><![CDATA[<table style="height: 145px;" border="0" width="780">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-599" title="proj5_1_1_icon" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/proj5_1_1_icon.jpg" alt="proj5_1_1_icon" width="160" height="135" /></td>
<td style="text-align: justify;" valign="top">Given an image or a video sequence, a prespecified set of low level, spatial and/or temporal descriptors of the image/video structure, and a higher level interpretation of the structure, use computational learning methods to derive a succinct relationship between the interpretation and the low level structural description.</td>
</tr>
</tbody>
</table>
<p><span id="more-572"></span></p>
<h3>Publications</h3>
<ol style="text-align: justify;">
<li>J. Weng, N. Ahuja and T. S. Huang, Learning Recognition and Segmentation Using the Cresceptron, International Journal of Computer Vision, 25(2), 1997, 109-143.  <a href="http://www.springerlink.com/content/x55211517142235u/">Abstract</a></li>
<li>N. Srinivasa and N. Ahuja, A Topological and Temporal Correlator Network for Spatiotemporal Pattern Learning, Recognition and Recall, IEEE Transactions on Neural Networks, Vol. 10, No. 2, March 1999, 356-371.  <a href="http://people.cecs.ucf.edu/georgiopoulos/mlrg/MLRG_documents/madan-spatiotemporal.pdf">Full Text</a></li>
<li>J. Weng, N. Ahuja and T. S. Huang, Cresceptron: A Self-Organizing Neural Network Which Grows Adaptively, Proc. Int. Joint Conf. on Neural Networks, Vol. 1, Baltimore, Maryland, June 1992, 576-581.  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=287150">Abstract</a></li>
<li>B. Perrin, N. Ahuja and N. Srinivasa,<strong> </strong>Learning Multiscale Image Models of 2D Object Classes, 3rd Asian Conference on Computer Vision, Hong Kong, January 8-11, 1998, 323-331.  <a href="http://vision.ai.uiuc.edu/newpubs/perrinAhujaObjRec.pdf">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=572</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Image Ensembles/ Video analysis Using Image-As-Matrix Representation</title>
		<link>http://vision.ai.uiuc.edu/?p=541</link>
		<comments>http://vision.ai.uiuc.edu/?p=541#comments</comments>
		<pubDate>Thu, 07 May 2009 17:29:15 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[tensor]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=541</guid>
		<description><![CDATA[<table style="height: 150px;" border="0" width="764">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-555" title="rank12" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/rank12.jpg" alt="rank12" width="173" height="140" /></td>
<td valign="top">The goal of this project is to explore new algorithms based on multilinear algebra for representation of multidimensional data in computer vision.</td>
</tr>
</tbody>
</table>
<p><span id="more-541"></span></p>
<h3>Publications</h3>
<ol>
<li>Hongcheng Wang and Narendra Ahuja, Rank-R Approximation of Tensors Using Image-as-Matrix Representation, IEEE International Conference on Computer Vision and&#8230;</li></ol>]]></description>
			<content:encoded><![CDATA[<table style="height: 150px;" border="0" width="764">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-555" title="rank12" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/rank12.jpg" alt="rank12" width="173" height="140" /></td>
<td valign="top">The goal of this project is to explore new algorithms based on multilinear algebra for representation of multidimensional data in computer vision.</td>
</tr>
</tbody>
</table>
<p><span id="more-541"></span></p>
<h3>Publications</h3>
<ol>
<li>Hongcheng Wang and Narendra Ahuja, Rank-R Approximation of Tensors Using Image-as-Matrix Representation, IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2005.  <a href="http://vision.ai.uiuc.edu/publications/10.1.1.60.7704.pdf">Full Text</a></li>
<li>Hongcheng Wang and Narendra Ahuja,<strong> </strong>Compact Representation of Multidimensional Data Using Tensor Rank-One Decomposition, Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 1, pp. 44-47 08 23 &#8211; 08, 2004, Cambridge UK. <a href="http://vision.ai.uiuc.edu/~wanghc/papers/icpr04_tensor.pdf" target="_self">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=541</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Facial Expression Decomposition</title>
		<link>http://vision.ai.uiuc.edu/?p=522</link>
		<comments>http://vision.ai.uiuc.edu/?p=522#comments</comments>
		<pubDate>Thu, 07 May 2009 16:36:07 +0000</pubDate>
		<dc:creator>bernard</dc:creator>
				<category><![CDATA[Learning, Recognition and Human Computer Interaction]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[tensor]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=522</guid>
		<description><![CDATA[<table style="height: 140px;" border="0" width="797">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-536" title="expression1" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/expression1.jpg" alt="expression1" width="187" height="132" /></td>
<td valign="top">New algorithms for facial image analysis based on multilinear algebra. We learn the expression subspace and person subspace from a corpus of images based on Higher-Order Singular Value Decomposition, and investigate their applications in facial expression synthesis, face recognition and&#8230;</td></tr></tbody></table>]]></description>
			<content:encoded><![CDATA[<table style="height: 140px;" border="0" width="797">
<tbody>
<tr>
<td valign="top"><img class="alignleft size-full wp-image-536" title="expression1" src="http://vision.ai.uiuc.edu/wordpress/wp-content/uploads/2009/05/expression1.jpg" alt="expression1" width="187" height="132" /></td>
<td valign="top">New algorithms for facial image analysis based on multilinear algebra. We learn the expression subspace and person subspace from a corpus of images based on Higher-Order Singular Value Decomposition, and investigate their applications in facial expression synthesis, face recognition and facial expression recognition.</td>
</tr>
</tbody>
</table>
<p><span id="more-522"></span></p>
<h3>Publications</h3>
<ol>
<li>Hongcheng Wang, Narendra Ahuja, Facial Expression Decomposition, Ninth IEEE International Conference on Computer Vision Volume 2, p. 958 10 13 &#8211; 10, 2003, Nice, France. <a title="Facial Expression Decomposition" href="http://vision.ai.uiuc.edu/~wanghc/papers/iccv03_facial.pdf" target="_self">Full Text</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=522</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Recent Publications</title>
		<link>http://vision.ai.uiuc.edu/?p=436</link>
		<comments>http://vision.ai.uiuc.edu/?p=436#comments</comments>
		<pubDate>Thu, 07 May 2009 08:28:21 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=436</guid>
		<description><![CDATA[<h3>2009</h3>
<ol>
<li><span style="font-size: 11pt; color: #1f497d;"> </span>C. Gao, H. Hua and N. Ahuja, A Hemispherical Imaging Camera, Computer Vision and Image Understanding, 2009, in press. <a href="http://www.sciencedirect.com/science?_ob=ArticleURL&#38;_udi=B6WCX-4VV2NR1-1&#38;_user=10&#38;_rdoc=1&#38;_fmt=&#38;_orig=search&#38;_sort=d&#38;view=c&#38;_acct=C000050221&#38;_version=1&#38;_urlVersion=0&#38;_userid=10&#38;md5=9dc7d44177b67c361853fc57d76b1682">Abstract</a> <a href="http://vision.ai.uiuc.edu/publications/Gao-CVIU09.pdf">Full Text</a></li>
<li>S. Todorovic and N. Ahuja, Texel-based Texture Segmentation, International Conference on Computer Vision (ICCV), Kyoto, Japan, September-October 2009.</li>
<li>E. Akbas and&#8230;</li></ol>]]></description>
			<content:encoded><![CDATA[<h3>2009</h3>
<ol>
<li><span style="font-size: 11pt; color: #1f497d;"> </span>C. Gao, H. Hua and N. Ahuja, A Hemispherical Imaging Camera, Computer Vision and Image Understanding, 2009, in press. <a href="http://www.sciencedirect.com/science?_ob=ArticleURL&amp;_udi=B6WCX-4VV2NR1-1&amp;_user=10&amp;_rdoc=1&amp;_fmt=&amp;_orig=search&amp;_sort=d&amp;view=c&amp;_acct=C000050221&amp;_version=1&amp;_urlVersion=0&amp;_userid=10&amp;md5=9dc7d44177b67c361853fc57d76b1682">Abstract</a> <a href="http://vision.ai.uiuc.edu/publications/Gao-CVIU09.pdf">Full Text</a></li>
<li>S. Todorovic and N. Ahuja, Texel-based Texture Segmentation, International Conference on Computer Vision (ICCV), Kyoto, Japan, September-October 2009.</li>
<li>E. Akbas and N. Ahuja, From ramp discontinuities to segmentation tree, 9th Asian Conference on Computer Vision (ACCV), Xi&#8217;an, China, September 2009. <a href="http://vision.ai.uiuc.edu/publications/accv2009_akbas_ahuja.pdf" target="_blank">Full text</a></li>
<li>Q.Kong, A.Kumar, N.Ahuja and Y.Liu, Robust Segmentation of Freight Containers in Train Monitoring Videos, Workshop on Applications of  Computer Vision(WACV), Snowbird, Utah, December 2009.</li>
</ol>
<h3>2008</h3>
<div style="text-align: justify;">
<ol>
<li>S. Todorovic and N. Ahuja, Region Based Hierarchical Image Matching, International Journal of Computer Vision,  Vol. 78, No. 1, 2008, 47-66. <a href="http://www.springerlink.com/content/4362267345742252/">Abstract</a></li>
<li>A. Briassouli and N. Ahuja, Integration of Frequency and Space for Multiple Motion Estimation and Shape-Independent Object Segmentation, IEEE Transactions on Circuits, Systems and Video Technology, Vol. 18, No. 5, May 2008, 657-669. <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4454233">Abstract</a></li>
<li>H. Wang and N. Ahuja, A Tensor Approximation Approach to Dimensionality Reduction, International Journal of Computer Vision, 76:3, March 2008, 217-229. <a href="../../publications/ijcv2008_wang_ahuja_tensor_dimensionality.pdf">Full Text</a></li>
<li>S. Todorovic and N. Ahuja, Unsupervised Category Modeling, Recognition, and Segmentation in Images, IEEE Trans. Pattern Analysis and Machine Intelligence, 2008, to appear. <a href="../../%7Esintod/research/publications/CategoryExtractionPAMI07.pdf">Full Text</a></li>
<li>J. Hart, E. Resendiz, B. Freid, S. Sawadisavi, C. Barkan and N. Ahuja, Machine Vision Using Multi-Spectral Imaging for Undercarriage Inspection of Railroad Equipment, Proc. 8th World Congress on Railway Research (WCRR), Seoul, Korea, May 2008.  <a href="http://vision.ai.uiuc.edu/publications/ihhc_fried_barkan_ahuja_hart_todorvic_kocher_rail_road.pdf">Full Text</a></li>
<li>Varsha Hedau, H. Arora and N. Ahuja, Matching Images under Unstable Segmentation,  in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, AL, 2008. <a href="../../publications/cvpr2008_hedau_arora_ahuja_image_matching.pdf">Full Text</a></li>
<li>N. Ahuja and S. Todorovic, Connected segmentation tree &#8211; a joint representation of region layout and hierarchy, in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, AL, 2008. <a href="../../%7Esintod/research/publications/AhujaTodor.CST.CVPR08.pdf">Full Text</a></li>
<li>S. Todorovic and N. Ahuja, Learning subcategory relevances to category recognition, in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, AL, 2008. <a href="../../%7Esintod/research/publications/TodorAhuja.Weights.CVPR08.pdf">Full Text</a></li>
<li>Bernard Ghanem and N. Ahuja, Extracting a Fluid Dynamic Texture and the Background from Video, in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, AL, 2008. <a href="../../publications/cvpr2008_ghanem_ahuja_dynamic_texture.pdf">Full Text</a></li>
<li>B. Ghanem , E. Resendiz and N. Ahuja, Segmentation-based Perceptual Image Quality Assessment (SPIQA), International Conference on Image Processing, San Diego, CA, October 2008.  <a href="http://vision.ai.uiuc.edu/~bghanem2/Files/SPIQA_ICIP08.pdf">Full Text</a></li>
<li>S. Sawadisavi, J. Edwards, E. Resendiz, J. Hart, C. Barkan and N. Ahuja, Machine Vision Inspection of Railroad Track, Proc. AREMA 2008 Annual Conference, Salt Lake City, UT, September 2008.  <a href="http://ict.illinois.edu/railroad/CEE/pdf/Events/AREMA2008/Sawadisavi_et_al_AREMA_2008.pdf">Full Text</a></li>
<li>E. Resendiz and  N. Ahuja, A Unified Model for Activity Recognition in Video, Proc. 19th International Conference on Pattern Recognition(ICPR 2008), Tampa, FL, December 2008.  <a href="http://figment.csee.usf.edu/~sfefilat/data/papers/WeAT1.3.pdf">Full Text</a></li>
<li>S. Todorovic and N. Ahuja, Scale-invariant region-based hierarchical image matching, in Proc. 19th Int. Conf. Pattern Recognition (ICPR 2008), Tampa, FL, 2008. <a href="http://web.engr.oregonstate.edu/~sinisa/research/publications/scale_icpr08.pdf">Full Text</a></li>
<li>S. Shetty and N. Ahuja, A Uniformity Criterion and Algorithm for Data Clustering, in Proc. 19th International Conference on Pattern Recognition (ICPR 2008), Tampa, FL, December 2008.  <a href="http://figment.csee.usf.edu/~sfefilat/data/papers/ThAT9.46.pdf">Full Text</a></li>
</ol>
</div>
<h3>2007</h3>
<div style="text-align: justify;">
<ol>
<li>Hongcheng Wang, Ning Xu, Ramesh Raskar and Narendra Ahuja, Videoshop: A New Framework for Video Editing in Gradient Domain, Graphical Models (GM), Volume 69, Issue 1, Jan. 2007, pp 57-70 <a href="../../publications/gm2007_wang_xu_raskar_ahuja_videoshop.pdf">Full Text</a></li>
<li>A. Kumar, N. Ahuja, J. Hart, U.K. Visesh, P.J. Narayanan, C.V. Jawahar, A Vision System for Monitoring Intermodal Freight Trains, Workshop on Applications of Computer Vision (WACV), Austin, TX, February 2007. <a href="../../publications/wacv2007_avinash_jawar_hart_ahuja_railroad.pdf">Full Text</a></li>
<li>S. Yi, B. Choi and N. Ahuja, Real-time Omni-directional Distance Measurement with Active Panoramic Vision, International Journal of Control, Automation, and Systems, Vol. 5, No. 2, April 2007, 184-191. <a href="../../publications/ijcas2007_yi_choi_ahuja_omni_cam.pdf">Full Text</a></li>
<li>Hong Hua, C. Gao, and N. Ahuja, Calibration of an augmented reality system using head-mounted projective displays, IEEE Transactions on Systems, Man, Cybernetics (Part A: Systems), 37(3), 416-30, May 2007. <a href="../../publications/ismar2007_hua_gao_ahuja_headmountedprojective.pdf">Full Text</a></li>
<li>B. Freid, C. Barkan, N. Ahuja, J. Hart, S. Todorvic, N. Kocher, Multispectral Machine Vision for Improved undercarriage Inspection of Railroad Rolling Stock, Proc. International Heavy Haul Conference, Specialist Technical Session, Kiruna, Sweden, June 2007, 737-744. <a href="../../publications/ihhc_fried_barkan_ahuja_hart_todorvic_kocher_rail_road.pdf">Full Text</a></li>
<li>T. Yu, N. Xu and N. Ahuja, Shape and View Independent Reflectance Map from Multiple Views, International Journal of Computer Vision, Vol. 73, No. 2, June 2007, 123-138. <a href="../../publications/ijcv2007_yu_xu_ahuja_shape_view_reflectance_map.pdf">Full Text</a></li>
<li>H. Arora, N. Loeff, D. Forsyth and N. Ahuja, Unsupervised Segmentation of Objects using Efficient Learning, Proc. IEEE Conference on Computer Vision and Pattern Recognition,  Minneapolis, MN, June 2007, 1-7 <a href="../../publications/cvpr2007_harora_ahuja_daf_leoff_unsupervised_segmentation_using_efficient_learning.pdf">Full Text</a></li>
<li>H. Arora and N. Ahuja, Modeling Objects using Distribution and Topology of Multiscale Region Pairs, IEEE Conference on Computer Vision and Pattern Recognition, Beyond Patches Workshop, Minneapolis, MN, June 2007, 1-8. <a href="../../publications/cvpr2007_harora_ahuja_modelingobjectsusingregionpairs.pdf">Full Text</a></li>
<li>A. Briassouli and N. Ahuja, Extraction and Analysis of Multiple Periodic Motions in Video Sequences, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, No. 7, July 2007, 1244-1261. <a href="../../publications/pami2007_briassouli_ahuja_motion_analysis.pdf">Full Text</a></li>
<li>H. Hua, N. Ahuja and C. Gao, Design Analysis of a High-Resolution Panoramic Camera Using Conventional Imagers and a Mirror Pyramid, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, No. 2, 2007, 356-361 <a href="../../publications/pami2007_hua_ahuja_gao_panoramic_camera.pdf">Full Text</a></li>
<li>Bernard Ghanem and Narendra Ahuja, Phase PCA for Dynamic Texture Video Compression, Proc.of the IEEE International Conference on Image Processing 2007, Sept. 16-19, San Antonio, TX <a href="../../publications/icip2007_bghanem_ahuja_dynamictextures.pdf">Full Text</a></li>
<li>N. Xu, N. Ahuja, R. Bansal, Object segmentation using graph cut based active contours, Computer Vision and Image Understanding,Volume 107, Issue 3, September 2007, Pages 210-224 <a href="../../publications/cviu2007_xu_bansal_ahuja.pdf">Full Text</a></li>
<li>N. Ahuja and S. Todorovic, Learning the Taxonomy and Models of Categories Present in Arbitrary Images,  Proc. of the IEEE International Conference on Computer Vision 2007, Oct. 14-20, Rio De Janeiro, Brazil <a href="../../publications/iccv2007_sintod_ahuja_categories.pdf">Full Text</a></li>
<li>N. Ahuja and S. Todorovic, Extracting Texels in 2.1D Natural Textures,  Proc. of the IEEE International Conference on Computer Vision 2007, Oct. 14-20, Rio De Janeiro, Brazil <a href="../../publications/iccv2007_sintod_ahuja_texels.pdf">Full Text</a></li>
<li>Bernard Ghanem and Narendra Ahuja, Phase Based Modelling of Dynamic Textures, Proc. of the IEEE International Conference on Computer Vision 2007, Oct. 14-20, Rio De Janeiro, Brazil <a href="../../publications/iccv2007_bghanem_ahuja_dynamictexture.pdf">Full Text</a></li>
<li>S. Todorovic and N. Ahuja, Region Based Hierarchical Image Matching, Int. Journal of Computer Vision, to appear. <a href="../../publications/ijcv2007_sintod_ahuja_regionbasedmatching.pdf">Full Text</a></li>
<li>A. Briassouli and N. Ahuja, Integration of Frequency and Space for Multiple Motion Estimation and Shape-Independent Object Segmentation, IEEE Transactions on Circuits, Systems and Video Technology, to appear  <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4454233&amp;con=yes">Abstract</a></li>
<li><span style="color: #000000;">Lai, Y-C., C.P.L. Barkan, J. Drapa,  N. Ahuja, J.M. Hart, P.J. Narayanan, C.V. Jawahar, A. Kumar, L. Milhon and M.P. Stehly 2007.  Machine-vision analysis of the energy efficiency of intermodal freight trains. Journal of Rail and Rapid Transit 221: 353-364. <a href="../../publications/jrrt2007_lai_et_al.pdf">Full Text</a></span></li>
</ol>
<p><span id="more-436"></span></div>
<h3>2006</h3>
<ol>
<li>C. Gao and N. Ahuja, A Refractive Camera for Acquiring Stereo Depth and Super-Resolution Images, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, New York, NY Vol. 2, June 2006, 2316-2323. <a href="../../papers/RefractiveCamera.pdf">Full Text</a></li>
<li> S. Todorovic and N. Ahuja, Extracting subimages of an unknown category from a set of images,  in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2006), vol. 1, pp. 927-934, New York, NY, 2006. <a href="../../papers/TodorovicAhuja_CategoryModeling_CVPR06.pdf">Full Text</a></li>
<li> S. Todorovic and N. Ahuja, 3D texture classification using the belief net of a segmentation tree, in Proc. 18th Int. Conf. Pattern Recognition (ICPR 2006), Hong Kong, China, 2006.<a href="../../papers/TodorovicAhuja_3DTexture_ICPR06.pdf"> Full Text</a></li>
<li>Alexia Briassouli, Narendra Ahuja, Spatial and Fourier Error Minimization for Motion Estimation and Segmentation, ICPR 2006, Hong Kong <a href="http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&amp;arnumber=1698841&amp;isnumber=35817">Full Text</a></li>
<li>Alexia Briassouli, Narendra Ahuja, Estimation of Multiple Periodic Motions from Video, ECCV 2006, Graz, Austria</li>
<li>Hongcheng Wang, Yunqiang Chen, Tong Fang, Jason Tyan and Narendra Ahuja, Gradient Adaptive Image Restoration and Enhancement, in Proc. IEEE International Conference on Image Processing (ICIP), Atlanta, VA, Aug. 2006 <a href="../../%7Ewanghc/papers/icip06_grad_filtering.pdf">Full Text</a></li>
<li>Tianli Yu, Hongcheng Wang, Narendra Ahuja, Wei-Chao Chen, Sparse Lumigraph Relight by Illumination and Reflectance Estimation from Multi-View Images, Eurographics Symposium on Rendering (EGSR), 2006 <a href="../../%7Ewanghc/papers/EGSR06_SceneFactor.pdf">Full Text</a></li>
<li>Tianli Yu, Hongcheng Wang, Narendra Ahuja, Wei-Chao Chen, Sparse Lumigraph Relight by Illumination and Reflectance Estimation from Multi-View Images, Technical Sketch, SIGGRAPH, 2006 <a href="../../%7Ewanghc/papers/siggraph06_sketch.pdf">Full Text</a></li>
<li>Himanshu Arora, Narendra Ahuja, Analysis of ramp discontinuity model for multiscale image segmentation. ICPR(1), 2006.</li>
<li> T. Yu, N. Ahuja and W-C. Chen, SDG Cut: 3D Reconstruction of Non-Lambertian Objects Using Graph Cuts on Surface Distance Grid, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, New York, NY, Vol. 2, June 2006, 2269-2276. <a href="http://research.nokia.com/files/wchen_CVPR06.pdf">Full Text</a></li>
<li>C-B. Liu, R-S. Lin, M-H. Yang, N. Ahuja and S. Levinson, Object Tracking Using Globally Coordinated Nonlinear Manifolds, International Conference on Pattern Recognition, Vol. 1, Hong Kong, August 2006, 844-847. <a href="http://vision.ai.uiuc.edu/~cbliu/publications/icpr06_tracking.pdf">Full Text</a></li>
<li>S.Yi and Narendra Ahuja, An Omnidirectional Stereo Vision System Using a Single Camera, International Conference on Pattern Recognition, Vol. 4, Hong Kong, August 2006, 861-847. <a href="../../newpubs/sooyeong_camera.pdf">Full Text</a></li>
<li>S. Yi and N. Ahuja, A Novel Omnidirectional Stereo Vision System with a Single Camera, International Conference on Image Analysis and Recognition, Portugal, September 2006.  <a href="http://vision.ai.uiuc.edu/publications/A_Novel_Omnidirectional.pdf">Full Text</a></li>
</ol>
<h3>2005</h3>
<ol>
<li> Hongcheng Wang and Narendra Ahuja, Rank-R Approximation of Tensors Using Image-as-Matrix Representation, IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2005 <a href="../../%7Ewanghc/papers/cvpr05_rankR.pdf">Full Text</a></li>
<li> Hongcheng Wang, Qing Wu,           Lin Shi, Yizhou Yu and Narendra Ahuja, Out-of-Core Tensor Approximation of Multi-Dimensional Matrices of Visual Data,  in ACM SIGGRAPH 2005. <a href="../../%7Ewanghc/papers/siggraph05_tensor_hongcheng.pdf"> Full Text</a></li>
<li> Hongcheng Wang, Ning Xu, Ramesh Raskar and Narendra Ahuja, Videoshop: A New Framework for Video Editing in Spatio-Temporal Gradient Domain, IEEE, Video Proceedings, International Conference on Computer Vision and Pattern Recognition, 2005  <a href="http://vision.ai.uiuc.edu/~wanghc/papers/editing-graphical_model.pdf">Full Text</a></li>
<li>Tianli Yu, Jiebo Luo and Narendra Ahuja Shape Regularized Active Contour using Iterative Global Search and Local Optimization, accepted by CVPR 2005, June 20-26 2005, San Diego, CA, USA <a href="../../newpubs/Tianli_cvpr05.pdf">Full Text<br />
</a></li>
<li>Tianli Yu, Jiebo Luo, Amit Singhal, and Narendra Ahuja Shape regularized active contour based on dynamic programming for anatomical structure segmentation, SPIE Medical Imaging 2005, February 12-17 2005, San Diego, CA, USA <a href="../../newpubs/Tianli_spie05.pdf">Full Text<br />
</a></li>
<li>Alexia Briassouli, Narendra Ahuja, Integrated Spatial and Frequency Domain 2D Motion Segmentation and Estimation, ICCV 2005, Beijing, China, [<a href="../../publications/iccv2005_alexia_ahuja_freq.pdf">Full Text</a>]</li>
</ol>
<h3>2004</h3>
<ol>
<li>M. Aggarwal and N. Ahuja, Split Aperture Imaging for High Dynamic Range, International Journal on Computer Vision, Vol. 58, No. 1, June 2004, 7-17. <a href="http://www.springerlink.com/app/home/contribution.asp?wasp=bf93ec8571a8499dbc209ad557f7bff0&amp;referrer=parent&amp;backto=issue,2,6;journal,17,115;linkingpublicationresults,1:100272,1">Abstract and Full Text</a></li>
<li>Che-Bin Liu and Narendra Ahuja. Vision Based Fire Detection. Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 4, pp. 134-137, 08 23 &#8211; 08, 2004, Cambridge UK <a href="../../newpubs/icpr04_fire.pdf">Full Text<br />
</a></li>
<li>Chunyu Gao, Narendra Ahuja, Single camera stereo using planar parallel plate , Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 4, pp. 108-111 08, 2004, Cambridge UK <a href="../../newpubs/Stereo_PPP_Gao.pdf">Full Text<br />
</a></li>
<li>Alexia Briassouli, Narendra Ahuja, Fusion of frequency and spatial domain information for motion analysis , Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 2, pp. 175-178, 2004 Cambridge UK <a href="../../newpubs/870_Briassouli_A.ps">Full Text<br />
</a></li>
<li>Hongcheng Wang and Narendra Ahuja, Compact Representation of Multidimensional Data Using Tensor Rank-One Decomposition, Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 1, pp. 44-47 08 23 &#8211; 08, 2004, Cambridge UK<a href="../../%7Ewanghc/research/icpr04_rank1.html"> Full Text</a></li>
<li>Ning Xu and Narendra Ahuja. A Three-view Matching Algorithm Considering Foreshortening Effects. In Proceedings of International Conference on Computer Vision, Pattern Recognition and Image Processing, pp. 635-638, Cary, NC. September 2003. <a href="../../%7Eningxu/publications.html">Abstract and Full Text </a></li>
<li>Ning Xu and Narendra Ahuja. Generating Omnifocus Images Using Graph Cuts and a New Focus Measure. Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 4. pp 697-700, 08, 2004 Cambridge UK. <a href="http://ieeexplore.ieee.org/xpl/abs_free.jsp?arNumber=1333868">Full Text</a></li>
<li>A. Jagmohan, M. Singh, and N. Ahuja, Dense Two View Stereo Matching Using Kernel Maximum Likelihood Estimation, Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 3, pp. 28-31, 08, 2004 Cambridge UK <a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=1334461&amp;isnumber=29387&amp;punumber=9258&amp;k2dockey=1334461@ieeecnfs&amp;query=%28%09dense+stereo+matching+using+kernel+maximum+likelihood+estimation%3Cin%3Emetadata%29&amp;pos=0">Abstract</a></li>
<li>M. Singh, H. Arora and N. Ahuja, Robust Registration and Tracking Using Kernel Density Correlation, 2004 Conference on Computer Vision and Pattern Recognition Workshop on Image and Video Registration, CVPRW&#8217;04 Volume 11, p. 174, 06 27 &#8211; 07 02, 2004, Washington, D.C., USA <a href="../../%7Emsingh/ivr2004.pdf">Full Text</a></li>
<li>Che-bin Liu, Narendra Ahuja, A Model for Dynamic Shape and Its Applications , 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR&#8217;04) Volume 2, pp. 129-134 06 27 &#8211; 07 02, 2004, Washington, D.C., USA <a href="../../newpubs/cvpr04_dyn.pdf">Full Text<br />
</a></li>
<li> Tianli Yu, Ning Xu and Narendra Ahuja, Recovering Shape and Reflectance Model of Non-Lambertian Objects from Multiple Views, 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR&#8217;04) Volume 2, pp. 226-233, 06 27 &#8211; 07 02, 2004, Washington, D.C., USA<a href="../../newpubs/CVPR2004Tianli.pdf"> Full Text</a></li>
<li>Tianli Yu, Ning Xu and Narendra Ahuja, Shape and View Independent Reflectance Map from Multiple Views, ECCV 2004, LNCS 3024, pp. 602-616, May 11-14, Prague. <a href="../../newpubs/ECCV2004Tianli.pdf">Full Text</a></li>
<li>M. Singh, H. Arora and N. Ahuja, A Robust Probabilistic Estimation Framework for Parametric Image Models, European Conference on Computer Vision, LNCS 3021, pp. 508-522, 2004. <a href="../../%7Emsingh/eccv2004.pdf">Full Text</a></li>
<li>A. Sehgal, A. Jagmohan, N. Ahuja Wyner-Ziv Coding of Video: Applications to Error Resilience, IEEE Trans. Multimedia, April 2004, pages 249 &#8211; 258. <a href="../../newpubs/stmm.pdf">Full Text</a></li>
<li>Ning Xu, Tianli Yu and Narendra Ahuja. Shape from color consistency using node cut. In Proceedings of Asian Conference on Computer Vision, Jeju Island, Korea. January 2004. <a href="../../%7Eningxu/publications.html">Abstract and Full Text</a></li>
</ol>
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=436</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Conference Papers</title>
		<link>http://vision.ai.uiuc.edu/?p=385</link>
		<comments>http://vision.ai.uiuc.edu/?p=385#comments</comments>
		<pubDate>Thu, 07 May 2009 06:53:40 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=385</guid>
		<description><![CDATA[<p style="margin-right: 7.5pt;"><span style="color: #000000;"><span id="more-385"></span>N. Ahuja, Connectivity in Lattices and Mosaics, Proc. 4th Int. Joint Conf. on Pattern Recognition, Kyoto, Japan, November 1978, 488-493. <a href="../../abstracts/pub3_3_1_a1178ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and A. Rosenfeld, Mosaic Models for Textures, 18th IEEE Conf. Decision and Control, Fort Lauderdale, December 1979, Proc.&#8230;</span></p>]]></description>
			<content:encoded><![CDATA[<p style="margin-right: 7.5pt;"><span style="color: #000000;"><span id="more-385"></span>N. Ahuja, Connectivity in Lattices and Mosaics, Proc. 4th Int. Joint Conf. on Pattern Recognition, Kyoto, Japan, November 1978, 488-493. <a href="../../abstracts/pub3_3_1_a1178ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and A. Rosenfeld, Mosaic Models for Textures, 18th IEEE Conf. Decision and Control, Fort Lauderdale, December 1979, Proc. 60-70. <a href="../../abstracts/pub3_3_1_a1279ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, R. T. Chien, R. Yen and N. Bridwell, Interference Detection and Collision Avoidance Among Three Dimensional Objects, Proc. 1st National Conf. on Artificial Intelligence, Stanford University, August 19-21, 1980, 44-48. <a href="../../abstracts/pub8_1_1_a0880ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, Dot Pattern Processing Using Voronoi Polygons as Neighborhoods, Proc. 5th Int. Joint Conf. on Pattern Recognition, Miami Beach, December 1-4, 1980, 1122-1127. <a href="../../abstracts/pub3_2_1_a1280ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, Dot Pattern Processing Using Voronoi Polygons as Neighborhoods, presented at IEEE Image Segmentation Workshop, VPI, Blacksburg, May 15-16, 1980. <a href="../../abstracts/pub3_3_1_a0580ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, Approaches to Recursive Image Decomposition, Proc. IEEE Conf. on Pattern Recognition and Image Processing, Dallas, August 3-5, 1981, 75-80.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, N. Bridwell, C. Nash and T. S. Huang, Three-Dimensional Robot Vision, Proc. IEEE International Workshop on Industrial Applications of Machine Vision, Raleigh-Durham, May 3-5, 1982, 206-213. <a href="../../abstracts/pub4_2_2_a0582ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and S. Swamy, Multiprocessor Pyramids for Bottom-up Image Analysis, Proc. IEEE Conf. on Pattern Recognition and Image Processing, Las Vegas, June 13-17, 1982, 380-385. <a href="../../abstracts/pub7_2_1_a0682Ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and S. Swamy, Interleaved Pyramid Architectures for Bottom-Up Image Analysis, Proc. 6th Int. Conf. on Pattern Recognition, Munich, Germany, October 19-22, 1982, 388-391. <a href="../../abstracts/pub7_2_1_a1082Ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Tuceryan and N. Ahuja, Segmentation of Dot Patterns Containing Homogeneous Clusters, Proc. 6th Int. Conf. on Pattern Recognition, Munich, Germany, October 19-22, 1982, 392-394. <a href="../../abstracts/pub3_2_1_a1082tuceryan.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">B. An, N. Ahuja and B. Schachter, Image Representation Using Voronoi Tessellation, Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Washington, D. C., June 19-23, 1983, 188-189. <a href="../../abstracts/pub4_1_1_a0683an.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">C. Nash and N. Ahuja, Updating Octrees of Translating Objects, Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Washington, D. C., June 19-23, 1983, 380-381. <a href="../../abstracts/pub4_2_2_a0683nash.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Tuceryan and N. Ahuja, Perceptual Segmentation of Nonhomogeneous Dot Patterns, Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Washington D. C., June 19-23, 1983, 47-52. <a href="../../abstracts/pub3_2_1_a0683tuceryan.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">H. C. Chen, N. Ahuja and T. S. Huang, Septree Representation of Moving Objects Using Hexagonal Cylinderical Decomposition, Proc. Intelligent Robots: 3rd Int. Conf. on Robot Vision and Sensory Controls, Cambridge, November 6-10, 1983. <a href="../../abstracts/pub4_1_1_a1183chen.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, Dot Pattern Perception, Proc. National Symposium on Image Signal Processing, Indian Institute of Technology, Madras, India, January 9-12, 1984.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, Efficient Planar Embedding of Trees for VLSI Layouts, IEEE Int. Conf. on Systems, Man and Cybernetics, Bombay/New Delhi, India, December 30, 1983 &#8211; January 7, 1984, Proc. 28-32. <a href="../../abstracts/pub7_2_1_a1283Ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, Efficient Planar Embedding of Trees for VLSI layouts, Proc. 7th Int. Conf. on Pattern Recognition, Montreal, Canada, July 30 &#8211; Aug 2, 1984, 460-464. <a href="../../abstracts/pub7_2_1_a0884Ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and W. Hoff, Augmented Medial Axis Transform, Proc. IEEE Workshop on Computer Vision, Annapolis, Maryland, April 30 &#8211; May 2, 1984, 251-256. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">C. H. Debrunner and N. Ahuja, A Bottom-up Minimum Spanning Tree Algorithm for Multiprocessor Pyramid Architectures, Proc. Workshop on Algorithm-Guided Parallel Architectures for Automatic Target Recognition, Leesburg, Virginia, July 16-18, 1984, 51-78. <a href="../../abstracts/pub7_2_1_a0784Debrunner.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">W. Osse and N. Ahuja, Efficient Octree Representation of Moving Objects, Proc. 7th Int. Conf. on Pattern Recognition, Montreal, Canada, July 30 &#8211; Aug 2, 1984, 821-823. <a href="../../abstracts/pub4_2_2_a0784osse.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Ruff and N. Ahuja, Robot Path Planning in a Three-Dimensional Environment, Proc. 7th Int. Conf. on Pattern Recognition, Montreal, Canada, July 30 &#8211; Aug 2, 1984, 188-191. <a href="../../abstracts/pub8_1_2_a0784ruff.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and W. Hoff, Augmented Medial Axis Transform, Proc. 7th Int. Conf. on Pattern Recognition, Montreal, Canada, July 30 &#8211; Aug 2, 1984, 336-338. <a href="../../abstracts/pub4_1_1_a0784ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and M. Tuceryan, Representing Geometric Structure in Dot Patterns, Geobild ©85, Georgenthal, DDR, January 14-18, 1985.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Veenstra and N. Ahuja, Octree Generation from Silhouette Views of an Object, IEEE Int. Conf. on Robotics and Automation, St. Louis, March 25-28, 1985, 843-848. <a href="../../abstracts/pub4_2_1_a0385veenstra.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and M. Tuceryan, Extracting Perceptual Structure in Dot Patterns, IEEE Workshop on Languages for Automation: Cognitive Aspects in Information Processing, Palma de Mallorca, Spain, June 28-29, 1985, 86-91.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">W. Hoff and N. Ahuja, Generating Range Map from Stereo Images, 4th Scandinavian Conf. on Image Analysis, Trondheim, Norway, June 18-20, 1985, 761-768.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng and N. Ahuja, Octree Representation of Objects in Arbitrary Motion, IEEE Conf. on Computer Vision and Pattern Recognition, San Francisco, June 9-13, 1985, 524-529. <a href="../../abstracts/pub4_2_2_a0685weng.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">W. Hoff and N. Ahuja, Depth from Stereo, IEEE Conf. on Computer Vision and Pattern Recognition, San Francisco, June 9-13, 1985.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Sharma, J. H. Patel and N. Ahuja, NETRA: A Multiprocessor Computer Architecture for Image Understanding, IEEE Workshop on Computer Architecture for Pattern Analysis and Image Database Management, Miami, November 18-20, 1985, 92-98. <a href="../../abstracts/pub7_2_1_a1185sharma.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">W. Hoff and N. Ahuja, Surfaces from Stereo, Proc. DARPA Image Understanding Workshop, Miami, December 9-10, 1985, 98-106.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Veenstra and N. Ahuja, Efficient Octree Generation from Silhouettes, IEEE Conf. on Computer Vision and Pattern Recognition, Miami, June 22-26, 1986, 537-542. <a href="../../abstracts/pub4_2_1_a0686veenstra.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">T. S. Huang, J. Weng and N. Ahuja, 3-D Motion from Image Sequences: Modeling, Understanding and Prediction, IEEE Workshop on Motion: Representation and Analysis, Kiawah Island, May 6-9, 1986, 516-518.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">W. Hoff and N. Ahuja, Surfaces from Stereo, 8th International Conference on Pattern Recognition, Paris, France, October 28-31, 1986, 516-518. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, N. Ahuja and T. Huang, 3-D Motion from Image Sequences: Modeling, Estimation and Prediction, 8th International Conference on Pattern Recognition, Paris, France, October 28-31, 1986, 1107-1109. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">W. Hoff and N. Ahuja, Extracting Surfaces from Stereo Images: An Integrated Approach, First International Conference on Computer Vision, London, England, June 8-11, 1987, 284-294.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">D. Blostein and N. Ahuja, Representation and Three-dimensional Interpretation of Image Texture: An Integrated Approach, First International Conference on Computer Vision, London, England, June 8-11, 1987, 444-449.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, T. Huang and N. Ahuja, Error Analysis of Motion Parameter Estimation from Image Sequences, First International Conference on Computer Vision, London, England, June 8-11, 1987, 703-707.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Srivastava and N. Ahuja, An Algorithm for Generating Octrees from Object Silhouettes in Perspective Views, Proc. IEEE Workshop on Computer Vision, Miami Beach, November 30 -December 2, 1987, 363-365. <a href="../../abstracts/pub1_5_1_a1287srivastava.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, T. Huang and N. Ahuja, A Two-Step Approach to Optimal Motion and Structure Estimation from Feature Correspondences, Proc. IEEE Workshop on Computer Vision, Miami Beach, November 30 -December 2, 1987, 355-357. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Y. Hwang and N. Ahuja, Path Planning Using a Potential Field Representation, Proc. IEEE International Conference on Robotics and Automation, Philadelphia, April 25-29, 1988, 648-649. <a href="../../abstracts/pub8_1_2_a0488hwang.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, N. Ahuja and T. Huang, Closed-Form Solution+Maximum Likelihood: A Robust Approach to Motion and Structure Estimation, IEEE Conf. on Computer Vision and Pattern Recognition, Ann Arbor, June 1988, 381-386. <a href="../../abstracts/pub1_2_1_a0688wah.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and T. Huang, IU at UI: An Overview and an Example on Shape from Texture, Proc. DARPA Image Understanding Workshop, Boston, April 6-8, 1988, 222-253.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, Y. Liu, T. Huang and N. Ahuja, Determining Motion/Structure from Line Correspondences: A Robust Linear Algorithm and Uniqueness Theorems, IEEE Conf. on Computer Vision and Pattern Recognition, Ann Arbor, June 1988, 387-392. <a href="../../abstracts/pub1_2_1_a0688wlha.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, N. Ahuja and T. Huang, Motion and Structure from Point Correspondences: A Robust Algorithm for Planar Case with Error Estimation, 9th Int. Conference on Pattern Recognition, Rome, Italy, November, 1988, 247-251.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, N. Ahuja and T. Huang, Two-View Matching, Second International Conference on Computer Vision, Tarpon Springs, December 5-8, 1988, 64-73. <a href="../../abstracts/pub1_3a_1_a1288wah.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">L. Abbott and N. Ahuja, Surface Reconstruction by Dynamic Integration of Focus, Camera Vergence and Stereo, Second International Conference on Computer Vision, Tarpon Springs, December 5-8, 1988, 532-543. <a href="../../abstracts/pub1_2_1_a1288aa.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. N. Choudhary, S. Das, N. Ahuja and J. Patel, Surface Reconstruction from Stereo Images: An Implementation on a Hypercube Multiprocessor, Fourth Conference on Hypercube Concurrent Computers and Applications, Monterey, CA, March 1989, 1045-1052. <a href="../../abstracts/pub7_2_1_a0389Choudhary.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, T. Huang and N. Ahuja, Motion from Images: Image Matching, Parameter Estimation and Intrinsic Stability, IEEE Workshop on Visual Motion, Irvine, March 20-22, 1989, 359-366. <a href="../../abstracts/pub1_2_1_a0389wha.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Y. Hwang and N. Ahuja, Path Planning using a Potential Field Representation, IEEE Conference on Computer Vision and Pattern Recognition, San Diego, June 4-8, 1989, 569-575. <a href="../../abstracts/pub8_1_2_a0689hwang.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, N. Ahuja and T. Huang, Optimal Motion and Structure Estimation, IEEE Conference on Computer Vision and Pattern Recognition, San Diego, June 4-8, 1989, 144-152. <a href="../../abstracts/pub1_2_1_a0689wah.htm">Abstract &amp; PDF Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das, A. L. Abbott and N. Ahuja, Surface Reconstruction from Focus and Stereo, Proc. 5th International Conference on Image Analysis and Processing, Positano, Italy, September 1989. <a href="../../abstracts/pub1_5_1_a0989das.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das and N. Ahuja, Integrating Mutilresolution Image Acquisition and Coarse-to-fine Surface Reconstruction from Stereo, IEEE Workshop on Interpretation of 3D Scenes, Austin, November 26-29, 1989, 9-15.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">C. Debrunner and N. Ahuja, A Hankel Matrix Based Motion Estimation Algorithm, 10th International Conference on Pattern Recognition &#8211; Computer Vision, Vol. 1, Atlantic City, June 1990, 384-389.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. N. Choudhary, S. Das, N. Ahuja, and J. H. Patel, A Reconfigurable and Hierarchical Parallel Processing Architecture: Performance Results for Stereo Vision, 10th International Conference on Pattern Recognition &#8211; Computer Vision, Vol. 2, Atlantic City, June 1990, 389-393.<a href="../../abstracts/pub7_2_1_a0690Choudhary.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, T. S. Huang, and N. Ahuja, Estimating Motion and Structure from Line Matches, 10th International Conference on Pattern Recognition &#8211; Computer Vision, Vol. 1, Atlantic City, June 1990, 168-172. <a href="../../abstracts/pub1_3_2_a0690wha.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das and N. Ahuja, Active Stereo Based Image Acquisition and Surface Reconstruction, Proc. 5th IEEE Int. Symposium on Intelligent Control, Philadelphia, Sep 5-7, 1990, 233-238. <a href="../../abstracts/pub1_5_1_a0990das.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, IU at UI: An Overview of Research during 1988-90, Proc. DARPA Image Understanding Workshop, Pittsburgh, Sep 11-13, 1990, 134-140.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das and N. Ahuja, Integrated Multiresolution Image Acquisition and Surface Reconstruction from Active Stereo, Proc. DARPA Image Understanding Workshop, Pittsburgh, Sep 11-13, 1990, 418-422. <a href="../../abstracts/pub1_5_1_a0990das2.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">E. Altman and N. Ahuja, A Dynamical Systems Approach to Integration in Stereo, Proc. DARPA Image Understanding Workshop, Pittsburgh, Sep 11-13, 1990, 423-427.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">X. Hu and N. Ahuja, A Robust Algorithm for Plane Motion Estimation, First International Conf. on Automation, Robotics and Computer Vision, Singapore, Sep 18-21, 1990.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das and N. Ahuja, Multiresolution Image Acquisition and Surface Reconstruction, Proc. 3rd Int. Conf. on Computer Vision, Osaka, Japan, Dec 4-7, 1990, 485-488. <a href="../../abstracts/pub1_5_1_a1290das.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. L. Abbott and N. Ahuja, Active Surface Reconstruction by Integrating Focus, Vergence, Stereo and Camera Calibration, Proc. 3rd Int. Conf. on Computer Vision, Osaka, Japan, Dec 4-7, 1990, 489-492. <a href="../../abstracts/pub1_5_1_a1290abbott.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das and N. Ahuja, Dynamic Integration of Visual Cues for Position Estimation, Proc. SPIE Conference on Intelligent Robots and Computer Vision IX: Neural, Biological and 3-D Methods, Vol. 1382, Boston, November 1990, 341-352. <a href="../../abstracts/pub1_5_1_a1190das.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Chuang and N. Ahuja, Path Planning using Newtonian Potential, Proc. IEEE Conference on Robotics and Automation, Sacramento, April 8-10, 1991, 558-563. <a href="../../abstracts/pub8_1_2_a0491chuang.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, Issues in the Design of Integrated Vision Systems, Int. Joint Conference on Artificial Intelligence Workshop W-7, Sydney, Australia, August 24, 1991, Part III. <a href="../../abstracts/pub1_5_1_a0891ahuja.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">X. Hu and N. Ahuja, Motion and Structure from Orthographic Images, Proc. IEEE Conference on Acoustics, Speech and Signal Processing, Toronto, Canada, May 14-17, 1991, 2445-2448.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Sull and N. Ahuja, Estimation of Motion and Structure of Planar Surfaces from a Sequence of Monocular Images, Proc. IEEE Conference on Computer Vision and Pattern Recognition, Maui, Hawaii, June 1991, 732-733.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Sull and N. Ahuja, Segmentation and Estimation of Structure and Motion of Textured Piecewise Planar Surfaces, Proc. IEEE Workshop on Visual Motion, Princeton, New Jersey, October 7-9, 1991, 274-279.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Chuang and N. Ahuja, Skeletonization Using a Generalized Potential Field Model, Proc. 8th Israeli Symposium on Artificial Intelligence and Computer Vision, December 30-31, 1991. <a href="../../abstracts/pub4_1_1_a1291chuang.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and T. S. Huang, IU at UI: An Overview of Research during 1990-91, Proc. DARPA Image Understanding Workshop, San Diego, January 27-29, 1992, 127-135.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Sull and N. Ahuja, Integrated 3D Recovery and Visualization of Flight Image Sequences, Proc. DARPA Image Understanding Workshop, San Diego, January 27-29, 1992, 473-477.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">C. Debrunner and N. Ahuja, Motion and Structure Factorization and Segmentation of Long Multiple Motion Image Sequences, Proc. DARPA Image Understanding Workshop, San Diego, January 27-29, 1992, 543-547.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">C. Debrunner and N. Ahuja, Motion and Strucuture Factorization and Segmentation of Long Multiple Motion Image Sequences, Second European Conference on Computer Vision, Santa Margherita Ligure, Italy, May 18-23, 1992, 217-221. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, N. Ahuja and T. S. Huang, Cresceptron: A Self-Organizing Neural Network Which Grows Adaptively, Proc. Int. Joint Conf. on Neural Networks, Vol. 1, Baltimore, Maryland, June 1992, 576-581.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Tuceryan, A. K. Jain and N. Ahuja, Supervised Classification of Early Perceptual Structure in Dot Patterns, 11th Int. Conf. on Pattern Recognition &#8211; Computer Vision, The Hague, August 30 &#8211; September 2, 1992, B88-91.  <a href="../../abstracts/pub3_2_1_a0992tuceryan.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">X. Hu and N. Ahuja, Estimating Motion of Constant Acceleration from Image Sequences, 11th Int. Conf. on Pattern Recognition &#8211; Computer Vision, The Hague, August 30 &#8211; September 2, 1992, A655-659. <a href="../../abstracts/pub1_2_1_a0892ha.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Chuang and N. Ahuja, Robot Arm Path Planning Using the Newtonian Potentiial, Int. Conf. on Control and Robotics, Vancouver, Canada, August 14-17, 1992, 14-17. <a href="../../abstracts/pub8_1_2_a0892chuang.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">D. Hougen and N. Ahuja, Integration of Stereo and Shape from Shading using Color, Proc. Second International Conf. on Automation, Robotics and Computer Vision, Vol 1, Singapore, September 15-18 1992, pp. CV-6.6.1 &#8211; CV-6.6.5.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. L. Abbott and N. Ahuja, The University of Illinois Active Vision System, Proc. SPIE Conf. on Applications of Artificial Intelligence XI: Algorithms, Techniques and Active Vision, Boston, November 1992, 757-768. <a href="../../abstracts/pub1_5_1_a1192abbott.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">X. Hu and N. Ahuja, Long Image Sequence Motion Analysis Using Polynomial Motion Models, IAPR Workshop on Machine Vision Applications, Dec 7-9, Tokyo, 1992, 109-113.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Z. Hong and N. Ahuja, Target Tracking from Binocular Image Sequence Using the Autoregressive Moving Average Model IAPR Workshop on Machine Vision Applications, Dec 7-9, Tokyo, 1992, 317-320.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Z. Hong and N. Ahuja, Target Tracking and Cumulative Depth Map Generation from Binocular Image Sequences, Proc. 3rd Int. Conf. on Intelligent Autonomous Systems, Pittsburgh, PA, February 1993. <a href="../../abstracts/pub1_5_1_a0293hong.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">X. Hu and N. Ahuja, Feature Extraction and Matching as Signal Detection, Proc. SPIE Conf. on Applications of Artificial Intelligence XI: Machine Vision and Robotics, 1993, 51-65. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and T. S. Huang, IU at UI: An Overview of Research during 1991-92, Proc. DARPA Image Understanding Workshop, Washington, April 19-21, 1993, 117-125.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Thirumalai and N. Ahuja, Detection and Segmentation of Feature Trajectories in Multiple, Discontinuous Motion Image Sequences, Proc. DARPA Image Understanding Workshop, Washington, April 19-21, 1993, 621-628.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, A Transform for Detection of Multiscale Image Structure, Proc. DARPA Image Understanding Workshop, Washington, April 19-21, 1993, 893-903.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Krishnan and N. Ahuja, Range Estimation from Focus Using a Non-frontal Imaging Camera, Proc. DARPA Image Understanding Workshop, Washington, April 19-21, 1993, 959-965.  <a href="http://vision.ai.uiuc.edu/publications/RangeEstimationDARPA.pdf.PDF">Full Text</a><br />
</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, N. Ahuja and T. Huang, Learning, Recognition and Segmentation of 3-D Objects from 2-D Images, 4th Int. Conf. on Computer Vision, Berlin, Germany, May 11-14, 1993, 121-128. <a href="../../abstracts/5_1_2_593weng.html">Abstract and full-text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">D. Hougen and N. Ahuja, Estimation of the Light Source Distribution and its Use in Shape Recovery from Stereo and Shading, 4th Int. Conf. on Computer Vision, Berlin, Germany, May 11-14, 1993, 148-155.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino, A Study of Interference Detection in Virtual Cooperative Workspace for an Operator Assistant, Workshop on Human Interface, Tokyo, May 24, 1993, vol. 8, no. 2, 247-254. (Part in English, part in Japanese). Also published as a Technical Report of Technical Meeting on Human Communication Engineering of IEICE, HC93-17. (in Japanese). <a href="../../abstracts/pub8_1_1_a0593kitamura.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, A Transform for Detection of Multiscale Image Structure, Proc. IEEE Conference on Computer Vision and Pattern Recognition, New York, June 1993, 780-781.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das and N. Ahuja, Performance Analysis of Focus, Vergence and Stereo as Depth Cues for Active Vision, Proc. IEEE Conference on Computer Vision and Pattern Recognition, New York, June 1993, 194-199. <a href="../../abstracts/pub1_5_1_a0693dasahuja.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Krishnan and N. Ahuja, Range Estimation from Focus Using a Non-frontal Imaging Camera, Proc. American Association for Artificial Intelligence Conference, Washington, July 1993, 830-835.  <a href="http://www.aaai.org/Papers/AAAI/1993/AAAI93-124.pdf">Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Krishnan and N. Ahuja, Use of a Non-frontal Camera for Extended Depth of Field in Wide Scenes, Proc. SPIE Conf. on Intelligent Robots and Computer Vision XII: Algorithms and Techniques, September 1993, 62-72.  <a href="http://vision.ai.uiuc.edu/publications/Wide Scenes.PDF">Full Text</a><br />
</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">X. Hu and N. Ahuja, Necessary and Sufficient Conditions for a Unique Solution of Plane Motion and Structure, Proc. 5th Int. Conference on Computer Analysis of Images and Patterns, Budapest, Hungary, September 13-15, 1993.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Srikanth and N. Ahuja, Parallel Distributed Detection of Feature Trajectories in Multiple Discontinuous Motion Image Sequences, Int. Joint Conf. on Neural Networks, Vol. 3, Nagoya, Japan, October 25-29, 1993, 2069-2072.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino, Interference Detection Among Objects for Operator Assistance in Virtual Cooperative Workspace, 2nd IEEE Int. Workshop on Robot and Human Communication (RO-MAN©93), November 3-5, 1993, Tokyo, Japan, 442-446. <a href="../../abstracts/pub8_1_1_a1193kitamura1.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Krishnan and N. Ahuja, Stereo Display of Large Scenes from Monocular Images Using a Novel Non-frontal Camera, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 133-136.  <a href="http://vision.ai.uiuc.edu/publications/Stereo Display.PDF">Full Text</a><br />
</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Tabb and N. Ahuja, Detection and Representation of Multiscale Low-level Image Structure Using a New Transform, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 155-158.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Sull and N. Ahuja, Integrated Matching and Segmentation of Multiple Features in Two Views, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 213-216.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Chuang, W. Lai and N. Ahuja, Skeletal Representation of 3D Objects Based on Generalized Potential Field, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 236-239. <a href="../../abstracts/pub4_1_1_a1193chuang.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">K. Iwasaki, N. Ahuja and F. Kishino, Coarse-to-fine Pose Estimation Using Coarse-to-fine Filtering, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 664-667.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">X. Hu and N. Ahuja, Mirror Uncertainity and Uniqueness Conditions for Determining Shape and Motion from Orthographic Projection, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 272-275.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">E. Altman and N. Ahuja and F. Kishino, Hand Trajectory Recognition Using Dynamical Systems, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 321-324.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino, Efficient Interference Detection Among Objects Using Octree and Polyhedral Shape Representations, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 775-779. <a href="../../abstracts/pub8_1_1_a1193kitamura.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Y. Kitamura, J. Ohya, N. Ahuja and F. Kishino, Computational Taxonomy and Recognition of Facial Expressions, First Asian Conference on Computer Vision, November 23-25, 1993, Osaka, Japan, 434-437.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Sull and N. Ahuja, Integrated 3D Analysis of Flight Image Sequences, European Conference on Computer Vision, Vol. 1, Sweden, May 1994, 211-216.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das and N. Ahuja, Performance Evaluation of Depth Cues for Active Vision, NSF/DARPA Workshop on Performance vs. Methodology in Computer Vision, Seattle, WA, June 1994, 227-236. <a href="../../abstracts/pub1_5_1_a0694das.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Darrell R. Hougen and Narendra Ahuja, Adaptive Polynomial Modelling of the Reflectance Map for Shape Estimation from Stereo and Shading, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, Seattle, WA, June 1994, 991-994.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Weng, Y. Cui, N. Ahuja and A. Singh, Integration of Transitory Image Sequences, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, Seattle, WA, June 20-23, 1994, 966-969. <a href="../../abstracts/pub1_2_1_a0694wcas.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Tabb and N. Ahuja, A Multiscale Region-Based Approach to Image Matching, Proceedings International Conference on Pattern Recognition, Jerusalem, Israel, October 10-13, 1994, 415-419. <a href="../../abstracts/pub1_3_1_a1094ta.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino, Efficient Collision Detection Among Objects in Arbitrary Motion Using Multiple Shape Representations, Proceedings International Conference on Pattern Recognition, Jerusalem, Israel, October 10-13, 1994, 390-396.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Joshi, N. Ahuja and J. Ponce, Towards Structure and Motion Estimation From Dynamic Silhouettes, Proc. IEEE Workshop on Motion of Nonrigid and Articulated Objects, Austin, TX November 11-12, 1994, 166-171.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Sull and N. Ahuja, Estimation and Segmentation of Displacement Field Using Multiple Features, International Conference on Image Processing, Austin, TX, November 14-17, 1994, 53-57.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and T. S. Huang, IU at UI: An Overview of Research During 1993-94, Proceedings DARPA Image Understanding Workshop, November 1994, 133-142.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Das and N. Ahuja, Performance Evaluation of Depth Cues in Active Stereo Vision, Proceedings DARPA Image Understanding Workshop, November 1994, 1041-1047. <a href="../../abstracts/pub1_5_1_a1194das.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Tabb and N. Ahuja, Multiscale Image Segmentation Using a Recent Transform, Proceedings DARPA Image Understanding Workshop, November 1994, 1523-1530.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Joshi, N. Ahuja and J. Ponce, Silhouette-based Structure and Motion Estimation of a Smooth Object, Proceedings DARPA Image Understanding Workshop, November 1994, 1237-1243.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Krishnan and N. Ahuja, Obtaining Focused Images Using a Non-frontal Imaging Camera, Proceedings DARPA Image Understanding Workshop, November 1994, 617-620.  <a href="http://vision.ai.uiuc.edu/publications/Obtaining Focused Images Usinga Non-frontal Imaging Camera.pdf.PDF">Full Text</a><br />
</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Joshi, N. Ahuja and J. Ponce, Structure and Motion Estimation from Dynamic Silhouettes Under Perspective Projection, 5th Int. Conf. on Computer Vision, Cambridge, Massachusetts, June 20-23, 1995, 290-295. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Charan and N. Ahuja, Feature Guided Pixel Matching and Segmentation in Motion Image Sequences, Int. Symposium on Computer Vision, November 19-21, 1995, Coral Gables, Florida, I277-I281.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Charan and N. Ahuja, Curve Detection in 3D Dot Patterns, 2nd Asian Conf. on Computer Vision, Singapore, December 5-8, 1995, 199-203.<a href="../../abstracts/pub3_2_1_a1295charan.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and R. Charan, Pixel Matching and Motion Segmentation in Image Sequences, 2nd Asian Conf. on Computer Vision, Singapore, December 5-8, 1995, I310-I314.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja, A Nonfrontal Imaging Camera, Int. Computer Science Conference, Hong Kong, December 11-13, 1995, p. 397.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and T. Huang, IU at UI: An Overview of Research During 1994-95, Proceedings Image Understanding Workshop, Palm Springs, CA, February 12-15, 1996, 159-164.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">P. Bajcsy and N. Ahuja, Segmentation of Multidimensional Images, Proceedings Image Understanding Workshop, Palm Springs, CA, February 12-15, 1996, 937-942.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and S. A. Jackson, Multiscale Region Detection, Proceedings Image Understanding Workshop, Palm Springs, CA, February 12-15, 1996, 961-967.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">D. Hougen and N. Ahuja, Shape from Appearance: A Statistical Approach to Surface Shape Estimation, Proceedings Image Understanding Workshop, Palm Springs, CA, February 12-15, 1996, 1095-1101.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">D. Hougen and N. Ahuja, Resolution and Accuracy of Stereo, Focus, and Shading Methods, Proceedings Image Understanding Workshop, Palm Springs, CA, February 12-15, 1996, 1133-1139.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">D. Hougen and N. Ahuja, Shape from Appearance: A Statistical Approach to Surface Shape Estimation, Proceedings European Conference on Computer Vision, Cambridge, England, April 1996, 421-429.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Krishnan and N. Ahuja, Panoramic Image Acquisition, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, CA, June 18-20, 1996, 379-384.  <a href="http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=00517100">Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">P. Bajcsy and N. Ahuja, Uniformity and Homogeneity-based Hierarchical Clustering, Proceedings International Conference on Pattern Recognition, Vienna, Austria, August 26-29, 1996, 96-100. <a href="../../abstracts/pub3_2_1_a0896bajcsy.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Jackson and N. Ahuja, Elliptical Gaussian Filters, Proceedings International Conference on Pattern Recognition, Vienna, Austria, August 26-29, 1996, 775-779.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Courtney and N. Ahuja, Segmentation of Volume Images using a Multiscale Transform, Proceedings International Conference on Pattern Recognition, Vienna, Austria, August 26-29, 1996, 432-436.</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">K. Ratakonda and N. Ahuja, Segmentation Based Reversible Image Compression, International Conference on Image Processing, Geneva, Switzerland, September 1996, 81-84.<a href="../../abstracts/pub6_1_1_a0996krishna.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">K. Ratakonda and N. Ahuja, Discrete Multi-Dimensional Linear Transforms over Connected Arbitrarily Shaped Supports, IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. <a href="../../abstracts/pub6_4_1_a0497krishna.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">4, Munich, Germany, April 1997, 3041-3044. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">P. Bajcsy and N. Ahuja, A New Framework for Hierarchical Segmentation using Homogeneity Analysis, Proc. First International Conference on Scale-Space in Computer Vision, Vol. 1252, Springer LNCS, Utrecht, Netherlands, July 1997, 319-322.  <a href="../../abstracts/pub3_4_1_a0797bajcsy.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and T. S. Huang, IU at UI: An Overview of Research During 1996-97. Proc. Image Understanding Workshop, New Orleans, LA, May 1997, 465-472. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">P. Bajcsy and N. Ahuja, Hierarchical Image Segmentation Using Similarity Analysis, Proc. Image Understanding Workshop, New Orleans, LA, May 1997, 541-545. <a href="../../abstracts/pub3_2_1_a0597bajcsy.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">K. Ratakonda and N. Ahuja, Super Resolution with Region Sensitive Interpolation, Proc. Image Understanding Workshop, New Orleans, LA, May 1997, 537-540. <a href="../../abstracts/pub6_1_2_a0597krishna.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Srinivasa and N. Ahuja, Learning to Fixate on 3D Targets with Uncalibrated Active Cameras, Proc. Image Understanding Workshop, New Orleans, LA, May 1997, 129-134. <a href="../../abstracts/pub1_5_1_a0597Srinivasa.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">S. C. Yoon, K. Ratakonda and N. Ahuja, Region Based Video Coding Using a Multiscale Image Segmentation, International Conference on Image Processing, Vol. II, Santa Barbara, CA, October 26-29, 1997, 510-513. <a href="../../abstracts/pub6_2_1_a1097yoonvc.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">K. Ratakonda, S. C. Yoon and N. Ahuja, Video Compression: Coding the Displaced Frame Difference, International Conference on Image Processing, Vol. I, Santa Barbara, CA, October 26-29, 1997, 353-356. <a href="../../abstracts/pub6_2_1_a1097ratakondavc.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Castano and N. Ahuja, Omnifocused 3D Display Using the Nonfrontal Imaging Camera, Proc. IEEE and ATR Workshop on Computer Vision for Virtual Reality Based Human Communications, CVVRHC©98, Bombay, India, January 3, 1998, 28-34. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">K. Ratakonda and N. Ahuja, Restoring Image Quality Through Structure Preserving De-Noising, 3rd Asian Conference on Computer Vision, Hong Kong, January 8-11, 1998, 33-40. <a href="../../abstracts/pub6_1_3_a0198krishna.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">B. Perrin, N. Ahuja and N. Srinivasa, Learning Multiscale Image Models of 2D Object Classes, 3rd Asian Conference on Computer Vision, Hong Kong, January 8-11, 1998, 323-331.  <a href="../../newpubs/perrinAhujaObjRec.pdf">Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Srinivasa and N. Ahuja, A Learning Approach to Fixate on 3D Targets with Active Cameras, 3rd Asian Conference on Computer Vision, Hong Kong, January 8-11, 1998, 623-631. <a href="../../abstracts/pub1_5_1_a0198Srinivasa.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">P. Bajcsy and N. Ahuja, Hierarchical Texture Segmentation, 3rd Asian Conference on Computer Vision, Hong Kong, January 8-11, 1998, 291-298.  <a href="../../abstracts/pub3_4_1_a0198bajcsy.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M.-H. Yang and N. Ahuja, Extracting Gestural Motion Trajectory, 1998 IEEE International Conference on Automatic Face and Gesture Recognition, Nara, Japan, April 1998, 10-15. <a href="../../abstracts/5_2_3_498yang.html">Abstract and full-text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Dugad and N. Ahuja, Unsupervised Multidimensional Hierarchical Clustering, International Conference on Acoustics, Speech, and Signal Processing, Seattle, WA, Vol. V, May 12-15, 1998, 2761-2764. <a href="../../abstracts/pub3_2_1_a0598dugad.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Aggarwal, N. Shanbhag and N. Ahuja, Improving the Throughput of Flexible-Precision DSPs via Algorithm Transformation, International Conference on Acoustics, Speech, and Signal Processing, Vol. V, Seattle, WA, May 12-15, 1998, 3069-3072. <a href="../../abstracts/pub7_1_1_a0598Manoj.htm">Abstract</a> <a href="http://vision.ai.uiuc.edu/publications/Aggarwal-ImprovingThroughputFlexiblePrecisionDSPs.ICASSP98.pdf">Full Text</a><br />
</span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">P. Ishwar, K. Ratakonda, P. Moulin and N. Ahuja, Image De-noising Using Multiple Compaction Domains, International Conference on Acoustics, Speech, and Signal Processing, Vol. III, Seattle, WA, May 12-15, 1998, Invited, 1889-1892. <a href="../../abstracts/pub6_1_3_a0598ishwar.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M.-H. Yang and N. Ahuja, Extraction and Classification of Motion Patterns for Hand Gesture Recognition, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Santa Barbara, CA, June 1998, 892-897. <a href="../../abstracts/5_2_3_698yang.html">Abstract and full-text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">J. Ma and N. Ahuja, Dense Shape and Motion from Region Correspondences, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Santa Barbara, CA, June 1998, 219-224. <a href="../../abstracts/pub1_3_2_a0698ma.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">K. Ratakonda and N. Ahuja, POCS-Based Adaptive Image Magnification, Proc. International Conference on Image Processing, Vol. 3, Chicago, IL, October 1998, 203-207. <a href="../../abstracts/pub6_1_2_a1098krishna.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">K. Ratakonda, R. Dugad and N. Ahuja, Digital Image Watermarking: Issues in Resolving Rightful Ownership, Proc. International Conference on Image Processing, Vol. 2, Chicago, IL, October 1998, 414-418. <a href="../../abstracts/pub6_4_2_a1098dugad.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Dugad, K. Ratakonda and N. Ahuja, A New Wavelet Based Scheme for Digital Image Watermarking, Proc. International Conference on Image Processing, Vol. 2, Chicago, IL, October 1998, 419-423. <a href="../../abstracts/pub6_4_2_a1098dugad2.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M.-H. Yang and N. Ahuja, Detecting Human Faces in Color Images, IEEE International Conference on Image Processing, Vol. 1, Chicago, IL, October 1998, 127-130. <a href="../../abstracts/5_2_1_2001yang.html">Abstract and full-text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Dugad, K. Ratakonda and N. Ahuja, Robust Video Shot Change Detection, IEEE Workshop on Multimedia Signal Processing, Redondo Beach, CA, December 1998, 276-278.<a href="../../abstracts/pub6_6_1_a1298krishna.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Dugad, K. Ratakonda and N. Ahuja, Robust Video Shot Change Detection, Indian Conference on Computer Vision, Graphics and Image Processing, New Delhi, India, December 1998, 358-363. <a href="../../abstracts/pubs6_6_1a1298krishna.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Singh and N. Ahuja, Estimating Light Sources, Indian Conference on Computer Vision, Graphics and Image Processing, New Delhi, India, December 1998, 76-81. <a href="../../abstracts/pub1_4_1_a1298singhmk.htm">Abstract and Full Text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M.-H. Yang and N. Ahuja, Gaussian Mixture Modeling of Human Skin Color and Its Applications in Image and Video Databases, Proc. of the SPIE: Storage and Retrieval for Image and Video Databases VI, Vol. 3656, San Jose, CA, Jan. 1999. <a href="../../mhyang/papers/spie99-abstract.html">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M-H. Yang and N. Ahuja, Detecting Human Faces in Color Images, IEEE Conference on Computer Vision and Pattern Recognition, Ft. Collins, CO, June 1999, I-466-472. <a href="../../abstracts/5_2_1_1098yang.html">Abstract and full-text</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Dugad and N. Ahuja, A Fast Scheme for Altering Resolution in the compressed Domain, IEEE Conference on Computer Vision and Pattern Recognition, Ft. Collins, CO, June 1999, I-213-218. <a href="../../abstracts/pub6_4_1_a0699dugad.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M.-H. Yang and N. Ahuja, A Data Partition Method for Parallel Self-Organizing Map, IEEE International Joint Conference on Neural Networks, Washington, DC, July 1999. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">M.-H. Yang, N. Ahuja, D. Kriegman, Face Detection using a Mixture of Factor Analyzers, International Conference on Image Processing, Kobe, Japan, Oct. 1999, III-612-616. <a href="../../mhyang/papers/icip99-abstract.html">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Dugad and N. Ahuja, A Fast Scheme for Downsampling and Unsampling in the DCT Domain, International Conference on Image Processing, Kobe, Japan, Oct. 1999, II-909-913. <a href="../../abstracts/pub6_4_1_a1099dugad.htm">Abstract </a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="color: #000000;">R. Dugad and N. Ahuja, Video Denoising by Combining Kalman and Wiener Estimates, International Conference on Image Processing, Kobe, Japan, Oct. 1999, IV-152-156. <a href="../../abstracts/pub6_5_1_a1099dugad.htm">Abstract</a></span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Singh, P. Ishwar, K. Ratakonda and N. Ahuja, Segmentation Based Denoising Using Multiple Compaction Domains, International Conference on Image Processing, Kobe, Japan, Oct. 1999, I-372-375. <a href="../../abstracts/pub6_1_3_a1099singhmk.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">D. Roth, M.-H. Yang and N. Ahuja, A SnoW-Based Face Detector, Proc. Advances in Neural Information Procesing Systems (NIPS’99), Denver, CO, Dec. 1999, 862-868.</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M.-H. Yang, N. Ahuja and D. Roth, View-Based 3D Object Recognition Using SnoW, Proc. of the Fourth Asian Conference on Computer Vision, Vol. 2, Taipei, Jan. 2000, 830-835.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.41.8348">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M.-H. Yang, N. Ahuja and D. Kriegman, Face Detection Using Mixtures of Linear Subspaces, Fourth IEEE Int. Conference on Automatic Face and Gesture Recognition (FG 2000), Grenoble, France, March 2000, 70-76. <a href="http://vision.ai.uiuc.edu/mhyang/papers/fg2000-paper2.pdf"> Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M.-H. Yang and N. Ahuja, A Geometric Approach to Train Support Vector Machines, IEEE Int. Conference on Computer Vision and Pattern Recognition, Vol. I, Hilton Head, SC, June 2000, 430-437. <a href="../../abstracts/5_2_2_600yang.html">Abstract and full-text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">D. Roth, N.-H. Yang and N. Ahuja, Learning to Recognize Objects, IEEE Int. Conference on Computer Vision and Pattern Recognition, Vol. I, Hilton Head, SC, June 2000, 724-731.  <a href="../../abstracts/5_1_2_606roth.html">Abstract and full-text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">J. Ma and N. Ahuja, Region Correspondence by Global Configuration Matching and Progressive Delauney Triangulation, IEEE Int. Conference on Computer Vision and Pattern Recognition, Vol. II, Hilton Head, SC, June 2000, 637-642.</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">R. Dugad and N. Ahuja, A Scheme for Joint Watermarking and Compression of Video, IEEE Int. Conference on Image Processing, Vol. 2, Vancouver, BC, Canada, Sept. 2000, 80-84. <a href="../../abstracts/pub6_4_2_a0900dugad.htm">Abstract </a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M.-H. Yang, N. Ahuja and D. Kriegman, Face Recognition Using Kernel Eigenfaces, IEEE Int. Conference on Image Processing, Vol. 1, Vancouver, BC, Canada, Sept. 2000, 37-41. <a href="../../abstracts/5_2_2_900yang.html">Abstract and full-text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">J. Ma, N. Ahuja, C. Neti and A. Senior, Recovering Frontal-Pose Image from a Single Profile Image, IEEE Int. Conference on Image Processing, Vol. 2, Vancouver, BC, Canada, Sept. 2000, 243-247. <a href="../../abstracts/5_2_2_900ma.html">Abstract and full-text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Aggarwal and N. Ahuja, Camera Center Estimation, Int. conference on Pattern Recognition, Vol. 1, Barcelona, Spain, Sept. 2000, 876-880.  <a href="http://vision.ai.uiuc.edu/~manoj/aggarwal_center.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Aggarwal and N. Ahuja, Estimating Sensor Orientation in Cameras, Int. Conference on Pattern Recognition, Vol. 1, Barcelona, Spain, Sept. 2000, 896-899.  <a href="http://www2.computer.org/portal/web/csdl/doi/10.1109/ICPR.2000.905570">Abstract</a> <a href="http://vision.ai.uiuc.edu/publications/Aggarwal-Estimating Sensor Orientation.ICPR00.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Aggarwal and N. Ahuja, On Generating Seamless Mosaics with Large Depth of Field, Int. Conference on Pattern Recognition, Vol. 1, Barcelona, Spain, Sept. 2000, 588-591.  <a href="http://vision.ai.uiuc.edu/~manoj/aggarwal_focus.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">L. Lu and N. Ahuja, Identification and control of a Pneumatic Actuator for Use With the Hexapod Project, Proc. ICARCV 2000, Singapore. Dec. 2000, (CD ROM). <a href="../../abstracts/pub8_2_1_a1200lu.htm">Abstract</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Aggarwal and N. Ahuja, High Dynamic Range Panoramic Imaging, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 2-9.  <a href="http://vision.ai.uiuc.edu/manoj/dynamic_aggarwal.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Aggarwal and N. Ahuja, Split Aperture Imaging for High Dynamic Range, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 10-17.  <a href="http://vision.ai.uiuc.edu/manoj/multi_aggarwal.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Aggarwal and N. Ahuja, A New Imaging Model, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 82-89. <a href="http://vision.ai.uiuc.edu/publications/Aggarwal-ANewImagingModel.ICCV01.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Aggarwal, H. Hua and N. Ahuja, On Cosine-Fourth and Vignetting Effects in Lenses, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 472-479.  <a href="http://vision.ai.uiuc.edu/publications/Aggarwal-OncosineFourthVignetting.ICCV01.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 12pt 0in;"><span style="color: #000000;">K.-H. Tan and N. Ahuja, Selecting Objects with Freehand Sketches, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 337-344. <a href="../../abstracts/pub5_3_1_a0701karhan1.htm">Abstract and Full Text</a> </span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">K.-H. Tan and N. Ahuja. A Representation for Image Structure and its Application in Object Selection with Freehand Sketches. In Proceedings IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2001. <a href="../../abstracts/pub5_3_1_a1201karhan1.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">S.-C. Yoon and N. Ahuja, Frame Interpolation using Transmitted Block-Based Motion Vectors, Proc. International Conference on Image Processing, Thessaloniki, Greece, October 2001, to appear. <a href="../../abstracts/pub6_3_1_a1001yoonfi.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">H. Hua, L.D. Brown, C. Gao, N. Ahuja, J.P. Rolland, F. Biocca, A Head-Mounted Projective Display and Its Applications in Interactive Augmented Environments, Conference Abstracts and Applications &#8211; Sketches and Applications, SIGGRAPH, Los Angeles, CA, August 2001.  <a href="http://vision.ai.uiuc.edu/publications/HongHua-SIGGRAPH01.pdf">Full Text</a></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">H. Hua, C. Gao, L.D. Brown, N. Ahuja and J.P. Rolland, Using a Head-Mounted Projective Display in Interactive Augmented Environments, Proceedings of IEEE and ACM International Symposium on Augmented Reality, New York, NY, October 2001, 217-223. <a href="http://vision.ai.uiuc.edu/publications/HongHua-UsingHeadMountedProjectiveDisplay01.pdf">Full Text</a></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M.-H. Yang, D. Roth and N. Ahuja, Face Detection Using Large Margin Classifiers, Proc. International Conference on Image Processing, Thessaloniki, Greece, October 2001, to appear.   <a href="../../abstracts/5_2_1_1001yang.html">Abstract and full-text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">A. Sehgal , A. Jagmohan , N. Ahuja, High Capacity Data Embedding in the Wavelet Domain, IEEE International Conference on Image Processing (ICIP), 2001, Thessaloniki, Greece, pages 979 &#8211; 982. <a href="../../abstracts/abstract_jagmohan_1001.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">A. Sehgal , A. Jagmohan , N. Ahuja, `Wireless Video Conferencing using Multiple Description Coding,&#8217; IEEE International Symposium on Circuits and Systems (ISCAS), 2001, Sydney, Austrailia, pages 303 &#8211; 306. <a href="../../abstracts/abstract_jagmohan_0901.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">H. Hua and N. Ahuja, A High Resolution Panoramic Camera, IEEE Conf. On Computer Vision and Pattern Recognition, December 11-13, 2001, Hawaii, I-960-967.  <a href="http://vision.ai.uiuc.edu/publications/HongHua-CVPR01.pdf">Full Text</a></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu, Ravi Bansal and Narendra Ahuja. Object boundary segmentation using graph cuts based active contours. CVPR2001 Technical Sketches, pp. 87-90, Dec. 2001. <a href="../../abstracts/abstract_ningxu_1201.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu, Narendra Ahuja and Ravi Bansal. Automated lung nodule segmentation using dynamic programming and EM based classification. SPIE Medical Imaging 4684-70, pp. 666-676, San Diego, CA. February, 2002. <a href="../../abstracts/abstract_ningxu_0202.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">H. Hua, C. Gao, L. Brown, N. Ahuja and J. P. Rolland, A Testbed for Precise Registration, Natural Occlusion and Interaction in an Augmented Environment using a Head-Mounted Projective Display (HMPD), IEEE VR 2002 Proceedings, March 2002, Orlando, FL, 81-89.  <a href="http://vision.ai.uiuc.edu/publications/HongHua-IEEE.VR02.pdf">Full Text</a></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">A. Jagmohan , A. Sehgal , N. Ahuja, `Isotropic Error-Diffusion Halftoning,&#8217; IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2002, Orlando, Florida, pages 3329-3332. <a href="../../abstracts/abstract_jagmohan_0502.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">A. Sehgal , A. Jagmohan , N. Ahuja, `Scalable Predictive Coding and the Wyner-Ziv Problem,&#8217; IEEE International Conference on Communication Systems (ICCS), 2002, Singapore, pages 101-106. <a href="../../abstracts/abstract_jagmohan_11021.htm">Abstract and Full Text</a></span></p>
<p>A. Jagmohan, A. Sehgal, N. Ahuja, `Multiple Description Coding Using Coset Codes,&#8217; IEEE International Conference on Communication Systems (ICCS), 2002, Singapore, pages 732- 737. <a href="../../abstracts/abstract_jagmohan_11022.htm">Abstract and Full Text</a></p>
<p>A. Jagmohan, A. Sehgal, N. Ahuja, `Predictive Coding Using Coset Codes,&#8217; IEEE International Conference on Image Processing (ICIP), 2002, Rochester, NY, vol. 2, pages 29- 32. <a href="../../abstracts/abstract_jagmohan_0902.htm">Abstract and Full Text</a></p>
<p>K.-H. Tan, H. Hua and N. Ahuja, Multiview Mirror Pyramid-based Panoramic Cameras, Proceedings of the IEEE Workshop on Omnidirectional Vision (Omnivis), June 2002, Copenhagen, Denmark, 87-93.  <a href="http://vision.ai.uiuc.edu/publications/Tan-Omnivis02.pdf">Full Text</a></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu and Narendra Ahuja. Iterative 3D surface modeling from a sparse set of matched points. In Proceedings of IEEE International Conference on Multimedia and Expo, vol. 1, pp. 893 –896, Lausanne, Switzerland. August 2002. <a href="../../abstracts/abstract_ningxu_0802.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu and Narendra Ahuja. Object contour tracking using graph cuts based active contours. In Proceedings of IEEE International Conference on Image Processing, vol. 3, pp. 277-280, Rochester, NY. September 2002. <a href="../../abstracts/abstract_ningxu_0902.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">H. Hua, C. Gao, and N. Ahuja, Calibration of a head-mounted projective display for augmented reality systems, in Proceedings of IEEE International Symposium on Mixed and Augmented Reality 2002, Darmstadt, Germany, Sep. 30-Oct. 1st , 2002 <a href="http://vision.ai.uiuc.edu/abstracts/pub9_2_1a0902hua.html">Abstract</a> <a href="http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&amp;arnumber=1115087&amp;isnumber=24594">Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">K.-H. Tan, D. Kriegman and N. Ahuja, Appearance-based Eye Gaze Estimation, Proceedings IEEE Workshop on Applications of Computer Vision, December 2002, Orlando, FL, 191-195.  <a href="http://vision.ai.uiuc.edu/publications/Tan-AppearanceBasedEyeGaze02.pdf">Full Text</a></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Singh and N. Ahuja, Mean-Shift Segmentation with Wavelet-based Bandwidth Selection, IEEE Workshop on Applications in Computer Vision, pp. 43-50, Dec. 2002, Florida. <a href="http:///">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Chunyu Gao, Hong Hua, and N. Ahuja, Easy calibration of a head-mounted projective display for augmented reality systems, in Proceedings of IEEE VR 2003, p. 53-60, March 24-26, 2003, Los Angeles, CA. <a href="http://vision.ai.uiuc.edu/abstracts/pub9_2_1a0303Gao.html">Abstract</a> <a href="http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&amp;arnumber=1191121&amp;isnumber=26695">Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">H. Hua, L. Brown, C. Gao and N. Ahuja, A New Collaborative Infrastructure: SCAPE, Proc. IEEE Virtual Reality 2003, Los Angeles, CA, March 2003, 171.  <a href="http://vision.ai.uiuc.edu/publications/HongHua-IEEE.VR03.pdf">Full Text</a></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu, Ravi Bansal and Narendra Ahuja. Object Segmentation Using Graph Cuts Based Active Contours. In Proceedings of IEEE International Conference on Computer Vision and  Pattern Recognition, vol. 2, pp. 46-53, Madison, WI. June 2003. .<a href="../../abstracts/abstract_ningxu_0603.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu and Narendra Ahuja. A Three-view Matching Algorithm Considering Foreshortening Effects. In Proceedings of International Conference on Computer Vision, Pattern Recognition and Image Processing, pp. 635-638, Cary, NC. September 2003. <a href="../../abstracts/abstract_ningxu_0903.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Hongcheng Wang and Narendra Ahuja. Facial Expression Decomposition. In Proceedings of International Conference on Computer Vision, vol 2, pp. 958-965, Nice, France. October 2003. <a href="../../abstracts/abstract_hongcheng_1003.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">M. Singh and N. Ahuja, “Regression based Bandwidth Selection for Segmentation using Parzen Windows”,  Ninth International Conference in Computer Vision, Proceedings, vol. 1, pp. 2-9,  Oct. 2003, Nice, France. <a href="http:///">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">A. Sehgal, N. Ahuja, `Robust Predictive Coding of Video Using Coset Codes,&#8217; IEEE Data Compression Conference (DCC), 2003, Snowbird, Utah, pages 103-112.<a href="http:///">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Jagmohan, A.; Sehgal, A.; Ahuja, N. WYZE-PMD based multiple description video codec, Multimedia and Expo, 2003. Proceedings. 2003 International Conference on , Volume: 1 , 6-9 July 2003 Page(s): 569 -572 <a href="../../abstracts/abstract_jagmohan_0703.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Jagmohan, A.; Ahuja, N. Wyner-Ziv encoded predictive multiple descriptions,  Data Compression Conference, 2003. Proceedings. DCC 2003 , March 25-27, 2003 Page(s): 213 -222 <a href="../../abstracts/abstract_jagmohan_0303.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu, Tianli Yu and Narendra Ahuja. Shape from color consistency using node cut. In Proceedings of 6th Asian Conference on Computer Vision, Jeju Island, Korea. January 2004. <a href="../../abstracts/abstract_ningxu_01041.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu and Narendra Ahuja. On the use of depth-from-focus in 3D object modeling from multiple views. In Proceedings of 6th Asian Conference on Computer Vision, Jeju Island, Korea. January 2004. <a href="../../abstracts/abstract_ningxu_01043.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;"><br />
</span></p>
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;">
<p class="MsoBodyText" style="margin: 0in 7.5pt 0.0001pt 0in;"><span style="color: #000000;">Ning Xu and Narendra Ahuja. Interactive Object Selection Using S-t Minimum Cut. In Proceedings of 6th Asian Conference on Computer Vision, Jeju Island, Korea. January 2004. <a href="../../abstracts/abstract_ningxu_01042.htm">Abstract and Full Text</a></span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Yu, N. Xu and N. Ahuja, Shape and View Independent Reflectance Map from Multiple Views, Proc. European Conference on Computer Vision,  Prague, May 2004.  <a href="http://vision.ai.uiuc.edu/newpubs/ECCV2004Tianli.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Singh, H. Arora and N. Ahuja, Robust Probabilistic Estimation for Parametric Image Models, Proc. European Conference on Computer Vision,  Prague, May 2004, to appear.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.10.889">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">M. Singh, H. Arora and N. Ahuja, Robust Registration and Tracking Using Kernel Density Correlation, 2nd IEEE Conference on Image and Video Registration, Washington, DC. June 2004, to appear.</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">C.-B. Liu and N. Ahuja, A Model for Dynamic Shape and Its Applications, IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC, June 2004, Vol. 2, 129-134.  <a href="http://stereo.ai.uiuc.edu/newpubs/cvpr04_dyn.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Yu, N. Xu and N. Ahuja, Recovering Shape and Reflectance Model of Non-Lambertian Objects from Multiple Views, IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC, June 2004, 226-233.  <a href="http://vision.ai.uiuc.edu/publications/CVPR2004Tianli.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Jagmohan, M. Singh and N. Ahuja, Dense Two View Stereo Matching Using Kernel Maximum Likelihood Estimation, IEEE International Conference on Pattern Recognition, Cambridge, UK, August, 2004.  <a href="http://portal.acm.org/citation.cfm?id=1020905">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">C.-B. Liu and N. Ahuja, Vision Based Fire Detection, IEEE International Conference on Pattern Recognition, Cambridge, UK, August 2004, Vol. 4, 134-137.  <a href="http://stereo.ai.uiuc.edu/newpubs/icpr04_fire.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Xu, K. Tan, H. Arora and N. Ahuja, Generating Omnifocus Images Using Graph Cuts and a New focus Measure, IEEE International Conference on Pattern Recognition, Cambridge, UK, August 2004, Vol. 4, 697-700.  <a href="http://portal.acm.org/citation.cfm?id=1021317">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Yu, N. Xu and N. Ahuja, Reconstructing a Dynamic Surface from Video Sequences Using Graph Cuts in 4D Space-Time, IEEE International Conference on Pattern Recognition, Cambridge, UK, August 2004, 245-248.  <a href="http://portal.acm.org/citation.cfm?id=1020696">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Briassouli and N. Ahuja, Fusion of Frequency and Spatial Domain Information for Motion Analysis, IEEE International Conference on Pattern Recognition, Cambridge, UK, August 2004, 175-178.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.2.6586">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">C. Gao and N. Ahuja, Single Camera Stereo Using Planar Parallel Plate, IEEE International Conference on Pattern Recognition, Cambridge, UK, August 2004, Vol. 4, 108-111. <a href="http://vision.ai.uiuc.edu/newpubs/Stereo_PPP_Gao.pdf"> Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">H. Wang, R. Raskar and N. Ahuja, Seamless Video Editing. IEEE International Conference on Pattern Recognition, Cambridge, UK, August 2004, Vol. 3, 858-861.  <a href="http://vision.ai.uiuc.edu/~wanghc/papers/icpr04_editing.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">H. Wang and N. Ahuja, Compact Representation of Multidimensional Data Using Tensor Rank-One Decomposition, IEEE International Conference on Pattern Recognition, Cambridge, UK, August 2004, Vol. 1, 44-47.  <a href="http://vision.ai.uiuc.edu/~wanghc/papers/icpr04_tensor.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">C.-B. Liu and N. Ahuja, Motion Based Retrieval of Dynamic Objects in Videos, ACM Multimedia Conference, October 2004, 288-291.  <a href="http://portal.acm.org/citation.cfm?id=1027527.1027593">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">H. Wang, Q. Wu, L. Shi, Y. Yu and N. Ahuja, Out-of-Core Tensor Approximation of Multi-Dimensional Matrices of Visual Data, ACM SIGGRAPH, 2005.  <a href="http://portal.acm.org/citation.cfm?doid=1073204.1073224">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Yu, J. Luo, A. Singhal and N. Ahuja, Shape Regularized Active Contour Based on Dynamic Programming for Anatomical Structure Segmentation, SPIE Medical Imaging 2005, San Diego, CA, February 2005.  <a href="http://vision.ai.uiuc.edu/newpubs/Tianli_spie05.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">H. Wang and N. Ahuja, Rank-R Approximation of Tensors Using Image-as-Matrix Representation, IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, CA, June 2005, to appear.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.60.7704">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">H. Wang, N. Xu, R. Raskar and N. Ahuja, Videoshop: A New Framework for Video Editing in Spatio-Temporal Gradient domain, Video Proceedings, IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, CA, June 2005, to appear.</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Yu, J. Luo and N. Ahuja, Shape Regularized Active Contour using Iterative Global Search and Local Optimization, IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, CA, June 2005, to appear.  <a href="http://vision.ai.uiuc.edu/publications/Tianli_cvpr05.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">C. Gao and N. Ahuja, A Refractive Camera for Acquiring Stereo Depth and Super-Resolution Images, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, New York, NY Vol. 2, June 2006, 2316-2323. <a href="../../papers/RefractiveCamera.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Todorovic and N. Ahuja, Extracting subimages of an unknown category from a set of images,  in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2006), vol. 1, pp. 927-934, New York, NY, 2006. <a href="../../papers/TodorovicAhuja_CategoryModeling_CVPR06.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Todorovic and N. Ahuja, 3D texture classification using the belief net of a segmentation tree, in Proc. 18th Int. Conf. Pattern Recognition (ICPR 2006), Hong Kong, China, 2006. <a href="../../papers/TodorovicAhuja_3DTexture_ICPR06.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Alexia Briassouli, Narendra Ahuja, Spatial and Fourier Error Minimization for Motion Estimation and Segmentation, ICPR 2006, Hong Kong <a href="http://inf-server.inf.uth.gr/%7Ebriassou/publications.html">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Alexia Briassouli, Narendra Ahuja, Estimation of Multiple Periodic Motions from Video, ECCV 2006, Graz, Austria <a href="http://inf-server.inf.uth.gr/%7Ebriassou/publications.html">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Hongcheng Wang, Yunqiang Chen, Tong Fang, Jason Tyan and Narendra Ahuja, Gradient Adaptive Image Restoration and Enhancement, in Proc. IEEE International Conference on Image Processing (ICIP), Atlanta, VA, Aug. 2006 <a href="../../%7Ewanghc/papers/icip06_grad_filtering.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Tianli Yu, Hongcheng Wang, Narendra Ahuja, Wei-Chao Chen, Sparse Lumigraph Relight by Illumination and Reflectance Estimation from Multi-View Images, Eurographics Symposium on Rendering (EGSR), 2006 <a href="../../%7Ewanghc/papers/EGSR06_SceneFactor.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Tianli Yu, Hongcheng Wang, Narendra Ahuja, Wei-Chao Chen, Sparse Lumigraph Relight by Illumination and Reflectance Estimation from Multi-View Images, Technical Sketch, SIGGRAPH, 2006 <a href="../../%7Ewanghc/papers/siggraph06_sketch.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Himanshu Arora, Narendra Ahuja, Analysis of ramp discontinuity model for multiscale image segmentation. ICPR(1), 2006.  <a href="http://portal.acm.org/citation.cfm?id=1172834">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">T. Yu, N. Ahuja and W-C. Chen, SDG Cut: 3D Reconstruction of Non-Lambertian Objects Using Graph Cuts on Surface Distance Grid, Proceedings IEEE Conference on Computer Vision and Pattern Recognition, New York, NY, Vol. 2, June 2006, 2269-2276. <a href="../../tianliresearch.blogspot.com/2006/03/sdg-cut-3d-reconstruction-of-non.html">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">C-B. Liu, R-S. Lin, M-H. Yang, N. Ahuja and S. Levinson, Object Tracking Using Globally Coordinated Nonlinear Manifolds, International Conference on Pattern Recognition, Vol. 1, Hong Kong, August 2006, 844-847. <a href="../../vision.ai.uiuc.edu/%7Ecbliu/publications/icpr06_tracking.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">S.Yi and Narendra Ahuja, An Omnidirectional Stereo Vision System Using a Single Camera, International Conference on Pattern Recognition, Vol. 4, Hong Kong, August 2006, 861-847. <a href="../../newpubs/sooyeong_camera.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Yi and N. Ahuja, A Novel Omnidirectional Stereo Vision System with a Single Camera, International Conference on Image Analysis and Recognition, Portugal, September 2006.  <a href="http://www.springerlink.com/content/p557x5p437p52k70/">Abstract</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">A. Kumar, N. Ahuja, J. Hart, U.K. Visesh, P.J. Narayanan, C.V. Jawahar, A Vision System for Monitoring Intermodal Freight Trains, Workshop on Applications of Computer Vision (WACV), Austin, TX, February 2007. <a href="../../publications/wacv2007_avinash_jawar_hart_ahuja_railroad.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">B. Freid, C. Barkan, N. Ahuja, J. Hart, S. Todorvic, N. Kocher, Multispectral Machine Vision for Improved undercarriage Inspection of Railroad Rolling Stock, Proc. International Heavy Haul Conference, Specialist Technical Session, Kiruna, Sweden, June 2007, 737-744. <a href="../../publications/ihhc_fried_barkan_ahuja_hart_todorvic_kocher_rail_road.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">H. Arora, N. Loeff, D. Forsyth and N. Ahuja, Unsupervised Segmentation of Objects using Efficient Learning, Proc. IEEE Conference on Computer Vision and Pattern Recognition,  Minneapolis, MN, June 2007, 1-7 <a href="../../publications/cvpr2007_harora_ahuja_daf_leoff_unsupervised_segmentation_using_efficient_learning.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">H. Arora and N. Ahuja, Modeling Objects using Distribution and Topology of Multiscale Region Pairs, IEEE Conference on Computer Vision and Pattern Recognition, Beyond Patches Workshop, Minneapolis, MN, June 2007, 1-8. <a href="../../publications/cvpr2007_harora_ahuja_modelingobjectsusingregionpairs.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Bernard Ghanem and Narendra Ahuja, Phase PCA for Dynamic Texture Video Compression, Proc.of the IEEE International Conference on Image Processing 2007, Sept. 16-19, San Antonio, TX <a href="../../publications/icip2007_bghanem_ahuja_dynamictextures.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and S. Todorovic, Learning the Taxonomy and Models of Categories Present in Arbitrary Images,  Proc. of the IEEE International Conference on Computer Vision 2007, Oct. 14-20, Rio De Janeiro, Brazil <a href="../../publications/iccv2007_sintod_ahuja_categories.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and S. Todorovic, Extracting Texels in 2.1D Natural Textures,  Proc. of the IEEE International Conference on Computer Vision 2007, Oct. 14-20, Rio De Janeiro, Brazil <a href="../../publications/iccv2007_sintod_ahuja_texels.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Bernard Ghanem and Narendra Ahuja, Phase Based Modelling of Dynamic Textures, Proc. of the IEEE International Conference on Computer Vision 2007, Oct. 14-20, Rio De Janeiro, Brazil <a href="../../publications/iccv2007_bghanem_ahuja_dynamictexture.pdf">Full Text</a></span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Varsha Hedau, H. Arora and N. Ahuja, &#8220;Matching Images under Unstable Segmentation&#8221;,  in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, AL, 2008. <a href="../../publications/cvpr2008_hedau_arora_ahuja_image_matching.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">N. Ahuja and S. Todorovic, &#8220;Connected segmentation tree &#8211; a joint representation of region layout and hierarchy,&#8221; in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, AL, 2008. <a href="../../%7Esintod/research/publications/AhujaTodor.CST.CVPR08.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">S. Todorovic and N. Ahuja, Learning subcategory relevances to category recognition,&#8221; in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, AL, 2008. <a href="../../%7Esintod/research/publications/TodorAhuja.Weights.CVPR08.pdf">Full Text</a><br />
</span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;"><span style="color: #000000;">Bernard Ghanem and N. Ahuja, &#8220;Extracting a Fluid Dynamic Texture and the Background from Video&#8221;, in Proc. IEEE Comp. Soc. Conf. Computer Vision and Pattern Recognition (CVPR 2008), Anchorage, AL, 2008. <a href="../../publications/cvpr2008_ghanem_ahuja_dynamic_texture.pdf">Full Text</a></span></p>
<p class="MsoBodyText" style="margin-right: 7.5pt;">Q. Yang, K-H. Tan and N. Ahuja, Real-time 0(1) Bilateral Filtering, IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2009), Miami, FL, June 2009.  <a href="http://vision.ai.uiuc.edu/publications/QingxiongYang-CVPR09.pdf">Full Text</a></p>
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<p><!--Session data--></p>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=385</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Journal Papers</title>
		<link>http://vision.ai.uiuc.edu/?p=382</link>
		<comments>http://vision.ai.uiuc.edu/?p=382#comments</comments>
		<pubDate>Thu, 07 May 2009 06:52:10 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Journal]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=382</guid>
		<description><![CDATA[<p style="margin-right: 7.5pt;"><span id="more-382"></span>N. Ahuja, L. S. Davis, D. L. Milgram and A. Rosenfeld, Piecewise Approximation of Pictures Using Maximal Neighborhoods, IEEE Trans. Comput., C-27, April l978, 375-379. <a href="../../abstracts/pub3_4_1_a0478ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
</p><p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and A. Rosenfeld, A Note on the Use of Second-Order Gray-Level Statistics for&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="margin-right: 7.5pt;"><span id="more-382"></span>N. Ahuja, L. S. Davis, D. L. Milgram and A. Rosenfeld, Piecewise Approximation of Pictures Using Maximal Neighborhoods, IEEE Trans. Comput., C-27, April l978, 375-379. <a href="../../abstracts/pub3_4_1_a0478ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and A. Rosenfeld, A Note on the Use of Second-Order Gray-Level Statistics for Threshold Selection, IEEE Trans. Syst., Man, Cyber., SMC-8, December 1978, 895-898. <a href="../../abstracts/pub3_4_1_a1278ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">B. Schachter and N. Ahuja, Random Pattern Generation Processes, Computer Graphics and Image Processing, 10, June 1979, 95-114. <a href="../../abstracts/pub3_3_1_a0679schachter.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">B. Schachter and N. Ahuja, A History of Visual Flight Simulation, Computer Graphics World, 3, May 1980, 16-31.  <a href="../../abstracts/pub3_3_1_a10580schachter.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, R. M. Haralick and A. Rosenfeld, Neighbor Gray Levels as Features in Pixel Classification, Pattern Recognition, 12, August 1980, 251-260. <a href="../../abstracts/pub3_4_1_a0880ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, T. Dubitzki and A. Rosenfeld, Some Experiments with Mosaic Models for Images, IEEE Trans. Syst., Man, Cyber., 10, November 1980, 744-749. <a href="../../abstracts/pub3_3_1_a1180ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and A. Rosenfeld, Mosaic Models for Textures, IEEE Trans. Pattern Analysis and Machine Intelligence, January 1981, 1-11. <a href="../../abstracts/pub3_3_1_a0181ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, Mosaic Models for Images, 1: Geometric Properties of Components in Cell Structure Mosaics, Information Sciences, 23, March 1981, 69-104. <a href="../../abstracts/pub3_3_1_a0381ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, Mosaic Models for Images, 2: Geometric Properties of Components in Coverage Mosaics, Information Sciences, 23, April 1981, 159-200. <a href="../../abstracts/pub3_3_1_a0481ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">R. Chellappa, R. L. Kashyap and N. Ahuja, Decision Rules for Choice of Neighbors in Random Field Models of Images, Computer Graphics and Image Processing, l5, April 1981, 301-318. <a href="../../abstracts/pub3_4_1_a0481chellappa.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, Mosaic Models for Images, 3: Spatial Correlation in Mosaics, Information Sciences, 24, June 1981, 43-70. <a href="../../abstracts/pub3_3_1_a0681ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and B. Schachter, Image Models, ACM Computing Surveys, December 1981, 373-398. <a href="../../abstracts/pub3_3_1_a1281ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, Dot Pattern Processing Using Voronoi Neighborhoods, IEEE Trans. Pattern Analysis and Machine Intelligence, May 1982, 336-343.<a href="../../abstracts/pub3_2_1_a0582ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, On Approaches to Polygonal Decomposition for Image Representation, Computer Vision, Graphics and Image Processing, November 1983, 200-214. <a href="../../abstracts/pub4_1_1_a1183ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and C. Nash, Octree Representations of Moving Objects Computer Vision, Graphics and Image Processing, May 1984, 207-216.<a href="../../abstracts/pub4_2_2_a0584ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and S. Swamy, Multiprocessor Pyramid Architectures for Bottom-Up Analysis, IEEE Trans. Pattern Analysis and Machine Intelligence, July 1984, 463-475. <a href="../../abstracts/pub7_2_1_a0784Ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">H. C. Chen, N. Ahuja and T. S. Huang, Septree Representations of Moving Objects Using Hexagonal Cylinderical Decomposition, Journal of Optical Engineering, September 1984, 531-535. <a href="../../abstracts/pub4_1_1_a0984chen.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, B. An and B. Schachter, Image Representation Using Voronoi Tessellation, Computer Vision, Graphics and Image Processing, March 1985, 286-295. <a href="../../abstracts/pub4_1_1_a0385ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and S. Swamy, Multiprocessor Pyramid Architectures for Bottom-up Image Analysis, Test and Measurement World, November 1985, 66-76. <a href="../../abstracts/pub7_2_1_a1185Ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, Efficient Planar Embedding of Trees for VLSI Layouts, Computer Vision, Graphics and Image Processing, May 1986, 189-203.<a href="../../abstracts/pub7_2_1_a0586Ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, T. Huang and N. Ahuja, 3-D Motion Estimation, Understanding and Prediction from Noisy Image Sequences, IEEE Trans. Pattern Analysis and Machine Intelligence, May 1987, 370-389.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng and N. Ahuja, Octrees of Objects in Arbitrary Motion: Representation and Efficiency, Computer Vision, Graphics and Image Processing, August 1987, 167-185. <a href="../../abstracts/pub4_2_2_a0887weng.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Veenstra and N. Ahuja, Line Drawings of Octree-Represented Objects, ACM Transactions on Graphics, January 1988, 61-75. <a href="../../abstracts/pub4_2_1_a0188veenstra.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">X. Zhuang, T. S. Huang, N. Ahuja and R. M. Haralick, A Simplified Optic Flow-Motion Algorithm, Computer Vision, Graphics and Image Processing, June 1988, 334-344.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">D. Blostein and N. Ahuja, A Multiscale Region Detector, Computer Vision, Graphics and Image Processing, January 1989, 22-41.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and J. Veenstra, Generating Octrees from Object Silhouettes in Orthographic Views, IEEE Trans. Pattern Analysis and Machine Intelligence, February 1989, 137-149. <a href="../../abstracts/pub4_2_1_a0289ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">W. Hoff and N. Ahuja, Surfaces from Stereo: Integrating Feature Matching, Disparity Estimation and Contour Detection, IEEE Trans. Pattern Analysis and Machine Intelligence, February 1989, 121-136. <a href="../../abstracts/pub1_3a_1_a0289ha.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, T. Huang and N. Ahuja, Motion and Structure from Two Perspective Views: Algorithm, Error Analysis, and Error Estimation IEEE Trans. Pattern Analysis and Machine Intelligence, May 1989, 451-476.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">D. Blostein and N. Ahuja, Shape from Texture: Integrating Texture Element Extraction and Surface Estimation, IEEE Trans. Pattern Analysis and Machine Intelligence, December 1989, 1233-1251.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and M. Tuceryan , Extraction of Basic Perceptual Structure in Dot Patterns: Integrating Region, Boundary and Component Gestalt, Computer Vision, Graphics and Image Processing, December 1989, 304-356. <a href="../../abstracts/pub3_2_1_a1289ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Srivastava and N. Ahuja, Octree Generation from Object Silhouettes in Perspective Views, Computer Vision, Graphics and Image Processing, January 1990, 68-84. <a href="../../abstracts/pub4_2_1_a0190srivastava.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">X. Hu and N. Ahuja, Motion Estimation Under Orthographic Projection, IEEE Trans. on Robotics and Automation, December 1991, 848-853.<a href="../../abstracts/pub1_2_1_a1291ha.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, N. Ahuja and T. S. Huang, Motion and Structure from Point Correspondences: Planar Surfaces, IEEE Trans. Signal Processing, vol. 39, No. 12, December 1991, 2691-2717. <a href="../../abstracts/pub1_2_1_a1188wah.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">Y. Hwang and N. Ahuja, A Potential Field Approach to Path Planning, IEEE Transactions on Robotics and Automation, Vol. 8, February 1992, 23-32. <a href="../../abstracts/pub8_1_2_a0292hwang.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">O. Faugeras, J. Mundy, N. Ahuja, C. Dyer, A. Pentland, R. Jain, K. Ikeuchi and K. Bowyer, Why Aspect Graphs Are Not (Yet) Practical for Computer Vision, Computer Vision, Graphics and Image Processing: Image Understanding, March 1992, 212-218. <a href="../../abstracts/pub4_1_1_a0392faugeras.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, T. S. Huang and N. Ahuja, Motion and Structure from Line Correspondences: Closed-form Solution, Uniqueness and Optimization, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 14, March 1992, 318-336. <a href="../../abstracts/pub1_2_1_a0392wha.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, N. Ahuja and T. S. Huang, Matching Two Perspective Views, IEEE Trans. Pattern Analysis and Machine Intelligence, August 1992, 806-825. <a href="../../abstracts/pub1_3_2_a0892wah.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">Y. Hwang and N. Ahuja, Gross Motion Plannig &#8211; A Survey, ACM Computing Surveys, Vol 24, No. 3, 1992, 219-292. <a href="../../abstracts/pub8_1_2_a0992hwang.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Das and N. Ahuja, Integrating Coarse-to-fine Image Acquisition and Estimation from Multiple Cues, Sadhana, Vol. 18, Part 2, June 1993, 223-238.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">X. Hu and N. Ahuja, Sufficient Conditions for Double or Unique Solution of Motion and Structure, Computer Vision, Graphics and Image Processing: Image Understanding, September 1993, 161-176.  <a href="../../abstracts/pub1_2_1_a0491huahuja.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, N. Ahuja and T. S. Huang, Optimal Motion and Structure Estimation from Two Views, IEEE Trans. Pattern Analysis and Machine Intelligence, September 1993, 864-884. <a href="../../abstracts/pub1_3_2_a0993wah.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">X. Hu and N. Ahuja, Motion and Structure Estimation using Long Sequence Motion Models, Journal of Image and Vision Computing, 11, November 1993, 549-569.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and A. L. Abbott, Active Stereo: Integrating Disparity, Vergence, Focus, Aperture and Calibration for Surface Estimation, IEEE Trans. Pattern Analysis and Machine Intelligence, October 1993, 1007-1029. <a href="../../abstracts/pub1_5_1_a1093ahujabbot.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">A. N. Choudhary, J. H. Patel, and N. Ahuja, NETRA: A Hierarchical and Partitionable Architecture for Computer Vision Systems, IEEE Trans. Parallel and Distributed Systems, Vol. 4, No. 10, October 1993, 1092-1104. <a href="../../abstracts/pub7_2_1_a1093Choudhary.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Sull and N. Ahuja, Integrated 3D Analysis and Analysis Guided Synthesis of Flight Image Sequences, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 16, No. 4, April 1994, 357-372.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">X. Hu and N. Ahuja, Matching Point Features with Ordered Geometric, Rigidity and Disparity Constraints, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 16, No. 10, October 1994, 1041-1049. <a href="../../abstracts/pub1_3_1_a1094ha.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">X. Hu and N. Ahuja, Mirror Uncertainty and Uniqueness Conditions for Determining Shape and Motion from Orthographic Projection, Int. Journal of Computer Vision, 13, 3, November 1994, 295-309.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">X. Hu and N. Ahuja, Feature Extraction and Matching as Signal Detection, Int. Journal of Pattern Recognition and Artificial Intelligence, 8, 6, December 1994, 1343-1379.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">X. Hu and N. Ahuja, Necessary and Sufficient Conditions for a Unique Solution of Plane Motion and Structure, IEEE Transactions on Robotics and Automation, Vol. 11, No. 2, April 1995, 304-308.  <a href="../../abstracts/pub1_2_1_a0495huahuja.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, On Detection and Representation of Multiscale Low-Level Image Structure, ACM Computing Surveys, Vol. 27, No. 3, September 1995, 304-306.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Sull and N. Ahuja, Integrated Matching and Segmentation of Multiple Features in Two Views, Computer Vision, Graphics and Image Processing: Image Understanding, Vol. 62, No. 3, November 1995, 279-297.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Das and N. Ahuja, Performance Analysis of Stereo, Vergence and Focus as Depth Cues for Active Vision, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 17, No. 12, December 1995, 1213-1219. <a href="../../abstracts/pub1_5_1_a1295dasahuja.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">T. Srikanth and N. Ahuja, Parallel Distributed Detection of Feature Trajectories in Multiple Discontinuous Motion Image Sequences, IEEE Trans. on Neural Networks, Vol. 7, No. 3, May 1996, 594-603.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Das and N. Ahuja, Active Surface Estimation: Integrating Coarse-to-fine Image Acquisition and Estimation from Multiple Cues, Artificial Intelligence, Vol. 83, May 1996, 241-266. <a href="../../abstracts/pub1_5_1_a0596das.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">A. Krishnan and N. Ahuja, Range Estimation from Focus Using a Nonfrontal Imaging Camera, Int. Journal of Computer Vision, Vol. 20, No. 3, 1996, 169-185. <a href="http://www.springerlink.com/content/h257633q15jr4765/fulltext.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">Y. Kitamura, H. Takemura, N. Ahuja and F. Kishino, Colliding Face Detection among 3-D Objects using Octree and Polyhedral Shape Representation, Journal of the Robotics Society of Japan, Vol. 14, No. 5, 1996, 733-742. <a href="../../abstracts/pub8_1_1_a0096kitamura.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, A Transform for Multiscale Image Segmentation by Integrated Edge and Region Detection, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 18, No. 12, December 1996, 1211-1235.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and Jen-Hui Chuang, Shape Representation Using a Generalized Potential Field Model, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 19, No. 2, February 1997, 169-176. <a href="../../abstracts/pub4_1_1_a0297ahuja.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M. Tabb and N. Ahuja, Unsupervised Multiscale Image Segmentation by Integrated Edge and Region Detection, IEEE Transactions on Image Processing, Vol. 6, No. 5, May 1997, 642-655.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, Y. Cui and N. Ahuja, Transitory Image Sequences, Asymptotic Properties, and Estimation of Motion and Structure, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 19, No. 5, May 1997, 451-464. <a href="../../abstracts/pub1_2_1_a0597wca.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, N. Ahuja and T. S. Huang, Learning Recognition and Segmentation Using the Cresceptron, International Journal of Computer Vision, 25(2), 1997, 109-143.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">C. Debrunner and N. Ahuja, Segmentation and Factorization-Based Motion and Structure Estimation for Long Image Sequences, IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 20, No. 2, February 1998, 206-211.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">P. Bajcsy and N. Ahuja, Location and Density-Based Hierarchical Clustering, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 20, No. 9, September 1998, 1011-1015. <a href="../../abstracts/pub3_2_1_a0998bajcsy.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J.-H. Chuang and N. Ahuja, An Analytically Tractable Potential Field Model of Free Space and Its Application in Obstacle Avoidance, IEEE Trans. on Systems, Man, and Cybernetics, Vol. 28, No. 5, October 1998, 729-736. <a href="../../abstracts/pub8_1_1_a1098chuang.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Srinivasa and N. Ahuja, A Topological and Temporal Correlator Network for Spatiotemporal Pattern Learning, Recognition and Recall, IEEE Transactions on Neural Networks, Vol. 10, No. 2, March 1999, 356-371. <a href="../../abstracts/5_1_1_399srinivasa.html">Abstract and full-text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">T. Joshi, N. Ahuja and J. Ponce, Structure and Motion Estimation from Dynamic Silhouettes under Perspective Projection, International Journal of Computer Vision, 31(1), 1999, 31-50.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">S. C. Yoon, K. Ratakonda and N. Ahuja, Low Bit-Rate Video Coding with Implicit Multiscale Image Segmentation, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 11, No. 4, April 2001, 461-474. <a href="../../abstracts/pub6_2_1_a1099yoonvc.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">R. Dugad and N. Ahuja, A Fast Scheme for Image Size Change in the Compressed Domain, IEEE Trans. on Circuits and Sstems for Video Technology, Vol. 11, No. 4, April 2001, 461-474. <a href="http://vision.ai.uiuc.edu/abstracts/pub6_4_1_a0401dugad.htm">Abstract</a> <a href="http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=00915353">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">Z.-P. Liang, H. Pan, R. L. Magin, N. Ahuja and T. S. Huang, Automated Image Registration by Maximization of a Region Similarity Metric, International Journal of Imaging Systems and Technology, special issue on MR Signal Processing, to appear.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.375">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M.-H. Yang, D. Kriegman and N. Ahuja, Face Detection Using Multimodal and Density Modes, Computer Vision and Image Understanding, Vol. 84, October 2001, 1-21.  <a href="http://vision.ucsd.edu/kriegman-grp/papers/cviu01.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M.-H. Yang, D. Kriegman and N. Ahuja, Detecting Faces in Images: A Survey, IEEE Transactions on Pattern Analysis and Machine Intelligence, January 2002, 34-58.  <a href="http://vision.ai.uiuc.edu/mhyang/papers/pami02a.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M. Aggarwal and N. Ahuja, A Pupil-Centric Model of Image Formation, International Journal of Computer Vision, 48(3), 2002, 195-214.  <a href="http://www.springerlink.com/content/jl2c8uvntegtqba3/fulltext.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M.-H. Yang, N. Ahuja and M. Tabb, Extraction of Motion Trajectories and Its Application to Hand Gesture Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol., 24, No. 8, 2002, 1061-1074.  <a href="http://portal.acm.org/citation.cfm?id=605095">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">D. Roth, M.-H. Yang and N. Ahuja, Learning to Recognize Objects, Neural Computation, Vol. 14, No. 5, 2002, 1071-1104.  <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.43.1689">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">R. Dugad and N. Ahuja, A Scheme for Spatial Scalability Using Nonscalable Encoders, IEEE Transactions CSVT,  Vol. 13, No. 10, October 2003, 993-999.  <a href="http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&amp;arnumber=1234139">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">K.-H. Tan, H. Hua and N. Ahuja, Multiview Panoramic Cameras Using Mirror Pyramids, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 26, No. 6, June 2004.  <a href="http://vision.ai.uiuc.edu/~tankh/Camera/multiview_pam.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M. Aggarwal and N. Ahuja, Split Aperture Imaging for High Dynamic Range, International Journal on Computer Vision, Vol. 58, No. 1, June 2004, 195-214.  <a href="http://vision.ai.uiuc.edu/manoj/multi_aggarwal.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">A. Sehgal, A. Jagmohan and N. Ahuja, Robust  Wyner-Ziv Coding of Video, IEEE Transactions on Multimedia, Vol. 6, No. 2, April 2004, 249-258.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M. Tabb and N. Ahuja, Two-Dimensional Motion Estimation by Matching a Multiscale Set of Region Primitives, IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J.-H. Chuang, N. Ahuja, C.-C. Lin, C.-H. Tsai and C.-H. Chen, A Potential-Based Generalized Cylinder Representation, Computers and Graphics, Vol. 28, 2004, 907-918.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M. Aggarwal and N. Ahuja, Split Aperture Imaging for High Dynamic Range, International Journal on Computer Vision, Vol. 58, No. 1, June 2004, 7-17. <a href="http://www.springerlink.com/app/home/contribution.asp?wasp=bf93ec8571a8499dbc209ad557f7bff0&amp;referrer=parent&amp;backto=issue,2,6;journal,17,115;linkingpublicationresults,1:100272,1">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">Hongcheng Wang, Ning Xu, Ramesh Raskar and Narendra Ahuja, Videoshop: A New Framework for Video Editing in Gradient Domain, Graphical Models (GM), Volume 69, Issue 1, Jan. 2007, pp 57-70 <a href="../../publications/gm2007_wang_xu_raskar_ahuja_videoshop.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Yi, B. Choi and N. Ahuja, Real-time Omni-directional Distance Measurement with Active Panoramic Vision, International Journal of Control, Automation, and Systems, Vol. 5, No. 2, April 2007, 184-191.<a href="../../publications/ijcas2007_yi_choi_ahuja_omni_cam.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">Hong Hua, C. Gao, and N. Ahuja, Calibration of an augmented reality system using head-mounted projective displays, IEEE Transactions on Systems, Man, Cybernetics (Part A: Systems), 37(3), 416-30, May 2007. <a href="../../publications/ismar2007_hua_gao_ahuja_headmountedprojective.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">T. Yu, N. Xu and N. Ahuja, Shape and View Independent Reflectance Map from Multiple Views, International Journal of Computer Vision, Vol. 73, No. 2, June 2007, 123-138. <a href="../../publications/ijcv2007_yu_xu_ahuja_shape_view_reflectance_map.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">A. Briassouli and N. Ahuja, Extraction and Analysis of Multiple Periodic Motions in Video Sequences, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, No. 7, July 2007, 1244-1261. <a href="../../publications/pami2007_briassouli_ahuja_motion_analysis.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">H. Hua, N. Ahuja and C. Gao, Design Analysis of a High-Resolution Panoramic Camera Using Conventional Imagers and a Mirror Pyramid, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, No. 2, 2007, 356-361 <a href="../../publications/pami2007_hua_ahuja_gao_panoramic_camera.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Xu, N. Ahuja, R. Bansal, Object segmentation using graph cut based active contours, Computer Vision and Image Understanding,Volume 107, Issue 3, September 2007, Pages 210-224 <a href="../../publications/cviu2007_xu_bansal_ahuja.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">Lai, Y-C., C.P.L. Barkan, J. Drapa,  N. Ahuja, J.M. Hart, P.J. Narayanan, C.V. Jawahar, A. Kumar, L. Milhon and M.P. Stehly 2007.  Machine-vision analysis of the energy efficiency of intermodal freight trains. Journal of Rail and Rapid Transit 221: 353-364. <a href="../../publications/jrrt2007_lai_et_al.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Todorovic and N. Ahuja, Region Based Hierarchical Image Matching, Int. Journal of Computer Vision, to appear. <a href="../../publications/ijcv2007_sintod_ahuja_regionbasedmatching.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">A. Briassouli and N. Ahuja, Integration of Frequency and Space for Multiple Motion Estimation and Shape-Independent Object Segmentation, IEEE Transactions on Circuits, Systems and Video Technology, to appear</p>
<p class="MsoNormal" style="margin-right: 7.5pt; color: #99ffff;">H. Wang and N. Ahuja, A Tensor Approximation Approach to Dimensionality Reduction, International Journal of Computer Vision, 76:3, March 2008, 217-229. <a style="color: #ccffff;" href="../../publications/ijcv2008_wang_ahuja_tensor_dimensionality.pdf">Full Text</a><br style="color: #ccffff;" /></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Todorovic and N. Ahuja, Unsupervised Category Modeling, Recognition, and Segmentation in Images, IEEE Trans. Pattern Analysis and Machine Intelligence, 2008, to appear. <a href="../../%7Esintod/research/publications/CategoryExtractionPAMI07.pdf">Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt; color: #ccffff;">
<input id="gwProxy" type="hidden"><!--Session data--></input>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=382</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Book Chapters</title>
		<link>http://vision.ai.uiuc.edu/?p=380</link>
		<comments>http://vision.ai.uiuc.edu/?p=380#comments</comments>
		<pubDate>Thu, 07 May 2009 06:50:40 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Book Chapters]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=380</guid>
		<description><![CDATA[<p style="margin-right: 7.5pt;"><span id="more-380"></span>N. Ahuja, Mosaic Models for Textures. A. Rosenfeld (ed.), Image Modeling, Academic Press, 1981, 1-8. (Also appears in Digital Image Processing and Analysis: Vol. 1, R. Chellappa and A. Sawchuk (Eds.), IEEE Computer Society Press, Washington D. C., 1986.)</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
</p><p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="margin-right: 7.5pt;"><span id="more-380"></span>N. Ahuja, Mosaic Models for Textures. A. Rosenfeld (ed.), Image Modeling, Academic Press, 1981, 1-8. (Also appears in Digital Image Processing and Analysis: Vol. 1, R. Chellappa and A. Sawchuk (Eds.), IEEE Computer Society Press, Washington D. C., 1986.)</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and A. Rosenfeld, Image Models. P. R. Krishnaiah and L. N. Kanal (eds.), Handbook of Statistics, Volume 2, North Holland, l982, 383-397.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and S. Swamy, Multiprocessor Pyramid Architectures for Bottom-Up Image Analysis, A. Rosenfeld (ed.), Multiresolution Image Processing and Analysis, Volume 1, Springer-Verlag, 1984, 38-59.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Veenstra and N. Ahuja, Deriving Object Octrees from Images, G. Goos and J. Hartmanis (eds.), Lecture Notes in Computer Science, 206, (same as Proc. 5th Conf. on Foundations of Software Technology and Theoretical Computer Science, December 16-18, 1985, New Delhi, India, editor: S. N. Maheshwari), Springer-Verlag, 1985, 196-211.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, Texture Analysis, S. Shapiro (ed.), Encyclopedia of Artificial Intelligence, Wiley, 1987, 1101-1115. (Updated Versions Reprinted in second edition, 1992, 1587-1605.)</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and M. Tuceryan, Dot Pattern Analysis. S. Shapiro (ed.), Encyclopedia of Artificial Intelligence, Wiley, 1987, 253-256.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M. Sharma, N. Ahuja and J. Patel, NETRA: An Architecture for a Large Scale Multiprocessor Vision System, L. Uhr (ed.), Parallel Computer Vision, Academic Press, 1987, 87-105.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">A. N. Choudhary, S. Das, N. Ahuja and J. Patel, High Speed Stereo, S. Prasad and R. L. Kashyap (Eds.), Spectral Analysis in One or Two Dimensions, Vedam Books International, 1990, 577-590.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">Blostein and N. Ahuja, Shape from Texture: Integrating Texture Element Extraction and Surface Estimation, R. Kasturi and R. C. Jain (Eds.), Computer Vision: Principles, IEEE Computer Society Press, Washington D. C., 1991, 325-343. (Also appears in IEEE Trans. Pattern Analysis and       Machine Intelligence, December 1989, 1233-1251.)</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">A. N. Choudhary, J. H. Patel and N. Ahuja, Architecture and Performance Evaluation of NETRA, V. K. Prasanna Kumar (Ed.), Parallel Architectures and Algorithms for Image Understanding, Academic Press, 1991, 251-278.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, T. Huang and N. Ahuja, Motion and Structure from Two Perspective Views: Algorithm, Error Analysis, and Error Estimation R. Kasturi and R. C. Jain (Eds.), Computer Vision: Advances and Applications, IEEE Computer Society Press, Washington D. C., 1991, 352-377. (Also appears in IEEE Trans. Pattern Analysis and Machine Intelligence, May 1989, 451-476. <a href="../../abstracts/pub1_3_2_a0589wha.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">S. Das and N. Ahuja, Integrating Mutilresolution Image Acquisition and Coarse-to-fine Surface Reconstruction from Stereo from Stereo, L. Wolff, S. Shafer and G. Healey (Eds.), Physics Based Vision, Principles and Practice: Radiometry, Jones and Bartlett, 1992, 361-367. <a href="../../abstracts/pub1_5_1_a1189das.htm">Abstract</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">J. Weng, T. S. Huang and N. Ahuja, Object Recognition by a Self-Organizing Neural Network which Grows Adaptively, Parallel Image Analysis, A. Nakamura, et.al. (ed.), Springer-Verlag, 1992, 32-33.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, Computer Vision, McGraw-Hill Encyclopedia of Science and Technology, 1995.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and R. Charan, Pixel Matching and Motion Segmentation in Image Sequences, Lecture       Notes in Computer Science &#8211; Recent Developments in Computer Vision, 1035, S. Z. Li, D. P. Mital, E. K. Teoh, and H. Wang (Eds.), Springer-Verlag, 1996, 139-148.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja, A Transform for Multiscale Image Segmentation, K. Bowyer and N. Ahuja (Eds.), Advances in Image Understanding: A Festschrift for Azriel Rosenfeld, IEEE Computer Society Press, June 1996, 45-64.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and R. Charan, Curve Detection in 3D Dot Patterns using Voronois Impact on Modern Science, Voronois Impact on Modern Science, H. Syta and P. Engel (Eds.), National Academy of Sciences of Ukraine, Institute of Mathematics, Book II, 1998, 204-212.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M. Shaw, N. Ahuja and S. Hutchinson, Coordination, Collaboration, and Control of Multi-Robot Systems, Handbook for Industrial Robotics, 2nd Edition, S. Nof (Ed.), Wiley, 1999, 423-437.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">N. Ahuja and S. Sull, Analysis Guided Video Synthesis for Hyper Reality, N. Terashima and J. Tiffin (Eds.), Hyper Reality: The Infrastructure of the Information Society, to appear.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">M. Singh and N. Ahuja, Edge Measures Using Similarity Regions, Foundations of Image Understanding, L. S. David (Ed.), Kluwer, 2002, 241-288. <a href="../../abstracts/pub3_4_1_a0601singhmk.htm">Abstract and Full Text</a></p>
<p class="MsoNormal" style="margin-right: 7.5pt;">
<p>N. Ahuja and S. Sull, Analysis Guided Video Synthesis for Hyper Reality, N. Terashima and J. Tiffin (Eds.), Hyper Reality: The Infrastructure of the Information Society, to appear.</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=380</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Books Edited</title>
		<link>http://vision.ai.uiuc.edu/?p=378</link>
		<comments>http://vision.ai.uiuc.edu/?p=378#comments</comments>
		<pubDate>Thu, 07 May 2009 06:49:39 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Books Edited]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=378</guid>
		<description><![CDATA[<p><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;"><span id="more-378"></span>K. Bowyer and N. Ahuja, <em>Advances in Image Understanding: A Festschrift for Azriel Rosenfeld</em>, IEEE Computer Society Press, June 1996.</span></p>
]]></description>
			<content:encoded><![CDATA[<p><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;"><span id="more-378"></span>K. Bowyer and N. Ahuja, <em>Advances in Image Understanding: A Festschrift for Azriel Rosenfeld</em>, IEEE Computer Society Press, June 1996.</span></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=378</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Books Authored</title>
		<link>http://vision.ai.uiuc.edu/?p=369</link>
		<comments>http://vision.ai.uiuc.edu/?p=369#comments</comments>
		<pubDate>Thu, 07 May 2009 06:42:21 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Books Authored]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=369</guid>
		<description><![CDATA[<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;"><span id="more-369"></span>N. Ahuja and B. Schachter, <em>Pattern Models</em>, Wiley, l983. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;"> </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;">J. Weng, T. S. Huang and </span><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;">N. Ahuja</span><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;">, <em>Motion and Structure from Image Sequences</em>, Springer-Verlag, 1992. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;"> </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;">M. H. Yang and </span><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;">N. Ahuja</span><span style="font-size: 11pt; font-family: &#34;Book Antiqua&#34;;">, <em>Face Detection and Hand Gesture Recognition for Vision-Based</em> <em>Human Computer&#8230;</em></span></p>]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;"><span id="more-369"></span>N. Ahuja and B. Schachter, <em>Pattern Models</em>, Wiley, l983. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;"> </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;">J. Weng, T. S. Huang and </span><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;">N. Ahuja</span><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;">, <em>Motion and Structure from Image Sequences</em>, Springer-Verlag, 1992. </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;"> </span></p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;">M. H. Yang and </span><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;">N. Ahuja</span><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;">, <em>Face Detection and Hand Gesture Recognition for Vision-Based</em> <em>Human Computer Interaction</em>, Kluwer Academic Publishers, 2001. </span><strong><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;"> </span></strong></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=369</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Patents</title>
		<link>http://vision.ai.uiuc.edu/?p=367</link>
		<comments>http://vision.ai.uiuc.edu/?p=367#comments</comments>
		<pubDate>Thu, 07 May 2009 06:41:16 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Patents]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=367</guid>
		<description><![CDATA[<p style="margin-right: 7.5pt;"><span id="more-367"></span>A. Krishnan and N. Ahuja, Imaging Apparatus and Method for Determining Range from Focus and Focus Information, U. S. Patent, Issued September 1995. European Patent issued, October 2001.</p>
<p style="margin-right: 7.5pt;">
</p><p style="margin-right: 7.5pt;">N. Ahuja and M. Tabb, Multiscale Image Edge and Region Detection Method and&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="margin-right: 7.5pt;"><span id="more-367"></span>A. Krishnan and N. Ahuja, Imaging Apparatus and Method for Determining Range from Focus and Focus Information, U. S. Patent, Issued September 1995. European Patent issued, October 2001.</p>
<p style="margin-right: 7.5pt;">
<p style="margin-right: 7.5pt;">N. Ahuja and M. Tabb, Multiscale Image Edge and Region Detection Method and Apparatus, U. S. Patent, Issued September 1998.</p>
<p style="margin-right: 7.5pt;">
<p style="margin-right: 7.5pt;">R. Dugad and N. Ahuja, Transform Domain Significant Coefficient Digital Image Watermarking Method,       U.S. patent Application filed, June 2000.</p>
<p style="margin-right: 7.5pt;">
<p style="margin-right: 7.5pt;">H. Hua and N. Ahuja, Method and Apparatus for a High-Resolution and Real-Time Panoramic Camera, Patent Application Filed, November 2001.</p>
<p style="margin-right: 7.5pt;">
<p class="MsoNormal" style="margin-right: 7.5pt;">R. Dugad and N. Ahuja, Transformation of Image Parts in Different domains to Obtain Resultant Image Size Different From Initial Image Size, U.S. Patent, Issued October 2004.</p>
<p class="MsoNormal" style="margin-right: 7.5pt;"><span style="font-size: 11pt; font-family: &quot;Book Antiqua&quot;;"> </span></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=367</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A New Omni-directional Stereo Vision System Using Single Camera</title>
		<link>http://vision.ai.uiuc.edu/?p=353</link>
		<comments>http://vision.ai.uiuc.edu/?p=353#comments</comments>
		<pubDate>Thu, 07 May 2009 06:31:12 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Omni-directional Stereo]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=353</guid>
		<description><![CDATA[<p style="text-align: justify;"><img class="alignleft" style="border: 0pt none;" src="../../project_new/panarama-camera.jpg" border="0" alt="" width="98" height="101" /> We describe a new omnidirectional stereo imaging system that uses a concave lens and a convex mirror to produce a stereo pair of images on the sensor of a conventional camera. The light incident from a scene point is split&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;"><img class="alignleft" style="border: 0pt none;" src="../../project_new/panarama-camera.jpg" border="0" alt="" width="98" height="101" /> We describe a new omnidirectional stereo imaging system that uses a concave lens and a convex mirror to produce a stereo pair of images on the sensor of a conventional camera. The light incident from a scene point is split and directed to the camera in two parts. One part reaches camera directly after reflection from the convex mirror and forms a single-viewpoint omnidirectional image. The second part is formed by passing a subbeam of the reflected light from the mirror through a concave lens and forms a displaced single viewpoint image where the disparity depends on the depth of the scene point. A closed-form expression for depth is derived. Since the optical components used are simple and commercially available, the resulting system is compact and inexpensive. This, and the simplicity of the required image processing algorithms, make the proposed system attractive for real-time applications, such as autonomous navigation and object manipulation. The experimental prototype we have built is described.<span id="more-353"></span></p>
<ol>
<li>Sooyeong Yi, Narendra Ahuja, An Omnidirectional Stereo System Using a Single Camera, 18th International Conference on (ICPR&#8217;06), 2006, Hong Kong, China <a href="../../publications/sooyeong_camera.pdf">Full Text<br />
</a></li>
<li>Sooyeong Yi, Narendra Ahuja, A Novel Omnidirectional Stereo System Using a Single Camera, International Conference on Image Analysis and Recognition (ICIAR), 2006 <a href="http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=01699976">Full Text</a></li>
</ol>
<input id="gwProxy" type="hidden"><!--Session data--></input>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=353</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A New Imaging Model</title>
		<link>http://vision.ai.uiuc.edu/?p=343</link>
		<comments>http://vision.ai.uiuc.edu/?p=343#comments</comments>
		<pubDate>Thu, 07 May 2009 06:22:58 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[new imaging model]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=343</guid>
		<description><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj2_4_1_icon.jpg" border="2" alt="" width="125" height="61" /></p>
<p>In developing the new opto-geometric configurations, we have found that certain classical models and approaches cease to be adequate. For example, the long-established Gaussian model of image formation fails to adequately predict the acquired images, and the optical and geometric&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj2_4_1_icon.jpg" border="2" alt="" width="125" height="61" /></p>
<p>In developing the new opto-geometric configurations, we have found that certain classical models and approaches cease to be adequate. For example, the long-established Gaussian model of image formation fails to adequately predict the acquired images, and the optical and geometric phenomena ignored in the traditional characterization of the most focused scene point make the traditional methods of focus analysis unacceptable. We have the old models with new, more rigorous, and satisfactory models. These new models are also useful in contexts other than next generation camera designs &#8211; they are useful in improving the performance of currently &#8220;acceptable&#8221; systems, and in extending the applicability of computer vision methods to many scenarios and applications which were out of reach otherwise.</p>
<p><span id="more-343"></span>M. Aggarwal and N. Ahuja, A New Imaging Model, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 82-89. <a href="../../manoj/model_aggarwal.pdf">Full Paper(PDF)</a></p>
<p>M. Aggarwal, H. Hua and N. Ahuja, On Cosine-Fourth and Vignetting Effects in Lenses, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 472-479. <a href="../../manoj/cos4_aggarwal.pdf">Full Paper(PDF)</a></p>
<p>M. Aggarwal and N. Ahuja, A Pupil-Centric Model of Image Formation, International Journal on Computer Vision, 48(3), 2002, 195-214.  <a href="http://www.springerlink.com/content/jl2c8uvntegtqba3/">Secured Access to Text</a></p>
<input id="gwProxy" type="hidden"><!--Session data--></input>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=343</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hemispherical Imaging Camera</title>
		<link>http://vision.ai.uiuc.edu/?p=336</link>
		<comments>http://vision.ai.uiuc.edu/?p=336#comments</comments>
		<pubDate>Thu, 07 May 2009 06:11:02 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Hemispherical Camera]]></category>
		<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=336</guid>
		<description><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/hemispherical.gif" border="2" alt="Image not available" width="127" height="98" /></span></p>
<p style="margin-bottom: 0in;" align="justify">We have developed a camera which is capable of acquiring very large field of view (FOV) images at high and uniform resolution, from a single viewpoint, at video rates. The FOV can range from being nearly hemispherical, to being nearly&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/hemispherical.gif" border="2" alt="Image not available" width="127" height="98" /></span></p>
<p style="margin-bottom: 0in;" align="justify">We have developed a camera which is capable of acquiring very large field of view (FOV) images at high and uniform resolution, from a single viewpoint, at video rates. The FOV can range from being nearly hemispherical, to being nearly omni-directional, barring some small scene parts being obstructed by image sensors themselves. The camera consists of multiple imaging sensors and a hexagonal prism made of planar mirror faces. Each sensor is paired with a planar face of the prism. The sensors are positioned in such a way that they image different parts of the scene from a single virtual viewpoint, either directly or after reflections off the prism. A panoramic image is constructed by concatenating the images taken by different sensors. The resolution of the panoramic image is proportional to the number of sensors used and therefore a multiple of that of an individual sensor. Further, the resolution is substantially uniform across the entire panoramic image.</p>
<p><span id="more-336"></span></p>
<p style="text-align: justify;">1. Introduction</p>
<p style="text-indent: 0.25in; margin-bottom: 0in;" align="justify">A panoramic camera is an imaging device capable of capturing a very large field of view (FOV). Like any other camera, it is desirable that such cameras acquire the entire FOV from a single viewpoint, in real time, at high resolution which is uniform across the FOV, with large dynamic range, and over a large depth of field. Such devices find applications in many areas including tele-conferencing, surveillance and robot navigation. Many efforts have been made to achieve various subsets of these properties (i.e. wide FOV, high and uniform resolution, large depth of field, high dynamic range, a single viewpoint, and real-time acquisition. These methods of capturing panoramic or omni-directional images fall into two categories: dioptric methods, where only refractive elements (lenses) are employed, and catadioptric methods, where a combination of reflective and refractive components is used.</p>
<p style="text-indent: 0.25in; margin-bottom: 0in;" align="justify">Typical dioptric systems include camera clusters, panning cameras, and fisheye lenses. Catadioptric methods include curved mirror systems where a conventional camera captures the scene reflected off a single non-planar mirror (e.g. parabolic or hyperbolic mirror), and planar mirror systems such as mirror pyramid systems where multiple conventional cameras image the scene reflected off the faces of a mirror-pyramid. The cameras that use a parabolic- or a hyperbolic-mirror to map an omni-directional view onto a single sensor are able to capture a large FOV from a single viewpoint at video rate. However, the FOV shape is hemispherical minus a central cone which is blocked by self-occlusion. The overall resolution of the acquired images is limited to that of the sensor used, and further it varies with the viewing direction across the ring-like FOV, e.g., from a high just outside the central blind spot to a low in the periphery. The cameras using a spherical or conical mirror have similar properties as those using parabolic or hyperbolic mirrors except that they do not possess a single viewpoint.</p>
<p style="text-indent: 0.25in; margin-bottom: 0in;" align="justify">Many of the aforementioned systems provide a cylindrical shape FOV which is 360 degrees wide in azimuth, but has limited height in elevation (Fig. 1a). In certain applications such as robot navigation and surveillance, however, a hemispherical shape FOV is highly desirable (Fig. 1b). We have developed a system which is capable of acquiring hemispherical panoramic images in real time, with high and substantially uniform resolution, and from a single viewpoint. By substantially uniform resolution we mean the same level of uniformity as delivered by a conventional, non-panoramic camera.</p>
<p style="text-align: center;" align="center"><img id="_x0000_i1025" src="../../newpubs/hem/summary_files/image001.gif" alt="Text Box:                                                          (a)						(b)   Figure 1: Two types of FOV�s: (a) Cylindrical, (b) Hemispherical." width="677" height="239" /></p>
<p style="text-align: justify;">2. Prototype</p>
<p style="text-align: justify; text-indent: 13.7pt;">
<p style="text-align: justify; text-indent: 13.5pt;">
<p style="text-align: justify; text-indent: 13.5pt;">
<p class="MsoNormal" style="margin-left: 0.5in; text-align: center;" align="center"><!--[if gte vml 1]><v:shapetype  id="_x0000_t75" coordsize="21600,21600" o:spt="75" o:preferrelative="t"  path="m@4@5l@4@11@9@11@9@5xe" filled="f" stroked="f"> <v:stroke joinstyle="miter" /> <v:formulas> <v:f eqn="if lineDrawn pixelLineWidth 0" /> <v:f eqn="sum @0 1 0" /> <v:f eqn="sum 0 0 @1" /> <v:f eqn="prod @2 1 2" /> <v:f eqn="prod @3 21600 pixelWidth" /> <v:f eqn="prod @3 21600 pixelHeight" /> <v:f eqn="sum @0 0 1" /> <v:f eqn="prod @6 1 2" /> <v:f eqn="prod @7 21600 pixelWidth" /> <v:f eqn="sum @8 21600 0" /> <v:f eqn="prod @7 21600 pixelHeight" /> <v:f eqn="sum @10 21600 0" /> </v:formulas> <v:path o:extrusionok="f" gradientshapeok="t" o:connecttype="rect" /> <o:lock v:ext="edit" aspectratio="t" /> </v:shapetype><v:shape id="_x0000_s1026" type="#_x0000_t75" alt="Text Box:  &#13;&#10;&#13;&#10;Figure 2a: The prototype hardware.&#13;&#10;&#13;&#10; &#13;&#10;&#13;&#10;Figure 2b. The prototype of Fig. 2a in a ceiling mount.&#13;&#10;"  style='position:absolute;left:0;text-align:left;margin-left:0;margin-top:0;  width:373.5pt;height:672.75pt;z-index:1;mso-position-horizontal:left;  mso-position-vertical-relative:line' o:allowoverlap="f"> <v:imagedata src="summary_files/image002.gif" mce_src="summary_files/image002.gif" /> <w:wrap type="square" /> </v:shape><![endif]--><!--[if !vml]--><img style="width: 333px; height: 441px;" src="../../images/next_gen1.jpg" alt="Prototype as a mount" /></p>
<p class="MsoNormal" style="margin-left: 0.5in; text-align: center;" align="center"><!--[endif]--> Figure 2. Prototype in a ceiling mount</p>
<p style="margin-left: 0.25in; text-align: center; text-indent: -0.25in;"><img style="width: 531px; height: 430px;" src="../../images/next_gen2.jpg" alt="a sample image acquired by the prototype. The effective FOV from the top boundary to the bottom boundary is exactly 140 degrees." /></p>
<p class="MsoNormal" style="text-align: justify; text-indent: 13.5pt;"><!--[if gte vml 1]><v:shape id="_x0000_s1027" type="#_x0000_t75"  alt="Text Box:    &#13;&#10;(a)&#13;&#10; &#13;&#10;(b)&#13;&#10;Figure 4: Hemispherical panoramic images before and after compensating for mixing artifacts. (a) Before compensation; (b) After compensation.&#13;&#10;"  style='position:absolute;left:0;text-align:left;margin-left:0;margin-top:0;  width:229.5pt;height:346.5pt;z-index:2;mso-position-horizontal:left;  mso-position-vertical-relative:line' o:allowoverlap="f"> <v:imagedata src="summary_files/image003.gif" mce_src="summary_files/image003.gif" /> <w:wrap type="square" /> </v:shape><![endif]--><!--[if !vml]--><!--[endif]--><!--[if gte vml 1]><v:shape id="_x0000_s1028"  type="#_x0000_t75" alt="Text Box:    &#13;&#10;&#13;&#10;      &#13;&#10;&#13;&#10;      &#13;&#10;&#13;&#10; &#13;&#10;Figure 3: Experimental results.&#13;&#10;"  style='position:absolute;left:0;text-align:left;margin-left:0;margin-top:0;  width:247.5pt;height:465pt;z-index:3;mso-position-horizontal:left;  mso-position-vertical-relative:line' o:allowoverlap="f"> <v:imagedata src="summary_files/image004.gif" mce_src="summary_files/image004.gif" /> <w:wrap type="square" /> </v:shape><![endif]--><!--[if !vml]--><!--[endif]--></p>
<p style="text-align: justify; text-indent: 13.5pt;">
<p style="text-indent: 0.19in; margin-bottom: 0in; text-align: center;">Fig. 3 shows a sample image acquired by the prototype. The effective FOV from the top boundary to the bottom boundary is exactly 140 degrees.</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=336</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Single Lens Depth Camera</title>
		<link>http://vision.ai.uiuc.edu/?p=320</link>
		<comments>http://vision.ai.uiuc.edu/?p=320#comments</comments>
		<pubDate>Thu, 07 May 2009 06:03:39 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Single Lens Depth Camera]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=320</guid>
		<description><![CDATA[<p style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/ppp_concept.jpg" border="2" alt="Image not available" width="75" height="53" /></span>We propose a novel depth sensing imaging system composed of a single camera along with a parallel planar plate rotating about the optical axis of the camera. Compared with conventional stereo systems, only one camera is utilized to capture stereo&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/ppp_concept.jpg" border="2" alt="Image not available" width="75" height="53" /></span>We propose a novel depth sensing imaging system composed of a single camera along with a parallel planar plate rotating about the optical axis of the camera. Compared with conventional stereo systems, only one camera is utilized to capture stereo pairs, which can improve the accuracy of correspondence detection as is the case for any single camera stereo systems. The proposed system is able to capture multiple images by simply rotating the plate. With multiple stereo pairs, it is possible to obtain precise depth estimates, without encountering matching ambiguity problems, even for objects with low texture. Given the large number of resulting images, in conjunction with the estimated depth map, we show that the proposed system is also capable of acquiring super-resolution images. Finally, experimental results on reconstructing 3D structures and recovering high-resolution textures are presented.</p>
<p style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;"><span id="more-320"></span></p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify;">Stereo is one of the most widely explored sources of scene depth. Stereo usually refers to spatial stereo, wherein two cameras, separated by a baseline, simultaneously capture stereo image pairs. The spatial disparity in the images of the same scene feature then captures the feature&#8217;s depth. More than two cameras can also be used to capture the disparity information across multiple views.</p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">An alternative to such spatial stereo is temporal stereo wherein a single camera is relocated to the same set of viewpoints to capture the two or more images sequentially. This loses the parallel imaging capability and therefore the ability to handle fast moving objects, but it reduces the number of cameras used to one as well as eliminates the need for photometric calibration of the camera if needed for feature/intensity matching of stereo images.</p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: center; text-indent: 9pt;" align="center">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;"><!--[if gte vml 1]><v:shapetype id="_x0000_t202"  coordsize="21600,21600" o:spt="202" path="m,l,21600r21600,l21600,xe"> <v:stroke joinstyle="miter" /> <v:path gradientshapeok="t" o:connecttype="rect" /> </v:shapetype><v:shape id="_x0000_s1163" type="#_x0000_t202" style='position:absolute;  left:0;text-align:left;margin-left:0;margin-top:3.05pt;width:233.7pt;height:128.6pt;  text-indent:0;z-index:3;mso-position-horizontal:left' strokecolor="white"> <v:textbox style="mso-next-textbox:#_x0000_s1163" mce_style="mso-next-textbox:#_x0000_s1163"> <![if !mso]></p>
<table cellpadding=0 cellspacing=0 width="100%">
<tr>
<td><![endif]></p>
<div>
<p class=MsoNormal align=center style="text-align:center" mce_style="text-align:center"><v:shapetype      id="_x0000_t75" coordsize="21600,21600" o:spt="75" o:preferrelative="t"      path="m@4@5l@4@11@9@11@9@5xe" filled="f" stroked="f"> <v:stroke joinstyle="miter" /> <v:formulas> <v:f eqn="if lineDrawn pixelLineWidth 0" /> <v:f eqn="sum @0 1 0" /> <v:f eqn="sum 0 0 @1" /> <v:f eqn="prod @2 1 2" /> <v:f eqn="prod @3 21600 pixelWidth" /> <v:f eqn="prod @3 21600 pixelHeight" /> <v:f eqn="sum @0 0 1" /> <v:f eqn="prod @6 1 2" /> <v:f eqn="prod @7 21600 pixelWidth" /> <v:f eqn="sum @8 21600 0" /> <v:f eqn="prod @7 21600 pixelHeight" /> <v:f eqn="sum @10 21600 0" /> </v:formulas> <v:path o:extrusionok="f" gradientshapeok="t" o:connecttype="rect" /> <o:lock v:ext="edit" aspectratio="t" /> </v:shapetype><v:shape id="_x0000_i1027" type="#_x0000_t75" style='width:180.75pt;      height:101.25pt' o:ole=""> <v:imagedata src="summary_files/image011.wmz" mce_src="summary_files/image011.wmz" o:title="" /> </v:shape><![if gte mso 9]><o:OLEObject Type="Embed"      ProgID="Visio.Drawing.11" ShapeID="_x0000_i1027" DrawAspect="Content"      ObjectID="_1179520685"> </o:OLEObject> <![endif]></p>
<p class=MsoNormal align=center style="text-align:center" mce_style="text-align:center"><span     style="font-size:9.0pt;mso-bidi-font-size:10.0pt" mce_style="font-size:9.0pt;mso-bidi-font-size:10.0pt">Figure 1. Concept illust</span><st1:PersonName><span      style="font-size:9.0pt;mso-bidi-font-size:10.0pt" mce_style="font-size:9.0pt;mso-bidi-font-size:10.0pt">r</span></st1:PersonName><span     style="font-size:9.0pt;mso-bidi-font-size:10.0pt" mce_style="font-size:9.0pt;mso-bidi-font-size:10.0pt">ation<o:p></o:p></span></p>
</div>
<p><![if !mso]></td>
</tr>
</table>
<p><![endif]></v:textbox> <w:wrap type="square" /> </v:shape><![endif]--><!--[if !vml]--><img src="../../newpubs/depth/summary_files/image012.gif" alt="Text Box:   Figure 1. Concept illustration" hspace="12" width="318" height="178" align="left" /><!--[endif]--></p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">We propose a single-camera depth estimation system that captures a large number of images, after the incoming light from a scene point has been deflected in a manner that depends on the object depth. The deflection mechanism is the passage of light through a thick glass plate placed in front of an ordinary camera at a certain orientation to the optical axis. In order to estimate depth, at least two images captured under two different plate poses are necessary. However, a larger number of images, containing redundant depth information, are acquired by changing the plate orientation sequentially, e.g. by rotating and/or reorienting the plate. Rotating the plate at a fixed orientation with respect to the optical axis is a mechanically convenient way of obtaining a large number of depth-coded images, followed, if desired, by more plate orientations and rotations, to acquire more images, as illustrated in Fig 1. An analysis of the correspondences among the set of images yields depth estimates. High quality, dense depth estimation distinguishes this new camera from other single camera stereo systems.</p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<h3 class="MsoNormal" style="margin-right: 0.2in; text-align: justify;">Depth Dependent Pixel Displacement</h3>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify;">It is well known from optics that light ray passing through a planar plate will encounter a lateral displacement. For a camera-plate system, this phenomenon is shown as the pixel shifts in the image. For a given object point, we assume that the point is shifted by the plate in a plane parallel with image plane.</p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">The displacements of a pixel depend differently on changes in the plate tilt angle and rotation angle. Using both changes leads to robust estimation because the lack of sensitivity to one type of change is complemented by higher sensitivity to the other. Depth estimation is done by minimizing a cost function which is overdetermined due to the large number of images available. The cost function includes the error due to the fit of the result to the multiple estimates available at a single pixel, and the local roughness of the surface at the pixel.</p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">The dimensions of the plate and its tilt angle, among other parameters, have to be selected properly to achieve good depth estimates. The plate parameters affect the amount of depth-sensitive displacement in the image. Larger refractive index and thicker plate yield a larger displacement, which corresponds to a higher depth resolution. However, larger refractive index and thicker plate also introduce larger chromatic aberration, which may degrade the image quality. The tilt angle of the plate also affects the displacement &#8211; larger the tilt angle, larger the pixel displacement for the same depth. However, with the increase of the tilt angle, the size of the plate increases dramatically.</p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify;">
<h3 class="MsoNormal" style="margin-right: 0.2in; text-align: justify;">Prototype</h3>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">We have developed a prototype with the rotating plate oriented at a fixed tilt angle. It requires calibration only once, in the beginning, after which the system acquires images continuously without the need for any further calibration. This is achieved by rotating the plate continuously about the x, y and/or z axes, acquiring images at video rate. The images of a scene point, acquired during rotation, lie along a 4-dimensional manifold in the space defined by the object depth in the viewing direction, and the three rotation angles. To estimate object depth in a given direction, we simply find the best estimate of the intersection of the line in that direction with the manifold samples obtained from the images acquired during rotation. Each pixel in each image thus contributes to the number of manifold samples. Since the locations of pixels in different images define a denser array of directions than possible with a single (orientation) image grid, the system yields depth estimates along a direction array denser than the original images.</p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">We used a Sony DFX900 color camera equipped with a 16mm lens, and an ordinary glass plate with 13mm thickness and refractive index of around 1.5. The plate was mounted on a rotary stage and tilted approximately 45 degrees with respect to the image plane. A total 36 plate poses , evenly distributed within the range of 360 degrees, were calibrated and used to recover depth and synthesize the super-resolution images.</p>
<p class="MsoBodyText3" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt; line-height: normal;">We tested our system with two objects: house and monster head. The experimental results of the house object are shown in Fig 2. Fig. 2a is one of the input images taken through the plate, Fig. 2b shows the recovered depth map, and Fig. 2c shows a new view generated from the reconstructed house model.</p>
<p class="MsoBodyText3" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt; line-height: normal;">Fig 3 shows the experimental results for the monster head object. Figs 3a~3c show one of input images, the recovered depth map, and a new view of the reconstructed head model, respectively. This result indicates that the camera performs well for objects having a limited amount of texture.</p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p><br style="page-break-before: always;" /></p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;"><!--[if mso &#038; !supportInlineShapes &#038; supportFields]><span style="mso-element:field-begin;mso-field-lock:yes" mce_style="mso-element:field-begin;mso-field-lock:yes"></span><span style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span>SHAPE <span style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span>\* MERGEFORMAT <span style="mso-element:field-separator" mce_style="mso-element:field-separator"></span><![endif]--><!--[if gte vml 1]><v:shape  id="_x0000_s1180" type="#_x0000_t202" style='width:486pt;height:369.2pt;  mso-position-horizontal-relative:char;mso-position-vertical-relative:line'  strokecolor="white"> <v:textbox> <![if !mso]></p>
<table cellpadding=0 cellspacing=0 width="100%">
<tr>
<td><![endif]></p>
<div>
<p class=MsoNormal style="text-indent:.5in" mce_style="text-indent:.5in"><v:shape id="_x0000_i1028"      type="#_x0000_t75" style='width:92.25pt;height:91.5pt' o:ole=""> <v:imagedata src="summary_files/image003.png" mce_src="summary_files/image003.png" o:title="" /> </v:shape><![if gte mso 9]><o:OLEObject Type="Embed" ProgID="MSPhotoEd.3"      ShapeID="_x0000_i1028" DrawAspect="Content" ObjectID="_1179520686"> </o:OLEObject> <![endif]><span style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span><span style="mso-tab-count:     4" mce_style="mso-tab-count:     4">����������������������������������������������������� </span><span     style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span><v:shape id="_x0000_i1029" type="#_x0000_t75"      style='width:108.75pt;height:101.25pt' o:ole=""> <v:imagedata src="summary_files/image004.png" mce_src="summary_files/image004.png" o:title="" /> </v:shape><![if gte mso 9]><o:OLEObject Type="Embed" ProgID="MSPhotoEd.3"      ShapeID="_x0000_i1029" DrawAspect="Content" ObjectID="_1179520687"> </o:OLEObject> <![endif]><span style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span></p>
<p class=MsoNormal style="margin-left:.5in;text-indent:.5in" mce_style="margin-left:.5in;text-indent:.5in">(a)<span     style="mso-tab-count:6" mce_style="mso-tab-count:6">������������������������������������������������������������������������������������������ </span>(b)</p>
<p class=MsoNormal style="margin-left:.5in;text-indent:.5in" mce_style="margin-left:.5in;text-indent:.5in"><o:p>&nbsp;</o:p></p>
<p class=MsoNormal align=center style="text-align:center" mce_style="text-align:center"><v:shape id="_x0000_i1030"      type="#_x0000_t75" style='width:209.25pt;height:181.5pt' o:ole=""> <v:imagedata src="summary_files/image005.png" mce_src="summary_files/image005.png" o:title="" /> </v:shape><![if gte mso 9]><o:OLEObject Type="Embed" ProgID="MSPhotoEd.3"      ShapeID="_x0000_i1030" DrawAspect="Content" ObjectID="_1179520688"> </o:OLEObject> <![endif]></p>
<p class=MsoNormal style="margin-left:3.0in" mce_style="margin-left:3.0in">(c)</p>
<p class=MsoNormal style="margin-left:3.0in" mce_style="margin-left:3.0in"><o:p>&nbsp;</o:p></p>
<p class=MsoCommentText><span style="font-size:12.0pt" mce_style="font-size:12.0pt">Figure 2. Depth     Estimation Results for the house object. (a) One of input image; (b)     Recovered depth map; (c) A new view generated from the reconstructed 3D     model.<o:p></o:p></span></p>
</div>
<p><![if !mso]></td>
</tr>
</table>
<p><![endif]></v:textbox> <w:wrap type="none" /> <w:anchorlock /> </v:shape><![endif]--><!--[if !vml]--><img src="../../newpubs/depth/summary_files/image006.gif" alt="Text Box:   				    (a)						(b)    (c)  Figure 2. Depth Estimation Results for the house object. (a) One of input image; (b) Recovered depth map; (c) A new view generated from the reconstructed 3D model." width="654" height="498" /><!--[endif]--><!--[if mso &#038; !supportInlineShapes &#038; supportFields]><v:shape  id="_x0000_i1025" type="#_x0000_t75" style='width:486pt;height:369.2pt'> <v:imagedata croptop="-65520f" cropbottom="65520f" /> </v:shape><span style="mso-element:field-end" mce_style="mso-element:field-end"></span><![endif]--></p>
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p class="MsoNormal" style="margin-right: 0.2in; text-align: justify; text-indent: 9pt;">
<p><!--[if mso &#038; !supportInlineShapes &#038; supportFields]><span style="font-size:9.0pt;mso-bidi-font-size:10.0pt" mce_style="font-size:9.0pt;mso-bidi-font-size:10.0pt"><span style="mso-element: field-begin;mso-field-lock:yes" mce_style="mso-element: field-begin;mso-field-lock:yes"></span><span style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span>SHAPE <span style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span>\* MERGEFORMAT <span style="mso-element: field-separator" mce_style="mso-element: field-separator"></span></span><![endif]--><!--[if gte vml 1]><v:shape id="_x0000_s1188"  type="#_x0000_t202" style='width:486pt;height:364.6pt;  mso-position-horizontal-relative:char;mso-position-vertical-relative:line'  strokecolor="white"> <v:textbox> <![if !mso]></p>
<table cellpadding=0 cellspacing=0 width="100%">
<tr>
<td><![endif]></p>
<div>
<p class=MsoNormal align=center style="text-align:center" mce_style="text-align:center"><v:shape id="_x0000_i1031"      type="#_x0000_t75" style='width:92.25pt;height:121.5pt' o:ole=""> <v:imagedata src="summary_files/image007.png" mce_src="summary_files/image007.png" o:title="" /> </v:shape><![if gte mso 9]><o:OLEObject Type="Embed" ProgID="MSPhotoEd.3"      ShapeID="_x0000_i1031" DrawAspect="Content" ObjectID="_1179520689"> </o:OLEObject> <![endif]><span style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span><span style="mso-tab-count:     4" mce_style="mso-tab-count:     4">����������������������������������������������������� </span><span     style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span><v:shape id="_x0000_i1032" type="#_x0000_t75"      style='width:93.75pt;height:124.5pt' o:ole=""> <v:imagedata src="summary_files/image008.png" mce_src="summary_files/image008.png" o:title="" /> </v:shape><![if gte mso 9]><o:OLEObject Type="Embed" ProgID="MSPhotoEd.3"      ShapeID="_x0000_i1032" DrawAspect="Content" ObjectID="_1179520691"> </o:OLEObject> <![endif]><span style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�</span></p>
<p class=MsoNormal align=center style="text-align:center" mce_style="text-align:center"><o:p>&nbsp;</o:p></p>
<p class=MsoNormal style="margin-left:1.0in;text-indent:.5in" mce_style="margin-left:1.0in;text-indent:.5in">(a)<span     style="mso-tab-count:5" mce_style="mso-tab-count:5">�������������������������������������������������������������������������� </span><span     style="mso-spacerun:yes" mce_style="mso-spacerun:yes">�� </span><span style="mso-tab-count:1" mce_style="mso-tab-count:1">������������ </span><span     style="mso-spacerun:yes" mce_style="mso-spacerun:yes">������</span>(b)</p>
<p class=MsoNormal style="margin-left:3.5in" mce_style="margin-left:3.5in"><o:p>&nbsp;</o:p></p>
<p class=MsoNormal style="margin-left:38.25pt" mce_style="margin-left:38.25pt"><o:p>&nbsp;</o:p></p>
<p class=MsoNormal align=center style="text-align:center" mce_style="text-align:center"><v:shape id="_x0000_i1033"      type="#_x0000_t75" style='width:209.25pt;height:129.75pt' o:ole=""> <v:imagedata src="summary_files/image009.png" mce_src="summary_files/image009.png" o:title="" /> </v:shape><![if gte mso 9]><o:OLEObject Type="Embed" ProgID="MSPhotoEd.3"      ShapeID="_x0000_i1033" DrawAspect="Content" ObjectID="_1179520692"> </o:OLEObject> <![endif]></p>
<p class=MsoNormal align=center style="text-align:center" mce_style="text-align:center">(c)</p>
<p class=MsoNormal align=center style="text-align:center" mce_style="text-align:center"><o:p>&nbsp;</o:p></p>
<p class=MsoCommentText style="text-align:justify;text-justify:inter-ideograph" mce_style="text-align:justify;text-justify:inter-ideograph"><span     style="font-size:12.0pt" mce_style="font-size:12.0pt">Figure 3: Experimental results of the monster head     object. (a) One of input images; (b) Recovered depth map; (c) A new view     generated from the reconstructed 3D model.<o:p></o:p></span></p>
<p class=MsoCommentText style="text-align:justify;text-justify:inter-ideograph" mce_style="text-align:justify;text-justify:inter-ideograph"><span     style="font-size:12.0pt" mce_style="font-size:12.0pt"><o:p>&nbsp;</o:p></span></p>
</div>
<p><![if !mso]></td>
</tr>
</table>
<p><![endif]></v:textbox> <w:wrap type="none" /> <w:anchorlock /> </v:shape><![endif]--><!--[if !vml]--><img src="../../newpubs/depth/summary_files/image010.gif" alt="Text Box:   				     (a)					   	      (b)     (c)  Figure 3: Experimental results of the monster head object. (a) One of input images; (b) Recovered depth map; (c) A new view generated from the reconstructed 3D model." width="654" height="492" /></p>
<ol>
<li>Chunyu Gao, Narendra Ahuja, Single camera stereo using planar parallel plate , Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 4, pp. 108-111 08, 2004, Cambridge UK <a href="../../publications/Stereo_PPP_Gao.pdf">Full Text<br />
</a></li>
<li>Chunyu Gao and Narendra Ahuja, &#8220;A Refractive Camera for Acquiring Stereo and Super-resolution  Images&#8221;, Computer Vision and Pattern Recognition, IEEE Computer Society Conference on, Volume 2,  17-22 June 2006 Page(s):2316 &#8211; 2323 <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1641037">Full Text</a></li>
</ol>
<input id="gwProxy" type="hidden"><!--Session data--></input>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=320</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Multiview Double Mirror Pyramid Panoramic Cameras</title>
		<link>http://vision.ai.uiuc.edu/?p=305</link>
		<comments>http://vision.ai.uiuc.edu/?p=305#comments</comments>
		<pubDate>Thu, 07 May 2009 05:44:30 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[pyramid cameras]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=305</guid>
		<description><![CDATA[<p style="text-align: justify;"><img class="alignleft" style="border: 0pt none;" src="../../%7Etankh/Camera/spam_setup.jpg" border="0" alt="" width="100" height="117" />Panoramic images and video are useful in many   applications such as special effects, immersive virtual reality environments,   and video games. Among the numerous devices proposed for capturing panoramas,   mirror pyramid-based camera systems are a promising approach for video rate   capture,&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;"><img class="alignleft" style="border: 0pt none;" src="../../%7Etankh/Camera/spam_setup.jpg" border="0" alt="" width="100" height="117" />Panoramic images and video are useful in many   applications such as special effects, immersive virtual reality environments,   and video games. Among the numerous devices proposed for capturing panoramas,   mirror pyramid-based camera systems are a promising approach for video rate   capture, as they offer single-viewpoint imaging, and use only flat mirrors   that are easier to produce than curved mirrors. Past work has focused on   capturing panoramas from a single viewpoint.</p>
<p style="text-align: justify;">In this work, we   have extended our work on the Double   Mirror Pyramid Panoramic Camera, that acquires panoramic   images from a single viewpoint, to multiple viewpoints.</p>
<p style="text-align: justify;">
<p style="text-align: justify;">
<p style="text-align: justify;"><span id="more-305"></span></p>
<p style="text-align: justify;">
<p class="MsoNormal" style="text-align: justify;"><span style="display: none;"> </span></p>
<div>
<table class="MsoNormalTable" style="width: 6.25in;" border="0" cellspacing="0" cellpadding="0" width="600">
<tbody>
<tr>
<td style="padding: 0in;">
<h2 style="margin-left: 0.35pt; text-align: justify;"><span style="color: #000000;"><span style="font-size: 13.5pt;">Mirror Pyramid Cameras</span></span></h2>
</td>
<td style="padding: 0in;">
<p style="text-align: justify;">
</td>
</tr>
<tr>
<td style="padding: 0in;" colspan="2">
<p style="text-align: justify;">A mirror pyramid   consists of a set of flat mirror faces arranged around an axis of symmetry,   inclined to form a pyramid. By strategically positioning a number of   conventional cameras around a mirror pyramid, the viewpoints for the   individual cameras� mirror images can be colocated at a single point within   the pyramid, effectively forming a virtual camera with a wide field of view.</p>
<p style="text-align: justify;">Mirror   pyramid-based panoramic cameras have a number of attractive properties,   including</p>
<ul style="margin-top: 0in;" type="disc">
<li class="MsoNormal" style="text-align: justify;">single-viewpoint        imaging,</li>
<li class="MsoNormal" style="text-align: justify;">high resolution, and</li>
<li class="MsoNormal" style="text-align: justify;">video rate capture.</li>
</ul>
<p style="text-align: justify;">Currently existing   designs realize a single viewpoint within each mirror pyramid. In order to   capture panoramas from multiple viewpoints with these designs, the entire   physical setup would need to be relocated or duplicated. The former solution   lacks the capability of video rate imaging, and the latter leads to bulky   designs due to the multiple mirror pyramids.</p>
<h2 style="margin: 0in 0in 0.0001pt; text-align: justify;">Multiview Double Mirror Pyramid Cameras</h2>
<p style="margin: 0in 0in 0.0001pt; text-align: justify;">We have extended mirror pyramid panoramic camera to a   generalized design that accommodates multiple viewpoints. Each viewpoint is   the common mirror image of the optic points of a set of physical cameras   located outside the pyramid and together they yield a seamless panoramic   image. Each camera set yields a panoramic image from its associated   viewpoint. The result is simultaneous, multiview, panoramic, video rate   imaging with a compact design. Using a double mirror pyramid, i.e., two   pyramids back to back with a shared base, helps double the height of the   visual field in a manner similar to monocular imaging in our <span style="color: windowtext;"><span style="color: windowtext; text-decoration: none;">D</span><span style="color: windowtext; text-decoration: none;">ouble   Mirror Pyramid Panoramic</span></span> Camera.</p>
<p style="margin: 0in 0in 0.0001pt; text-align: justify;">
<p style="margin: 0in 0in 0.0001pt; text-align: justify;">The resulting set of panoramic images can be used for stereo   analysis or stereo viewing.</p>
<p style="text-align: justify;">
</td>
</tr>
</tbody>
</table>
</div>
<p class="MsoNormal" style="text-align: justify;"><span style="display: none;"> </span></p>
<div>
<table class="MsoNormalTable" style="width: 6.25in;" border="0" cellpadding="0" width="600">
<tbody>
<tr>
<td style="padding: 0.75pt;">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1025" src="../../newpubs/multiDPC/summary_files/image001.jpg" border="0" alt="" width="153" height="250" /></p>
</td>
<td style="padding: 0.75pt;">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1026" src="../../newpubs/multiDPC/summary_files/image002.jpg" border="0" alt="" width="158" height="250" /></p>
</td>
<td style="padding: 0.75pt;">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1027" src="../../newpubs/multiDPC/summary_files/image003.jpg" border="0" alt="" width="153" height="250" /></p>
</td>
</tr>
<tr>
<td style="padding: 0.75pt;">
<p class="MsoNormal" style="text-align: justify;">(a)</p>
</td>
<td style="padding: 0.75pt;">
<p class="MsoNormal" style="text-align: justify;">(b)</p>
</td>
<td style="padding: 0.75pt;">
<p class="MsoNormal" style="text-align: justify;">(c)</p>
</td>
</tr>
<tr>
<td style="padding: 0.75pt;" colspan="3">
<p class="MsoNormal" style="text-align: justify;">Figure   1. Variation in the physical camera position with viewpoint position. (a)   Viewpoint is centered within four-sided pyramid, shown with the corresponding   eight camera positions. (b) Translated viewpoints marked A, B, and C are   shown with correspondingly marked physical camera positions. (c) Same as (b),   but for a mirror pyramid having a large number of faces, to show how the   shape changes as the viewpoint translates.</p>
<p style="text-align: justify;">
</td>
</tr>
</tbody>
</table>
</div>
<p class="MsoNormal" style="text-align: justify;"><span style="display: none;"> </span></p>
<div>
<table class="MsoNormalTable" style="width: 6.25in;" border="0" cellspacing="0" cellpadding="0" width="600">
<tbody>
<tr>
<td style="padding: 0in;">
<h2 style="margin-left: 0in; text-align: justify;"><span style="color: #000000;"><a name="Experiments"></a><span style="font-size: 13.5pt;">Prototype</span></span></h2>
</td>
<td style="padding: 0in;">
<p style="text-align: justify;">
</td>
</tr>
<tr>
<td style="padding: 0in;">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1028" src="../../newpubs/multiDPC/summary_files/image004.jpg" border="0" alt="" width="250" height="155" /></p>
</td>
<td style="padding: 0in;">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1029" src="../../newpubs/multiDPC/summary_files/image005.jpg" border="0" alt="" width="250" height="290" /></p>
</td>
</tr>
<tr>
<td style="padding: 0in;">
<p class="MsoNormal" style="text-align: justify;">(a)</p>
</td>
<td style="padding: 0in;">
<p class="MsoNormal" style="text-align: justify;">(b)</p>
</td>
</tr>
<tr>
<td style="padding: 0in;" colspan="2">
<p style="text-align: justify;">Figure 2. The   experimental two-view panoramic camera shown with only four sensors. (a) A   schematic showing double mirror pyramid with the four physical cameras   associated with two faces, two per face each corresponding to one of the two   viewpoints. (b) The physical implementation with four sensors (conventional   cameras) whose reflections can be seen inside the two associated mirror   faces.</p>
<p style="text-align: justify;">
</td>
</tr>
</tbody>
</table>
</div>
<p class="MsoNormal" style="text-align: justify;"><span style="display: none;"> </span></p>
<div>
<table class="MsoNormalTable" style="width: 480pt;" border="0" cellpadding="0" width="640">
<tbody>
<tr>
<td style="padding: 0in;">
<h2 style="margin-left: 0in; text-align: justify;"><a name="Results"></a><span style="color: #000000;"><span style="font-size: 13.5pt;">Results</span></span></h2>
</td>
<td style="padding: 0in;">
<p class="MsoNormal" style="text-align: justify;"><span style="color: black;"> </span></p>
</td>
<td style="padding: 0in;">
<p style="text-align: justify;">
</td>
</tr>
<tr style="height: 114pt;">
<td style="padding: 0in; height: 114pt;" valign="bottom">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1030" src="../../newpubs/multiDPC/summary_files/image006.jpg" border="0" alt="" width="200" height="150" /></p>
</td>
<td style="padding: 0in; height: 114pt;" valign="bottom">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1031" src="../../newpubs/multiDPC/summary_files/image007.jpg" border="0" alt="" width="200" height="150" /></p>
</td>
<td style="padding: 0in; height: 114pt;" rowspan="2" valign="bottom">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1032" src="../../newpubs/multiDPC/summary_files/image008.jpg" border="0" alt="" width="259" height="300" /></p>
</td>
</tr>
<tr style="height: 114pt;">
<td style="padding: 0in; height: 114pt;">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1033" src="../../newpubs/multiDPC/summary_files/image009.jpg" border="0" alt="" width="200" height="150" /></p>
</td>
<td style="padding: 0in; height: 114pt;">
<p class="MsoNormal" style="text-align: justify;"><img id="_x0000_i1034" src="../../newpubs/multiDPC/summary_files/image010.jpg" border="0" alt="" width="200" height="150" /></p>
</td>
</tr>
<tr style="height: 0.25in;">
<td style="padding: 0in; height: 0.25in;" colspan="2">
<p class="MsoNormal" style="text-align: justify;">(a)</p>
</td>
<td style="padding: 0in; height: 0.25in;" valign="bottom">
<p class="MsoNormal" style="text-align: justify;">(b)</p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding: 0in; height: 13.5pt;" colspan="3">
<p style="text-align: justify;">Figure 3. (a) The   four images captured by the four sensors (conventional cameras). (b) The   mosaic of four images. These mosaic will  be 360-degree wide when all   physical cameras are present instead of just the four used here.</p>
</td>
</tr>
</tbody>
</table>
</div>
<p>Kar-Han Tan, Hong Hua, and Narendra Ahuja. Multiview Mirror Pyramid Cameras.<br />
IEEE Transactions in Pattern Analysis and Machine Intelligence. To appear.<br />
Download Manuscript: [<a href="../../%7Etankh/pam_tpami_techreport.pdf">PDF</a>]<br />
PowerPoint: [<a href="../../%7Etankh/Camera/karhan_tan_camera.zip">ZIP</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=305</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>High-Resolution Double Pyramid Panoramic Cameras</title>
		<link>http://vision.ai.uiuc.edu/?p=303</link>
		<comments>http://vision.ai.uiuc.edu/?p=303#comments</comments>
		<pubDate>Thu, 07 May 2009 05:41:54 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[pyramid cameras]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=303</guid>
		<description><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/pyramid.jpg" border="2" alt="Image not available" width="95" height="103" /></span></p>
<p class="MsoNormal" style="text-align: justify; text-indent: 0.25in;">High-resolution panoramic capture is highly desirable in many applications such as immersive virtual environments, tele-conferencing, surveillance, and robot navigation. In addition, a single viewpoint for all viewing directions, a large depth-of-field (omni-focus), and real-time acquisition are desired in some imaging&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/pyramid.jpg" border="2" alt="Image not available" width="95" height="103" /></span></p>
<p class="MsoNormal" style="text-align: justify; text-indent: 0.25in;">High-resolution panoramic capture is highly desirable in many applications such as immersive virtual environments, tele-conferencing, surveillance, and robot navigation. In addition, a single viewpoint for all viewing directions, a large depth-of-field (omni-focus), and real-time acquisition are desired in some imaging applications (e.g. 3D reconstruction and rendering). The FOV of a conventional camera is limited by the size of its sensor and the focal length of its lens. For example, a typical 16mm lens with 2/3&#8243; CCD sensor has a <img src="../../newpubs/singleDPC/summary_files/image002.gif" alt="" width="53" height="20" />FOV. The number of pixels on the sensor (640 x 480 for NTSC camera) determines the resolution. The depth-of-field is limited and is determined by various imaging parameters such as aperture, focal length, and the scene location of the object.<!--[if gte vml 1]><v:shapetype id="_x0000_t75"  coordsize="21600,21600" o:spt="75" o:preferrelative="t" path="m@4@5l@4@11@9@11@9@5xe"  filled="f" stroked="f"> <v:stroke joinstyle="miter" /> <v:formulas> <v:f eqn="if lineDrawn pixelLineWidth 0" /> <v:f eqn="sum @0 1 0" /> <v:f eqn="sum 0 0 @1" /> <v:f eqn="prod @2 1 2" /> <v:f eqn="prod @3 21600 pixelWidth" /> <v:f eqn="prod @3 21600 pixelHeight" /> <v:f eqn="sum @0 0 1" /> <v:f eqn="prod @6 1 2" /> <v:f eqn="prod @7 21600 pixelWidth" /> <v:f eqn="sum @8 21600 0" /> <v:f eqn="prod @7 21600 pixelHeight" /> <v:f eqn="sum @10 21600 0" /> </v:formulas> <v:path o:extrusionok="f" gradientshapeok="t" o:connecttype="rect" /> <o:lock v:ext="edit" aspectratio="t" /> </v:shapetype><v:shape id="_x0000_i1025" type="#_x0000_t75" style='width:39.75pt;  height:15pt' o:ole="" fillcolor="window"> <v:imagedata src="summary_files/image053.wmz" mce_src="summary_files/image053.wmz" o:title="" /> </v:shape><![endif]--><!--[if !vml]--><!--[endif]--><!--[if gte mso 9]><xml> <o:OLEObject Type="Embed" ProgID="Equation.3" ShapeID="_x0000_i1025"   DrawAspect="Content" ObjectID="_1179608497"> </o:OLEObject> </xml><![endif]--><!--[if gte vml 1]><v:shape  id="_x0000_i1026" type="#_x0000_t75" style='width:42pt;height:12pt' o:ole=""  fillcolor="window"> <v:imagedata src="summary_files/image054.wmz" mce_src="summary_files/image054.wmz" o:title="" /> </v:shape><![endif]--><!--[if !vml]--><!--[endif]--><!--[if gte mso 9]><xml> <o:OLEObject Type="Embed" ProgID="Equation.3" ShapeID="_x0000_i1026"   DrawAspect="Content" ObjectID="_1179608498"> </o:OLEObject> </xml><![endif]--></p>
<p style="text-align: justify; text-indent: 0.25in;">Many approaches have been presented to achieve various subsets of these properties: wide FOV, high resolution, large depth-of-field, a single viewpoint, and real-time acquisition. Among these, mirror-pyramid (MP)-based camera systems offer a promising approach to capturing high-resolution, <span id="more-303"></span>wide-FOV panoramas as they provide single-viewpoint images at video rate. Such systems use planar mirrors assembled in pyramid or prism shapes, and as many cameras as the number of mirror faces, each located and oriented to capture the part of the scene reflected off one of the flat mirror faces. Images from the individual cameras are concatenated to yield a 360-degree wide panoramic image. Compared to designs using parabolic or hyperbolic mirrors, flat mirrors are easier to design and produce, and they introduce minimal optical aberrations.</p>
<p style="text-align: justify; text-indent: 0.25in;">We have developed a double-mirror-pyramid design that doubles the size of the visual field of the single-pyramid based systems. With this prototype, we have developed methods for optimally choosing the parameters of MP-based camera systems, e.g., camera placement, pyramid geometry, sensor usage, and uniformity of image resolution, and how the resultant image quality can be evaluated.</p>
<p style="text-align: justify; text-indent: 0.25in;">
<p>2. Overview of panoramic imaging</p>
<p style="text-indent: 0.25in;">The existing methods of capturing panoramas fall into one of the two categories: dioptric methods, where only refractive elements (lenses) are employed, and catadioptric methods, where a combination of reflective and refractive components is used. Typical dioptric systems include: the camera cluster method where multiple cameras point in different directions to cover a wide FOV; the fisheye method where a single camera acquires a wide FOV image through a fisheye lens; and the rotating camera method where a conventional camera pans to generate mosaics, or a camera with a non-frontal, tilted sensor pans around its viewpoint to acquire panoramic omni-focused images. The catadioptric methods include: sensors in which a single camera captures the scene as reflected off a single curved mirror, or sensors in which multiple cameras image the scene as reflected off the planar mirror surfaces.</p>
<p style="text-indent: 0.25in;">The dioptric camera clusters are capable of capturing high-resolution panoramas at video rate. However, the cameras in these clustersdue to physical constraints, which makes it difficult or even impossible to mosaic individual images to form a true panoramic view, while apparent continuity across images may be achieved by ad hoc image blending. The sensors with fisheye lens are able to deliver large FOV images at video rate, but suffer from low resolution, irreversible distortion for close-by objects, and non-unique viewpoints for different portions of the FOV.� The rotating cameras deliver high-resolution wide FOV via panning, as well as omni-focus when used in conjunction with non-frontal imaging, but they have limited vertical FOV. Furthermore, because they sequentially capture different parts of the FOV, moving objects may be imaged incorrectly. typically do not share a unique viewpoint</p>
<p style="text-indent: 0.25in;">The catadioptric sensors that use a parabolic- or a hyperbolic-mirror to map an omni-directional view onto a single sensor are able to achieve a single viewpoint at video rate, but the resolution of the acquired image is limited to that of the sensor used and varies significantly with the viewing direction across the visual fields. Analogous to the dioptric case, this resolution problem can be alleviated partially by replacing the simultaneous imaging of the entire FOV with panning and sequential imaging of its parts, followed by mosaicing the images, at the expense of video rate. Another category of the catadioptric sensors employs a number of planar mirrors assembled in the shape of right mirror-pyramids, together with as many cameras as the number of pyramid faces. Each of these cameras, capturing the part of the scene reflected off one of the faces, is located and oriented strategically such that the mirror images of their viewpoints are co-located at a single point inside the pyramid. Effectively, this creates a virtual camera that captures wide-FOV, high-resolution panorama at video rate.</p>
<p style="text-align: justify;">Proposed Double-Mirror-Pyramid Camera</p>
<p class="MsoNormal" style="text-align: justify; text-indent: 0.25in;">The main challenge in constructing a panoramic camera from multiple sensors is to co-locate the entrance pupils of the multiple cameras so that adjacent cameras cover contiguous FOV without obstructing the view of other cameras or their own. Nalwa first used a right mirror pyramid (MP) formed from planar mirrors for this purpose. He reported an implementation using a 4-sided right pyramid and 4 cameras. The pyramid stands on its horizontal base. Each triangular face forms a 45-degree angle with the base. The cameras are positioned in the horizontal plane that contains the pyramid&#8217;s vertex such that the entrance pupil of each camera is equidistant from the vertex and the mirror images of the entrance pupils coincide at a common point, C, on the axis of the pyramid. The cameras are pointed vertically downward at the pyramid faces such that the virtual optical axes of the cameras are all contained in a plane parallel to the pyramid base, effectively viewing the world horizontally outward from the common virtual viewpoint <img src="../../newpubs/singleDPC/summary_files/image006.gif" alt="" width="15" height="16" />.<!--[if gte vml 1]><v:shape id="_x0000_i1027"  type="#_x0000_t75" style='width:11.25pt;height:12pt' o:ole="" fillcolor="window"> <v:imagedata src="summary_files/image055.wmz" mce_src="summary_files/image055.wmz" o:title="" /> </v:shape><![endif]--><!--[if !vml]--><!--[endif]--><!--[if gte mso 9]><xml> <o:OLEObject Type="Embed" ProgID="Equation.3" ShapeID="_x0000_i1027"   DrawAspect="Content" ObjectID="_1179608499"> </o:OLEObject> </xml><![endif]--></p>
<p style="text-align: justify; text-indent: 0.25in;">The vertical dimension of the panoramic FOV in each of the aforementioned cases is the same as that of each of the cameras used � only their horizontal FOVs are concatenated to obtain a wider, panoramic view. We have developed a panoramic design that uses a dual mirror-pyramid (DMP), formed by joining two mirror-pyramids such that their bases coincide (Fig. 2), together with two layers of camera clusters.� Such a DMP-based design thus doubles the vertical FOV while preserving the ability to acquire panoramic high-resolution images from an apparent single viewpoint at video rate.</p>
<p><a name="OLE_LINK2"></a></p>
<li>Hong Hua, Narendra Ahuja, A High-Resolution Panoramic Camera , Computer Vision and Pattern Recognition ( CVPR&#8217;01) &#8211; Volume 1, pp. 960 ~ 967 <a href="../../publications/PAM_cvpr01_hua.pdf">Full Text<br />
</a></li>
<li>K.-H. Tan, H. Hua and N. Ahuja, Multiview Panoramic Cameras Using Mirror Pyramids, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 26, No. 6, June 2004. <a href="../../%7Etankh/pam_tpami_techreport.pdf">Full Text</a></li>
<li>K.-H. Tan, H. Hua and N. Ahuja, Multiview Mirror Pyramid-based Panoramic Cameras, Proceedings of the IEEE Workshop on Omnidirectional Vision (Omnivis) , June 2002, Copenhagen, Denmark, 87-93.  <a href="http://vision.ai.uiuc.edu/~tankh/Camera/multiview_pam.pdf">Full Text</a></li>
<input id="gwProxy" type="hidden" />
<p><!--Session data--><br />
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden"><!--Session data--></input>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=303</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Split Aperture Imaging</title>
		<link>http://vision.ai.uiuc.edu/?p=299</link>
		<comments>http://vision.ai.uiuc.edu/?p=299#comments</comments>
		<pubDate>Thu, 07 May 2009 05:37:51 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[Split Aperture Imaging]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=299</guid>
		<description><![CDATA[<p class="MsoBodyTextIndent3" style="text-align: justify;">
</p><p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj2_3_1_icon.jpg" border="2" alt="" width="135" height="135" /></p>
<p>Standard imaging sensors have limited dynamic range and hence are sensitive to only a part of the illumination range present in a natural scene. The dynamic range can be improved by acquiring multiple images of the same scene under different&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p class="MsoBodyTextIndent3" style="text-align: justify;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj2_3_1_icon.jpg" border="2" alt="" width="135" height="135" /></p>
<p>Standard imaging sensors have limited dynamic range and hence are sensitive to only a part of the illumination range present in a natural scene. The dynamic range can be improved by acquiring multiple images of the same scene under different exposure settings and then combining them. We have developed a multi-sensor camera design, called Split-Aperture Camera, to acquire registered, multiple images of a scene, at different exposure, from a single viewpoint, and at video-rate. The resulting multiple exposure images are then used to construct a high dynamic range image.</p>
<p class="MsoNormal" style="text-align: justify; text-indent: 0.5in;">There are three main steps to composing the high dynamic range image. First, we transform the recorded intensities by each sensor into the actual sensor irradiance values. This mapping can be obtained using radiometric calibration techniques applicable to normal cameras. Second, since the irradiance at corresponding points on different sensors can be different, we need a correction factor to represent a scene point by a unique value independent of the sensor where it gets imaged. This factor is spatially variant and it is different for different sensors. The third and the last step is fusing the intensity transformed images into a single high dynamic range mosaic. For every pixel on a canvas (an empty image of same dimensions as any of the sensors), we have a set of transformed intensity values one from each of the images. We discard the values from images in which those locations were either saturated or clipped. Since, the values not discarded may be noisy, we combine them to obtain the final value.<span id="more-299"></span></p>
<p class="MsoNormal" style="text-align: justify; text-indent: 0.5in;">To build a prototype, we used a pyramid beam-splitter which is the corner of a mirror cube (3-face pyramid) and three sensors used were Sony monochrome board cameras CCB-ME37. The glass cube corners are commercially available and marketed as solid retroreflectors. The triangular surfaces were coated with a metallic coating such as aluminum to obtain the three desired reflective surfaces. We designed a special lens whose aperture is located just behind the lens, and aligned the pyramid with the optical axis with its tip at the center of the aperture. The position of the sensors were carefully calibrated to ensure that all the sensors were normal to the split optical axes, equidistant from the tip of the pyramid and images from all sensors overlaid exactly on top of each other. This arrangement ensures that distribution of light across the three sensors is independent of the 3D coordinates of the objects being imaged. We used thin-film neutral density filters with transmittances 1, 0.5 and 0.25 in front of the sensors to obtain images capturing different parts of the illumination range. The frame grabber used was Matrox multichannel board capable of synchronizing and capturing three channels simultaneously.</p>
<p class="MsoNormal" style="text-align: justify; text-indent: 0.5in;">Figure 1 below shows the prototype built. Figures 2 and 3 show samples of images acquired by the prototype.</p>
<p class="MsoNormal" style="text-align: justify; text-indent: 0.5in;">
<p class="MsoNormal" style="text-align: justify;">
<p class="MsoNormal" style="text-align: center;" align="center"><img id="_x0000_i1025" src="../../newpubs/splitaperture/summary_files/image001.jpg" alt="" width="226" height="205" /></p>
<p class="MsoCaption" style="text-align: center;" align="center">Figure 1: Photograph of a prototype of the high dynamic range Split Aperture Camera.</p>
<p class="MsoNormal" style="text-align: justify;">
<p class="MsoNormal" style="text-align: center; text-indent: 0.5in;" align="center"><img id="_x0000_i1026" src="../../newpubs/splitaperture/summary_files/image002.jpg" alt="" width="254" height="189" /> <img id="_x0000_i1027" src="../../newpubs/splitaperture/summary_files/image003.jpg" alt="" width="256" height="190" /></p>
<p class="MsoNormal" style="text-align: center; text-indent: 0.5in;" align="center">(a)                                                                     (b)</p>
<p class="MsoNormal" style="text-align: center; text-indent: 0.5in;" align="center"><img id="_x0000_i1028" src="../../newpubs/splitaperture/summary_files/image004.jpg" alt="" width="254" height="189" /> <img id="_x0000_i1029" src="../../newpubs/splitaperture/summary_files/image005.jpg" alt="" width="252" height="188" /></p>
<p class="MsoNormal" style="text-align: center; text-indent: 0.5in;" align="center">(c)                                                                    (d)</p>
<p class="MsoCaption" style="text-align: justify;">Figure 2: (a)-(c) The three images of a parking lot obtained by the high dynamic range Split Aperture Camera, employing three 8-bit sensors. Using the three filters yields three images, having brightness values in ratios 1:2:4. (d) The resulting high dynamic range image; the intensity range in this image has been compressed to 0-255 using nonlinear mapping for display purposes here.</p>
<p class="MsoNormal" style="text-align: justify;">
<p class="MsoNormal" style="text-align: center;" align="center"><img id="_x0000_i1030" src="../../newpubs/splitaperture/summary_files/image006.gif" alt="" width="156" height="116" /> <img id="_x0000_i1031" src="../../newpubs/splitaperture/summary_files/image007.gif" alt="" width="156" height="116" /> <img id="_x0000_i1032" src="../../newpubs/splitaperture/summary_files/image008.gif" alt="" width="156" height="116" /></p>
<p class="MsoNormal" style="text-align: justify;">
<p class="MsoNormal" style="text-align: center;" align="center"><img id="_x0000_i1033" src="../../newpubs/splitaperture/summary_files/image009.gif" alt="" width="156" height="116" /> <img id="_x0000_i1034" src="../../newpubs/splitaperture/summary_files/image010.gif" alt="" width="156" height="116" /></p>
<p class="MsoCaption" style="text-align: justify;">Figure 3: A sequence of frames in a high dynamic range video sequence of a parking lot, generated by fusing the three multi-exposure input sequences at video rate.</p>
<p><!--more--></p>
<p align="justify">M. Aggarwal and N. Ahuja, Split Aperture Imaging for       High Dynamic Range, Proc. International Conference on Computer Vision,       Vancouver, Canada, July 2001, 10-17. <a href="../../manoj/multi_aggarwal.pdf">Full Text<br />
</a></p>
<p>M. Aggarwal and N. Ahuja, Split Aperture Imaging for High Dynamic Range, International Journal on Computer Vision, Vol. 58, No. 1, June 2004, 7-17.  <a href="http://vision.ai.uiuc.edu/manoj/multi_aggarwal.pdf">Full Text</a></p>
<input id="gwProxy" type="hidden"><!--Session data--></input>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=299</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Omnifocus Imaging Using Graph Cuts</title>
		<link>http://vision.ai.uiuc.edu/?p=291</link>
		<comments>http://vision.ai.uiuc.edu/?p=291#comments</comments>
		<pubDate>Thu, 07 May 2009 05:26:25 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[NICAM]]></category>
		<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[new imaging model]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=291</guid>
		<description><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/seamless.jpg" border="2" alt="Image not available" width="112" height="81" /></span></p>
<p>We discuss how to generate omnifocus images from a sequence of different focal setting images. We first show that the existing focus measures would encounter difficulty when detecting which frame is most focused for pixels in the regions between intensity&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 2px solid black;" src="../../project_new/seamless.jpg" border="2" alt="Image not available" width="112" height="81" /></span></p>
<p>We discuss how to generate omnifocus images from a sequence of different focal setting images. We first show that the existing focus measures would encounter difficulty when detecting which frame is most focused for pixels in the regions between intensity edges and uniform areas. Then we propose a new focus measure that could be used to handle this problem. In addition, after computing focus measures for every pixel in all images, we construct a three dimensional (3D) node-capacitated graph and apply a graph cut based optimization method to estimate a spatio-focus surface that minimizes the summation of the new focus measure values on this surface. An omnifocus image can be directly generated from this minimal spatio-focus surface. Experimental results with simulated and real scenes are provided.</p>
<p><span id="more-291"></span>Ning Xu and Narendra Ahuja. Generating Omnifocus Images Using Graph Cuts and a New Focus Measure. Pattern Recognition, 17th International Conference on (ICPR&#8217;04) Volume 4. pp 697-700, 08, 2004 Cambridge UK. <a href="http://ieeexplore.ieee.org/xpl/abs_free.jsp?arNumber=1333868">Full Text</a></p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=291</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Panoramic Imaging with Infinite Dynamic Range</title>
		<link>http://vision.ai.uiuc.edu/?p=289</link>
		<comments>http://vision.ai.uiuc.edu/?p=289#comments</comments>
		<pubDate>Thu, 07 May 2009 05:24:37 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[NICAM]]></category>
		<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=289</guid>
		<description><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 0pt none;" src="../../icons2/proj2_2_1_b_icon.jpg" border="0" alt="" width="159" height="49" /></span></p>
<p><span style="font-family: Book Antiqua,Times New Roman,Times;"> </span> <span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 0pt none;" src="../../icons2/proj2_2_1_a_icon.jpg" border="0" alt="" width="160" height="48" /></span></p>
<p style="text-align: justify;">Most imaging sensors have a limited dynamic range and hence can satisfactorily respond to only a part of illumination levels present in a scene. This is particularly disadvantageous for omnidirectional and panoramic cameras since larger fields of view have&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p align="center"><span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 0pt none;" src="../../icons2/proj2_2_1_b_icon.jpg" border="0" alt="" width="159" height="49" /><!--mstheme--></span></p>
<p><!--mstheme--><span style="font-family: Book Antiqua,Times New Roman,Times;"> </span> <span style="font-family: Book Antiqua,Times New Roman,Times;"><img class="alignleft" style="border: 0pt none;" src="../../icons2/proj2_2_1_a_icon.jpg" border="0" alt="" width="160" height="48" /></span></p>
<p style="text-align: justify;">Most imaging sensors have a limited dynamic range and hence can satisfactorily respond to only a part of illumination levels present in a scene. This is particularly disadvantageous for omnidirectional and panoramic cameras since larger fields of view have larger brightness ranges. We propose a simple modification to existing high resolution omnidirectional/panoramic cameras in which the process of increasing the dynamic range is coupled with the process of increasing the field of view. This is achieved by placing a graded transparency(mask) in front of the sensor which allows every scene point to be imaged under multiple exposure settings as the camera pans, a process anyway required to capture large fields of view at high resolution. The sequence of images are then mosaiced to construct a high resolution,high dynamic range panoramic/omnidirectional image.Our method is robust to alignment errors between the mask and the sensor grid and does not require the mask to be placed on the sensing surface. We have designed a panoramic camera with the proposed modifications and have discussed various theoretical and practical issues encountered in obtaining a robust design. We show with an example of high resolution, high dynamic range panoramic image obtained from the camera we designed.</p>
<p><span id="more-289"></span></p>
<p align="justify">M. Aggarwal and N. Ahuja, High Dynamic Range Panoramic Imaging, Proc. International Conference on Computer Vision, Vancouver, Canada, July 2001, 2-9. <a href="../../manoj/dynamic_aggarwal.pdf">Full Text<br />
</a></p>
<input id="gwProxy" type="hidden"><!--Session data--></input>
<input id="jsProxy" onclick="jsCall();" type="hidden" />
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=289</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Near Omnifocused Imaging of Scenes with limited motion</title>
		<link>http://vision.ai.uiuc.edu/?p=278</link>
		<comments>http://vision.ai.uiuc.edu/?p=278#comments</comments>
		<pubDate>Thu, 07 May 2009 05:15:12 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[NICAM]]></category>
		<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=278</guid>
		<description><![CDATA[<p><span style="font-size: 11pt;"><span> </span><img class="alignleft" style="border: 0pt none;" src="../../newpubs/omnifocus_motion_files/image002.gif" border="0" alt="" width="101" height="62" /></span>Nicam achieves omnifocus   panoramic imaging only for static scenes since the camera pans across the   visual field. As panning takes time, successive images of the same moving   object are from different effective viewpoints. Fusion of these images for   omnifocus leads&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p><span style="font-size: 11pt;"><!--[endif]--><span> </span><!--[if gte vml 1]><v:shape id="_x0000_i1028"    type="#_x0000_t75" style='width:203.25pt;height:126pt' fillcolor="window"> <v:imagedata src="omnifocus_motion_files/image002.gif" mce_src="omnifocus_motion_files/image002.gif" o:title="manoj-focus" /> </v:shape><![endif]--><!--[if !vml]--><img class="alignleft" style="border: 0pt none;" src="../../newpubs/omnifocus_motion_files/image002.gif" border="0" alt="" width="101" height="62" /></span>Nicam achieves omnifocus   panoramic imaging only for static scenes since the camera pans across the   visual field. As panning takes time, successive images of the same moving   object are from different effective viewpoints. Fusion of these images for   omnifocus leads to registration problems. A straightforward extension to   image dynamic scenes in omnifocus would require elimination of panning. We   have developed an intermediate solution, which retains panning but yields   objects imaged in less than perfect focus.</p>
<p>Recall that the mosaicing the   set of images taken by Nicam as it pans requires correspondence of a scene   point across the set. The presence of moving objects upsets the   correspondence between images in the sequence, resulting in a distorted   appearance of the moving objects in the final mosaic (see part (a) of the   figure). We avoid these artifacts and create large depth of field mosaics of   scenes with moving objects. The basic idea is to combine the sequence of   images in a manner such<span id="more-278"></span> that each entire moving object is selected from a   single best focused frame and is pasted, as it is, in the final mosaic, and   other regions are chosen and pasted similarly from those images where they   are best focused and not occluded by the moving objects. This approach   ensures that all regions, except possibly those related to moving objects,   are best focused and no artifacts due to moving objects are present.</p>
<p>The necessary algorithms require segmenting the moving   objects in each image, selecting where is it overall in best focus, and for   each still scene point determining in which frame it is best focused. The   algorithms use a set of tests to perform the segmentation of moving objects.   The tests that we use are point intensity difference test, point correlation   test, and focus trajectory test. The point difference and point correlation   tests measure how image intensity and local texture vary over the sequence of   images (the correspondence between images is known for still scene points).   If the variation is small between adjacent frames the point is more likely to   be static. The focus trajectory test checks for variation in the focus   measure across the image sequence against expected behavior. For instance,   for a still scene point variation in focus measure is expected to be   unimodal, while for a moving scene point it will be random, since the   correspondences between images for such points are incorrect. One result thus   obtained is shown in the figure below.</p>
<p style="margin: 5.65pt 1.5pt 5pt 5.65pt; text-align: justify;">
<p style="margin: 5.65pt 1.5pt 5pt 5.65pt; text-align: center;" align="center"><!--[if gte vml 1]><v:shapetype id="_x0000_t75"    coordsize="21600,21600" o:spt="75" o:preferrelative="t" path="m@4@5l@4@11@9@11@9@5xe"    filled="f" stroked="f"> <v:stroke joinstyle="miter" /> <v:formulas> <v:f eqn="if lineDrawn pixelLineWidth 0" /> <v:f eqn="sum @0 1 0" /> <v:f eqn="sum 0 0 @1" /> <v:f eqn="prod @2 1 2" /> <v:f eqn="prod @3 21600 pixelWidth" /> <v:f eqn="prod @3 21600 pixelHeight" /> <v:f eqn="sum @0 0 1" /> <v:f eqn="prod @6 1 2" /> <v:f eqn="prod @7 21600 pixelWidth" /> <v:f eqn="sum @8 21600 0" /> <v:f eqn="prod @7 21600 pixelHeight" /> <v:f eqn="sum @10 21600 0" /> </v:formulas> <v:path o:extrusionok="f" gradientshapeok="t" o:connecttype="rect" /> <o:lock v:ext="edit" aspectratio="t" /> </v:shapetype><v:shape id="_x0000_i1027" type="#_x0000_t75" style='width:203.25pt;    height:126pt' fillcolor="window"> <v:imagedata src="omnifocus_motion_files/image001.gif" mce_src="omnifocus_motion_files/image001.gif" o:title="arun-focus" /> </v:shape><![endif]--><!--[if !vml]--><img src="../../newpubs/omnifocus_motion_files/image001.gif" border="0" alt="" width="271" height="168" /><!--[endif]--> <!--[if gte vml 1]><v:shape id="_x0000_i1028"    type="#_x0000_t75" style='width:203.25pt;height:126pt' fillcolor="window"> <v:imagedata src="omnifocus_motion_files/image002.gif" mce_src="omnifocus_motion_files/image002.gif" o:title="manoj-focus" /> </v:shape><![endif]--><!--[if !vml]--><img src="../../newpubs/omnifocus_motion_files/image002.gif" border="0" alt="" width="271" height="168" /><!--[endif]--></p>
<p style="margin: 5.65pt 1.5pt 5pt 310.5pt; text-indent: -217.5pt;"><!--[if !supportLists]--></p>
<pre>                                         (a)                                        (b)</pre>
<p style="margin: 5.65pt 1.5pt 0.0001pt 5.65pt; text-align: justify;">An omnifocus image   acquired in the presence of moving objects: (a) Nicam approach. (b) New   intermediate approach.</p>
]]></content:encoded>
			<wfw:commentRss>http://vision.ai.uiuc.edu/?feed=rss2&amp;p=278</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Omnifocus Nonfrontal Imaging Camera</title>
		<link>http://vision.ai.uiuc.edu/?p=268</link>
		<comments>http://vision.ai.uiuc.edu/?p=268#comments</comments>
		<pubDate>Thu, 07 May 2009 05:01:03 +0000</pubDate>
		<dc:creator>sanketh</dc:creator>
				<category><![CDATA[NICAM]]></category>
		<category><![CDATA[Next Generation Cameras]]></category>
		<category><![CDATA[Projects]]></category>

		<guid isPermaLink="false">http://vision.ai.uiuc.edu/wordpress/?p=268</guid>
		<description><![CDATA[<p style="text-align: justify;">
</p><p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj2_1_1_icon.jpg" border="2" alt="" width="160" height="153" /></p>
<p>The concept of omnifocus nonfrontal imaging camera, OMNICAM or NICAM, initiated a new chapter in imaging and digital cameras. NICAM has introduced hitherto nonexistent imaging capabilities, in addition to overcoming some problems with previous methods. NICAM is capable of acquiring&#8230;</p>]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">
<p align="center"><img class="alignleft" style="border: 2px solid black;" src="../../icons2/proj2_1_1_icon.jpg" border="2" alt="" width="160" height="153" /></p>
<p>The concept of omnifocus nonfrontal imaging camera, OMNICAM or NICAM, initiated a new chapter in imaging and digital cameras. NICAM has introduced hitherto nonexistent imaging capabilities, in addition to overcoming some problems with previous methods. NICAM is capable of acquiring seamless panoramic images and range estimates of wide scenes with all objects in focus, regardless of their locations. To understand the impact of NICAM, first consider imaging with conventional <span class="SpellE">cameras.The</span> camera&#8217;s field of view is generally much smaller than the entire visual field of interest. Consequently, the camera must pan across the scene of interest, focus on a part at a time, and acquire an image of each part. All the resulting images together then capture the complete scene. As   byproduct of focusing, the range of the objects in the scene <span class="GramE">can also be estimated</span>. Usual methods for focusing as well as range estimation from focusing mechanically relocate the sensor plane, thereby varying the focus distance setting in the camera. When a scene point appears in sharp focus, the corresponding depth and focus distance values satisfy the lens law. The depth for the scene point can then be calculated length and the focus distance.</p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><span id="more-268"></span><br />
The conventional methods therefore involve two mechanical (and hence slow) actions, those of panning<span class="GramE"> and</span>, for each chosen pan angle, focusing i.e. finding the best focus distance setting. The purpose of the first action is to accumulate data for the entire visual field from the camera&#8217;s narrower fields of view. This action is therefore essential. An innovation of <span class="SpellE">nonfrontal</span> imaging is in the elimination of the second action. The <span class="SpellE">nonfrontal</span> imaging camera has a sensor <span class="GramE">plane which</span> is not perpendicular to the optical axis as is standard. This imaging geometry eliminates the time consuming mechanical translation of the sensor plane. Camera panning, required for panoramic viewing anyway, in addition enables focusing. Further, a range-from-focus estimate for each visible scene point <span class="GramE">is also computed</span> as a by-product of identifying the sharpest image. Thus, from pan motion alone, <span class="SpellE">nonfrontal</span> imaging obtains a composite focused image of all objects/points in a wide scene regardless of their depths, which is in complete registration with a range map obtained in parallel. While it is well known that focus distance control yields both range and focused images, <span class="SpellE">nonfrontal</span> imaging has made it possible for the first time to realize this dual functionality simultaneously for all visible scene points. Further, this functionality is achieved passively, <span class="SpellE">i.e</span>, without any active illumination of the scene, e.g., using laser.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
Thus, <span class="SpellE">nonfrontal</span> imaging has the following novel capabilities: (<span class="SpellE">i</span>) <span class="GramE">It</span> provides panoramic (up to 360<sup>0</sup>) images of a scene without any visible seams. (ii) Each object in an image is in complete focus regardless of its location, i.e., that there is no need to explicitly perform the standard focusing action (accomplished mechanically in &#8220;manual&#8221; cameras and automatically in &#8220;automatic&#8221; cameras, but requiring mechanical movement in each case). (iii) Along with the sharp visual image, the camera also delivers the location (coordinates) of each focusable, visible scene point. One consequence of these capabilities is that a single <span class="SpellE"><span class="GramE">nonfrontal</span></span><span class="GramE"> imaging</span> camera can provide stereo pairs of images for three-dimensional, <span class="SpellE">omnifocused</span>, viewing of the entire scene in natural lighting. In fact, this visual 3D experience is even more informative in some ways than natural, human viewing of real world, since humans have finite depth of field while the NICAM driven display shows all parts in focus.</span></p>
<p class="MsoBodyText" style="margin-bottom: 12pt; text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
<strong>UNIQUENESS<br />
</strong><br />
<span class="SpellE">Nonfrontal</span> imaging represents qualitative leaps in what is feasible with the current technology. It<br />
makes it possible to achieve hitherto infeasible functionalities and performance levels in imaging. None of the available techniques can deliver seamless focused panoramic images. Most methods choose a scene point/object and bring it into focus <span class="GramE">by (manually or automatically) controlling</span> sensor location. Thus, they can focus at objects one by one. Similarly, they can estimate range from focus one object/point at a time. The following paragraphs describe the differences between NICAM and the various related existing technologies.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><strong><span style="font-size: 10pt; font-family: Arial;">Panoramic Images:</span></strong><span style="font-size: 10pt; font-family: Arial;"> In conventional photography, generating panoramic images has been more of an art, pursued by artists who take independent photographs and create a mosaic from them. There are some fundamental problems with this, however. Whenever <span class="GramE">an image is taken by the camera</span>, a choice of focus distance must be made. Usually, this <span class="GramE">is done</span> by imaging in focus that object which appears at the image center. This means that any objects at other distances in the camera&#8217;s field will not be in focus. In particular, the image borders may have different amounts of blur along them. Since in general different <span class="GramE">focus</span> settings are used to obtain photographs of contiguous scene parts, when they are <span class="SpellE">mosaiced</span> to form a panoramic image the discontinuities in the image sharpness of the scene parts straddling the borders give rise to seams. Thus in the panoramic image neither all objects are imaged in focus nor is the mosaic seamless. One may attempt to alleviate this problem by reducing the size of the camera&#8217;s visual field, but this does not eliminate the problem because the objects in the scene are not of the same shape as the camera&#8217;s visual field (e.g., rectangular) so they always straddle across image borders. Of course, the smaller the visual field size the larger is the number of images in the mosaic, which increases the seam density. In fact, a major issue in the construction of panoramic images has been how to process the mosaic to camouflage seams in order to avoid perceptual detection.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
<strong>Panoramic Cameras: </strong>A number of panoramic cameras <span class="GramE">have been designed</span> over the years for photographic applications. The scene scan <span class="GramE">is performed</span> by moving the camera mechanically, or pointing it at a special reflector surface such as a conical mirror. To image certain scene points in sharp focus, either the<span class="GramE"> points</span> have to be at a specified depth from the camera, or the depth of field of the camera must be made sufficiently large by a combination of reducing the aperture and increasing the focal length. These solutions are not acceptable since they require that the<span class="GramE"> ambient</span> light intensity levels be high or that the scene objects of interest  all lie in a narrow depth range.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
</span><strong><span style="font-size: 10pt; font-family: Arial;">Panoramic</span></strong><strong><span style="font-size: 10pt; font-family: Arial;"> </span></strong><strong><span style="font-size: 10pt; font-family: Arial;">Range</span></strong><strong><span style="font-size: 10pt; font-family: Arial;"> Acquisition: </span></strong><span style="font-size: 10pt; font-family: Arial;">Analogous to panoramic image acquisition, panoramic range acquisition methods are not very common. <span class="GramE">Typically</span> a narrow field of view device is pointed in different directions to obtain a panoramic range image. <span class="GramE">Or</span>, multiple devices are arranged in a circle such as in sonar sensor rings. Almost all methods are invasive, i.e., they involve scene illumination, e.g., using a laser beam or structured lighting.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
<strong>Patented Technologies:</strong> Most of the relevant </span><span style="font-size: 10pt; font-family: Arial;">U.S.</span><span style="font-size: 10pt; font-family: Arial;"> patents in the last 5 years have been filed by Japanese<span class="GramE"> corporations</span>. However, none of these <span class="GramE">come</span> close to the NICAM methodology. The closest idea to NICAM was in a patent filed for the Asahi <span class="SpellE">Kogaku</span> Kogyo Kabushiki Kaisha of </span><span style="font-size: 10pt; font-family: Arial;">Tokyo</span><span style="font-size: 10pt; font-family: Arial;">,<span class="GramE"> </span></span><span class="GramE"><span style="font-size: 10pt; font-family: Arial;">Japan</span></span><span style="font-size: 10pt; font-family: Arial;">. In this patent, a camera was described that could select one of three tilt angles between the CCD plane and the optical axis. The application suggested was in photographing one frame of a scene with two subjects. The range to the subjects<span class="GramE"> was</span> determined by an unspecified &#8220;range-determining means&#8221;. EMI Limited, </span><span style="font-size: 10pt; font-family: Arial;">England</span><span style="font-size: 10pt; font-family: Arial;"> has a patent on using a tilted plane sensor to determine the focus motor drive signal. Some of the other companies that hold patents in slightly related ideas are Olympus Optical Corporation, </span><span style="font-size: 10pt; font-family: Arial;">Japan</span><span style="font-size: 10pt; font-family: Arial;">, Canon Kabushiki Kaisha and Hitachi Ltd, </span><span style="font-size: 10pt; font-family: Arial;">Tokyo</span><span style="font-size: 10pt; font-family: Arial;">, </span><span style="font-size: 10pt; font-family: Arial;">Japan</span><span style="font-size: 10pt; font-family: Arial;">. A <span class="GramE">camera with a tilted sensor plane and a scanning mirror was also used by a researcher at Jet Propulsion Laboratory to determine the range of scene points</span>.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
<strong>SAMPLE IMAGES TO ILLUSTRATE PERFORMANCE</strong></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
Images 1-5 illustrate the performance of NICAM. Typically, a panoramic image <span class="GramE">is divided</span> into multiple rows, each showing the view over smaller than the entire angle covered. If the entire length of the panoramic image <span class="GramE">were printed</span> in a single row, the height will be reduced significantly. To avoid such excessive compression of detail, and to maximize legibility by using all available space, a complete panoramic view is divided into smaller contiguous <span class="SpellE">subangles</span>, and the corresponding <span class="SpellE">subimages</span> are shown in successive rows of the image. Thus, the right end of the row connects to the left end of the following row.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
Images 1, 2 and 3 are examples of <span class="SpellE">omnifocused</span> panoramic images acquired using NICAM. Image 1 shows a 100<sup>0</sup> view of an outdoor scene, shown split into two rows of 50<sup>0</sup> each. The objects in the scene are at a range of distances (flowers 1.5&#8242;, tree 4.5&#8242;, bench 8&#8242;, chair 30&#8242; and building 50&#8242; and larger) but all are imaged in focus and no seams exist across the entire panoramic view. Image <span class="GramE">2</span> shows a 60<sup>0</sup> view of a room inside the Beckman Institute where the distances range from a few feet to about 20 feet. Finally, Image 3 contains a 360<sup>0</sup> panoramic view of the Computer Vision and Robotics Laboratory in the Beckman Institute where object distances of 2 to 30 feet from the camera <span class="GramE">are indicated</span>.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
Image 4 demonstrates the <span class="SpellE">omnifocusing</span> performance of NICAM compared to the limited depth of field of a regular camera. The upper row in Image 4 (<span class="SpellE">i</span>) shows a 40<sup>0</sup> (angle chosen by user) <span class="SpellE">omnifocused</span> image acquired by NICAM and the lower row shows a 20<sup>0</sup> (angle dictated by the camera) view of the same scene acquired using a regular camera focused at 4&#8242; (a choice must be made as to which object to focus on). The progressive loss of focus for objects located closer or farther than the focused depth of 4&#8242; <span class="GramE">can be seen</span>. Image <span class="GramE">4</span> (ii) shows a &#8220;panoramic&#8221; view constructed from multiple images taken by a regular camera, by concatenating images of contiguous parts of the scene. Since the different images <span class="GramE">are taken</span> when the camera is focused at different objects, the borders between images give rise to seams. Further, since the camera <span class="GramE">is focused</span> on a specific object as each image is acquired, not all parts within even a single image are in focus. This <span class="GramE">should be contrasted</span> with the seamless, panoramic imaging capability of NICAM shown in Images 1-3.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
Image 5 (<span class="SpellE">i</span>) <span class="GramE">demonstrates</span> the impact of the range estimation capability of NICAM A pair of <span class="SpellE">contiguos</span> planar patches formed by wrapping newspaper on a step-like structure is placed in front of NICAM. The resulting <span class="SpellE">omnifocused</span> panoramic image <span class="GramE">is shown</span> in the first row. The second row shows the range estimation capability wherein the step structure recovered by NICAM <span class="GramE">is depicted</span>. The range estimate available for each pixel in the <span class="SpellE">omnifocused</span> image determines the position and irradiance of the corresponding scene point. The <span class="SpellE">omnifocused</span> image and the recovered shape <span class="GramE">are combined</span> to produce the 3D <span class="SpellE">omnifocused</span> step structure shown through a perspective view in the bottom row. If we assume that the irradiance due to this scene point is invariant for small perturbations of the viewpoint, then the intensity and range information <span class="GramE">can also be combined</span> to produce <span class="SpellE">pseudostereo</span> images as would be acquired by a pair of hypothetical cameras placed around NICAM. Such stereo images when viewed through a stereo mechanism, e.g. stereo glasses, depict the scene in full 3D, using data obtained by a single NICAM! Image <span class="GramE">5</span> (ii) shows such stereo pairs for three scenes. For ease of viewing, the left and right &#8220;eye&#8221; images have been color coded <span class="GramE">red and green</span> and overlapped; when viewed through the enclosed red and green glasses on the different eyes, the scenes can be seen in 3D and <span class="SpellE">omnifocused</span>.  The top left scene consists of two planes perpendicular to the line of sight, the nearer one at a distance of 2 ft. from the camera (right plane) and the farther one at a distance of 3 ft. (left plane). The top right scene contains a single chess piece at the distance 1 in. The bottom row shows multiple chess pieces placed at three different distances from the camera: 17 in., 20 in. and 25 in. All parts of all scenes are in focus and the 3D structure is visible through stereo viewing. It appears that such 3D depiction using a single visual </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span class="GramE"><span style="font-size: 10pt; font-family: Arial;">camera</span></span><span style="font-size: 10pt; font-family: Arial;"> has never been accomplished before!</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p style="text-align: justify;"><span style="font-size: 12pt; font-family: Arial;"><br style="page-break-before: always;" /> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Image 1: 100<sup>0</sup> Outdoor <span class="GramE">View</span></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1025" src="../../newpubs/nicam/summary_files/image001.jpg" alt="" width="624" height="178" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1026" src="../../newpubs/nicam/summary_files/image002.jpg" alt="" width="623" height="182" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Image 2: 60<sup>0</sup> Indoor <span class="GramE">View</span></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1027" src="../../newpubs/nicam/summary_files/image003.jpg" alt="" width="624" height="200" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1028" src="../../newpubs/nicam/summary_files/image004.jpg" alt="" width="624" height="196" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Image 3: 360<sup>0</sup> Indoor <span class="GramE">View</span></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1029" src="../../newpubs/nicam/summary_files/image005.jpg" alt="" width="624" height="332" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Image 4(<span class="SpellE">i</span>): 40<sup>0</sup> <span class="SpellE">Omnifocused</span> View</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1030" src="../../newpubs/nicam/summary_files/image006.jpg" alt="" width="624" height="234" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Image 4(<span class="SpellE">i</span>): 20<sup>0</sup> Standard Camera View of the Same Scene as in 4(<span class="SpellE">i</span>), Focused at 4 ft.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1031" src="../../newpubs/nicam/summary_files/image007.jpg" alt="" width="623" height="411" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Image 4(ii): Regular Camera Mosaic</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1032" src="../../newpubs/nicam/summary_files/image008.jpg" alt="" width="623" height="561" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Image 5(<span class="SpellE">i</span>): On the following page:</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1033" src="../../newpubs/nicam/summary_files/image009.jpg" alt="" width="408" height="863" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> Image 5(ii)</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><img id="_x0000_i1034" src="../../newpubs/nicam/summary_files/image010.jpg" alt="" width="623" height="468" /></span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p style="text-align: justify;"><span style="font-size: 12pt; font-family: Arial;"><br style="page-break-before: always;" /> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><strong><span style="font-size: 10pt; font-family: Arial;"> APPLICATIONS OF OMNICAM</span></strong></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Following are some examples of applications that depend on <span class="SpellE">NICAM&#8217;s</span> unique imaging capabilities.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
<strong>Photography: </strong>Suppose a photographer wishes to capture a scene around the </span><span style="font-size: 10pt; font-family: Arial;">Washington</span><span style="font-size: 10pt; font-family: Arial;"> monument. She must make two decisions before she can push the camera button. First, she must decide which part of the scene she wants to capture in the photograph, and then direct the camera to point in the desired direction using the appropriate zoom lens. Second, within the visible scene, she must determine whether the photograph should show the </span><span style="font-size: 10pt; font-family: Arial;">Washington</span><span style="font-size: 10pt; font-family: Arial;"> monument in sharp focus, or the trees in front, or the buildings behind, and set the focus control accordingly. The result will be a picture showing, say the monument, in sharp focus, and the lawn to the left and right as well as the trees and the buildings blurred. If she wants to show a wider scene than the camera&#8217;s field of view, she must take multiple pictures by moving the camera across the scene and then manually &#8220;pasting&#8221; the individual photographs to make a mosaic showing the large scene. Each picture in the mosaic will show a pre-selected object in sharp focus, with consecutive photographs taken, in general, with different focus settings. Consequently, the degree of blur across a picture boundary would change visibly, causing perceptible sharpness transitions across objects that happen to straddle inter-picture borders. Of course, even within each pasted image the objects other than those focused on appear blurred proportional to their relative distances from the focused object. Further, the entire process is very time consuming because of the required mechanical redirecting of cameras, mechanical focusing in each direction, and the subsequent cutting and pasting. Because of these difficulties, <span class="GramE">such <span class="SpellE">mosaicing</span> of images has been practiced as an art by photographers wherein they attempt to smooth out the <span class="SpellE">interimage</span> transitions</span>. Using the NICAM technology, the photographer could capture the </span><span style="font-size: 10pt; font-family: Arial;">Washington</span><span style="font-size: 10pt; font-family: Arial;"> monument, the buildings behind, the trees in front, the lawn and objects as far to the left and <span class="GramE">right</span> as she wants, up to a complete 360<sup>0 </sup>view, all of which would be completely focused in a single photograph. The significance of NICAM to nature photography is mentioned in a letter (Appendix A) written by a former staff photographer of the <em>National Geographic Magazine</em>. </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><strong><span style="font-size: 10pt; font-family: Arial;">Security and Surveillance:</span></strong><span style="font-size: 10pt; font-family: Arial;"> Another major application area is that of security systems and surveillance. For example, consider multiple cameras located at one or more posts outside a building to achieve visibility in all directions. Typically, a guard inside the building would monitor the images delivered by the cameras on separate monitors placed in front of him. The cameras of course show certain objects in focus while others, outside of the depth of field, appear blurred. The guard could control the cameras to focus at different objects in different directions, but that would only result in switching among the areas monitored best. If a fast panning NICAM is used, a single camera will replace the entire set of cameras and still obtain a focused image of all parts of the scene visible from the <span class="SpellE">watchpost</span>, with no loss of detail due to blurring. This image <span class="GramE">could be displayed</span> over a surrounding screen, as well as on the usual separate monitors. Further, the guard <span class="GramE">could also be shown</span> a 3D stereo display of the 360<sup>0</sup> surround, using a single NICAM. His display <span class="GramE">could be viewed</span> in 3D using a <span class="SpellE">headmounted</span> display. Alternatively, the guard could see it on a surrounding 360<sup>0</sup> screen, e.g., using stereo glasses, while being able to turn his head in any direction as if he were on the <span class="SpellE">watchpost</span> itself. </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;">Analogously, inside a building, a small number of <span class="SpellE">NICAM&#8217;s</span> can cover the entire premises to <span class="GramE">be monitored</span>, instead of a much larger number of normal cameras. For example, a 360<sup>0</sup> view of a large building lobby may be covered by a few <span class="SpellE">NICAM&#8217;s</span> instead of, say <span class="GramE">15,</span> ordinary cameras which may still not cover as much depth as NICAM. When located inside a nuclear plant or another hazardous area, clear views of the entire scene can facilitate much faster <span class="SpellE">teleoperation</span> with no adjustment of the camera.</span></p>
<p class="MsoBodyText" style="text-align: justify;"><span style="font-size: 10pt; font-family: Arial;"><br />
<strong>Surgery: </strong>In <span class="SpellE">endoscopic</span> or <span class="SpellE">laproscopic</span> surgery, it is a common problem that the surgeon cannot clearly see the interior body structure <span class="GramE">in the vicinity of</span> the area under operation which leads to imprecision in surgery. A miniature NICAM could imag