Michael Lindenbaum




Computer Science Department
Technion, Israel Institute of Technology
Haifa 32000, Israel.

Tel: +972-4-8294331
Fax: +972-4-8293900
Home page: http://www.cs.technion.ac.il/~mic/
Email address: mic at cs.technion.ac.il



Research ---- Teaching

Main Research Topics and some Publications

Performance Prediction, Attention, Recognition, Grouping, Image Processing, Learning, Robotics


Performance Prediction

My main research interest is computer vision, with a focus on the analysis and prediction of visual process performance. Performance is usually evaluated empirically, by implementing the algorithm and testing it on real images. I am interested in developing methods for statistically modeling the image data and using these models to analytically predict the performance of algorithm. In order for the predictions to be useful, the models must be simple enough for analysis but also realistic. The analysis tells us what to expect from an algorithm's performance and what level of performance can be achieved. My work on performance analysis began with theoretical models (Geometric Probing) but then moved on to more realistic domains. See some examples of Recognition, Grouping, and recently, Attention.


Attention

Visual search is required in situations where a person or a machine views a scene with the goal of finding one or more familiar entities. The highly effective visual search (or, more generally, attention) mechanisms in the human visual system were extensively studied from the viewpoints of psychophysics and physiology and were also used as models for computer vision implementations.

Taking a stochastic approach, we modeled the target presence in different image regions as a set of correlated random variables taking target/non-target values. We proposed an efficient search algorithm, based on linear estimation, which can handle both inner-scene similarity and top-down information, when available. Being interested in inherent limitations, we characterized the difficulty of search tasks using a metric-space cover (similar to Kolmogorov's epsilon-covering), and derived (tight) bounds on the performance of all search algorithms.

T. Avraham and M. Lindenbaum, Dynamic Visual Search using Inner Scene Similarity - Algorithms and Inherent Limitations. Proceedings of the 8th European Conference on Computer Vision, volume II, pages 58-70, 2004.

T. Avraham and M. Lindenbaum, Attention-based Dynamic Visual Search using Inner-scene Similarity: Algorithms and Bounds. IEEE Trans. Pattern Analysis and Machine Intelligence, 28(2):251-264, 2006..

It turned out that the predictions developed in the quantitative computer vision context are relevant also in the human vision context and that a search task is harder if its associated metric cover is larger.

Y. Yeshurun, T. Avraham and M. Lindenbaum, Predicting Visual-search Performance by Quantifying Stimuli Similarities. Journal of Vision, 8(4):1{22, 2008.



Object Recognition

Analysis of Object Recognition Tasks

In recognition performance, we first considered the relation between the number of data features collected from the image and the probability of success in the recognition task that relies on this data. In other words, we asked how much data is required to decide, with some pre-specified reliability, that some familiar object is in the scene. The answer depends on factors such as object similarity, transformation class, and noise. One of our papers suggests a way to measure the effect of object similarity, while another uses an integrated random model in the context of a complex cluttered scene. (See the figure above for an instance of the random model.)

M. Lindenbaum, Bounds on Shape Recognition Performance. IEEE Trans. on PAMI-17, No. 7, pp. 666-680, 1995.

M. Lindenbaum, An Integrated Model for Evaluating the Amount of Data Required for Reliable Recognition, IEEE Trans. on PAMI-19, No. 11, pp. 1251-1264, 1997.

Similar (but less tight) results may be derived by observing that the recognition task is equivalent to a learning task. Thus, we re-answered our question by reformulating it in the PAC (Probably Approximately Correct) theoretical learning model, and finding the VC-dimension of a class containing the instances of a particular model under all transformations.

M. Lindenbaum and S. Ben-David, Applying VC-dimension Analysis to Object Recognition. Proceedings of the 3rd European Conference on Computer Vision, 1994, pp. 239-240.

M. Lindenbaum and S. Ben-David, Applying VC-dimension Analysis to 3D Object Recognition from Perspective Projections. Proceedings of the 12th National Conf. on Artificial Intelligence (AAAI), 1994, pp. 985-990.

These results quantify many prior intuitive observations, such as "similar objects are harder to discriminate," or "higher reliability requires more data." They are useful for setting the required data subset size when designing a new algorithm, for analyzing reported results by comparing them to the theoretical bounds, for inferring performance when conditions are changed, and for ranking different recognition tasks according to their difficulty. The results are concerned with the ability of a "consistency test" to reject incorrect hypotheses and are independent of the hypothesis generating mechanism. Thus, they are, in a sense, algorithm independent.

Grouping-based Nonadditive Verification

Perceptual grouping can be informative for object recognition as well. Intuitively, we can say that a good hypothesis should also be a good group. This criterion turns out to be powerful and significantly improves the reliability of the verification process.

A. Amir and M. Lindenbaum, Grouping-based Nonadditive Verification . IEEE Trans. on PAMI-20, No. 2, pp. 186-192, 1998.

Comparing Images under Varying Illumination or Why Gabor Jets Are So Effective ?

Comparing images is fundamental to vision systems. This task becomes harder under variable lighting. Most of the proposed methods work well on rough objects containing discontinuities or areas of rapid change in albedo or shape. Comparing images of smooth surfaces with no edges or texture under varying illumination is, however, more challenging. When the surface is smooth, its normals are highly correlated if they are in close proximity. We model such surfaces as Gaussian processes and derive the resulting statistical characterization of the corresponding images. Supported by this model, we treat the difference between two images associated with the same surface and different lighting as colored Gaussian noise, and use the whitening tool from signal detection theory to construct a measure of difference between them. While our Gaussian assumption is a simplification, the resulting measure functions very well for both synthetic and real smooth objects. Thus we improve upon methods for matching images of smooth objects, while providing insight into their performance.

And how does this whitening relate to Gabor jets ? It turns out that Gabors jets may be interpreted as a combination of gradient direction comparison (a great method for rough surfaces) and whitening (which works for smooth surfaces). This way, they win in both worlds.

M. Osadchy, M. Lindenbaum, and D. Jacobs Whitening for Photometric Comparison of Smooth Surfaces under Varying Illumination. ECCV04, volume IV, pages 217-228, 2004.

M. Osadchy, D. Jacobs, and M. Lindenbaum Surface Dependent Representations for Illumination Insensitive Image Comparison. . IEEE Trans. Pattern Analysis and Machine Intelligence, 29(1):98{111, 2007.



Analysis of Grouping Processes

Grouping processes partition an image (or some partial representation of it, such as an edge map) into regions such that all data elements in one region correspond to one imaged object. As in the recognition problem, my interest in grouping is mainly in modeling and prediction. That is, my work is concerned with suggesting probabilistic models, assesing their validity, and using them to derive predictions on grouping performance with specific algorithms.

Maximum Likelihood (Generic) Grouping Processes and Their Analysis

Together with Arnon Amir, I proposed a quantitative approach to grouping, which consists of a generic grouping method, applicable to many domains, and an analysis of the expected grouping quality. This paper is distinctive in two important ways. First, unlike previous grouping approaches, which were limited to specific domains, the proposed approach is generic. In our study, we demonstrated this virtue by implementing three domain-specific grouping algorithms as instances of the proposed one. The second crucial difference is that we provide results relating the expected grouping quality to the cue reliability and to the computational effort invested.

A. Amir and M. Lindenbaum, Quantitative Analysis of Grouping Processes. IEEE Trans. on PAMI-20, No. 2, pp. 168-185, 1998. (A shorter version appeared in ECCV, pp. 371-384, 1996.)

An Analysis of Connected Components Grouping Processes

We consider the extremely fast connected components grouping process and use probabilistic models to analyze its performance. We derived the expected number of addition errors and the group fragmentation rate and showed that they depend on a few inherent and intuitive parameters. Furthermore, we showed that the grouping process may be controlled so that the performance may be chosen within a given tradeoff. The analytic results are supported by implementing the algorithm and testing it on synthetic and real images.

A. Berengolts and M. Lindenbaum, On the Performance of Connected Components Grouping . IJCV 41(3), pp. 195-216, 2001.

A Probabilistic Interpretation of the Saliency Network

Saliency algorithms aim to find image curves, maximizing some deterministic quality measure which grows with the length of the curve, its smoothness, and its continuity. We proposed a modified saliency estimation mechanism that is based on probabilistically specified grouping cues. In the context of the proposed method, the well-known saliency mechanism proposed by Shaashua and Ullman may be interpreted as a process that tries to detect the curve with maximal expected length. Besides the new interpretation, the proposed characterization has other advantages: First, the probabilistic characterization may be derived and verified from typical images (rather than relying on a preconceived characterization of "important" figure subsets). Second, the probabilistic saliency is more abstract and thus more generic than the common geometric formulations. Therefore, it lends itself to different realizations of saliencies, based on different cues. We demonstrated this by creating, as an instances of the general approach, a saliency process that is based on grey level similarity but still preserves the same interpretation.

We then turned to analyze the expected performance of the saliency process. Regarding the basic grouping information (grouping cues) as random variables, we used ergodicity and asymptotic analysis to derive the saliency distribution associated with the main curves ("figure") and with the rest of the image ("background"). We further considered finite-length curves and analyzed their saliency values. The derived distributions were used to show how to set thresholds on the saliency for deciding optimally between figure and background, how to choose saliency cues, and how to estimate bounds on saliency performance.

M. Lindenbaum and A. Berengolts, A Probabilistic Interpretation of the Saliency Network . ECCV, vol. II, pp. 257-272, 2000.

A. Berengolts and M. Lindenbaum, On the Distribution of Saliency . CVPR (2), pp. 543-549, 2004.

An Information-based Measure for Grouping Quality

The evaluation of grouping results is not straightforward and is often heuristic. We proposed a method for measuring grouping quality, based on the following observation: a better grouping result provides more information about the true, unknown grouping. The amount of information is evaluated by the number of queries required to specify the true grouping. A related measure of informativeness is the uncertainty of the true grouping, characterized by means of a probabilistic model and common information theory terms such as surprise and entropy. This relation between the measures is established and experimentally supported. The proposed information-based quality measure is relatively free from arbitrary choices, treats different types of grouping errors in a uniform way, and does not favor any algorithm. The query count was found to be approximately monotonic in the entropy and independent of the grouping error type, indicating that the query count is an adequate unbiased means for comparing grouping results. Moreover, we found that the query count measure approximates human judgment better than other methods and, as such, gives better results when used to optimize a segmentation algorithm, as demonstrated in our experiments.

E.A. Engbers, M. Lindenbaum and A.W.M. Smeulders, An Information-Based Measure for Grouping Quality. ECCV (3), pp. 392-404, 2004

Unsupervised Estimation of Segmentation Quality using Nonnegative Factorization

Recently I became interested in unsupervised performance estimation using empirical models. Such models evaluate the accuracy/reliability of a particular result or predict performance before the algorithm is run, using a model which is partly derived, in an unsupervised way, from a particular image. The prediction is therefore image adaptive.

We observed, for example, that additive properties of segments (e.g., histograms of some feature), provided by some segmentation algorithm are linear combinations of the corresponding properties associated with the true (in some sense) segmentation. Then, using many segmentations of the same image and nonnegative matrix factorization (NMF) techniques, we were able to get the properties of the true segments. This allowed us to estimate the precision/recall of a given segmentation without ground truth.

R. Sandler and M. Lindenbaum, Unsupervised Estimation of Segmentation Quality using Nonnegative Factorization. Proc. IEEE Conf. Computer Vision and Pattern Recognition - CVPR08, 2008.



Image Processing



Adaptive and progressive image sampling

What is the best method for progressively sampling an image? We proposed a progressive sampling algorithm, which retains (deterministic) uniformity with the increased density and places the samples irregularly, thus providing excellent anti-aliasing properties. The uniformity metric can be either image-independent or image-dependent, so an adaptive sampling scheme (such as the one in the image above) can be derived as well.

Yuval Eldar, Michael Lindenbaum, Moshe Porat and Yehoshua Y. Zeevi, The Farthest Point Strategy for Progressive Image Sampling, IEEE Transactions on Image Processing, 6, No. 9, pp. 1305-1315, 1997.

Efficient Karhunen Loeve (PCA) Basis Computations

The computational cost of the approximated Karhunen Loeve basis calculation (or PCA) is often prohibitive. Together with Hiroshi Murase, I reduced both the computational and storage requirements by exploiting the spatial and temporal correlations common in many image sets.

H. Murase and M. Lindenbaum, Spatial Temporal Adaptive Method for Partial Eigenstructure Decomposition of Large Image Matrices, IEEE Trans. on Image Processing, IP-4, No. 5, pp. 620-629, 1995.

Together with Avi Levy, I developed a sequential method for calcualting the KL basis. The algorithm is faster in typical applications than the usual batch algorithm, and is especially advantageous for image sequences: the KL basis calculation is done with much shorter delay and allows for dynamic updating of object databases for recognition. Systematic tests of the implemented algorithm show that these advantages are indeed obtained with the same accuracy as batch KL algorithms.

A. Levy and M. Lindenbaum, Sequential Karhunen-Loeve Basis Extraction, IEEE Transactions on Image processing 9(8), pp. 1371-1374, 2000 (or an older version in Int Conf. on Image Processing (ICIP) 1998.)

You are welcome to use the software package and demo implementing this algorithm (in Unix and Windows).



Learning


Some Results in Learning Theory

It turns out that the results we proved for analyzing vision tasks are also of interest for the learning theory community.

S. Ben-David and M. Lindenbaum, Localization vs. Identification of Semi-Algebraic Sets. Proceedings of the 6th ACM Conference on Computational Learning Theory, 1993, pp. 327-336.

S. Ben-David and M. Lindenbaum, Learning Distributions by their Density Levels - A Paradigm for Learning Without a Teacher. Journal of Computer and Systems Sciences, 1997, pp. 171-182.

Example Selection in the Context of Nearest Neighbor Classifiers

In many domains, getting examples for learning is easy but labeling them is costly. In such cases, it is beneficial to actively select the examples for labeling with the goal of reducing the labeling cost. Together with Dmitry Rusakov and Shaul Markovitch, I proposed a lookahead selective sampling process, whose goal is to select the examples with the highest utility and which uses a random field model for the required estimate of the possible labels' probability. We found empirically that the proposed algorithm outperforms other selective sampling methods in terms of the average error rate and performance stability.

M. Lindenbaum, S. Markovich, and D. Rusakov, Selective Sampling for Nearest Neighbor Classifiers. Proceedings of AAAI-99.



Robotics and Computational Robotics


Geometric Probing

My thesis, supervised by Freddy Bruckstein, was concerned with algorithmic robotics issues. We considered geometric probing tasks, where one is asked to provide a strategy for finding an object's shape using a small number of simple measurements (probes). Part of my work on this subject was dedicated to finding (the first) strategies for parallel probing, as well as lower bounds which prove that they are optimal. The other part was concerned with broadening the traditional geometric probing scope, which was based on the non-realistic object polygonality assumption. Here, we provided an efficient strategy for reconstructing non-polygonal objects with a finite number of probes and with a certain pre-specified precision.

M. Lindenbaum and A. Bruckstein, Parallel Strategies for Geometric Probing . Journal of Algorithms 13, pp. 320-349, 1992.

M. Lindenbaum and A. Bruckstein, Blind Approximation of Planar Convex Shapes, IEEE Trans. on Robotics and Automation, RA-10, No. 4, pp. 517-529, 1994.

Distributed Ant Robotics

Computational robotics is no longer my main research direction but I am still involved with some topics. In distributed robotics one considers a population of robots driven by the same (usually simple and local) strategy, with the aim of selecting the strategy which leads to a useful emergent behavior.

Together with Israel Wagner and Freddy Bruckstein, and inspired by ant communication (using pheromones), I investigated the ability of a group of robots, who communicate by leaving traces, to perform the task of cleaning the floor of an unmapped building, or any task that requires traversing an unknown network. More specifically, we considered robots that leave chemical odor traces. These traces evaporate over time, and the robots are able to evaluate the strength of the odor at every point they reach, with some measurement error. Our abstract model is a decentralized multia(ge)nt adaptive system with a shared memory, moving on a graph whose vertices are the floor tiles. We describe three methods of cooperatively covering a graph, using smell traces that gradually vanish, and show that they all result in eventual task completion, two of them in a time polynomial in the number of tiles. As opposed to existing traversal methods (e.g., DFS), our algorithms are adaptive: they will complete the traversal of the graph even if some of the agents die or the graph changes (edges/vertices added or deleted) during the execution, as long as the graph stays connected.

Israel A. Wagner, Michael Lindenbaum, and Alfred M. Bruckstein.
Cooperative Covering by Ant-Robots Using Evaporating Traces,
For more details and other computational robotics issues, see the home page of Israel Wagner.
For a demo, you might want to look at our Simulators.

Visual Servoing

Together with Keisuke Kinoshita, I considered the task of visual servoing with partial information, proposed a generic control rule for such tasks and, using techniques from optimization, characterized the conditions required for the convergence. We then focused on one particular task from this class: position and orientation control with a single rotating camera. We suggested strategies for control and camera motion, and implemented the 3D task of inserting a rod into a hollow pipe.

K. Kinoshita and M. Lindenbaum, Robotic Control with Partial Visual Information, ICCV, pp. 883-888, 1998
 



 


Teaching

  • 236327 Introduction to Signal and Image Processing
  • 234262 Logic Design
  • 236815 Computer Vision seminar

  • Information for current/future graduate students.



    You are visitor number  .