Invited Talks on Vision, Graphics, and Speech at Faculty of Information Technology, Brno University of Technology

Monthly Archives: March 2015

Rafał Mantiuk is a senior lecturer (associate professor) at Bangor University (UK) and a member of a Reasearch Institute of Visual Computing. Before comming to Bangor he received his PhD from the Max-Planck-Institute for Computer Science (2006, Germany) and was a postdoctoral researcher at the University of British Columbia (Canada). He has published numerous journal and conference papers presented at ACM SIGGRAPH, Eurographics, CVPR and SPIE HVEI conferences, applied for several patents and was recognized by the Heinz Billing Award (2006). Rafal Mantiuk investigates how the knowledge of the human visual system and perception can be incorporated within computer graphics and imaging algorithms. His recent interests focus on designing imaging algorithms that adapt to human visual performance and viewing conditions in order to deliver the best images given limited resources, such as computation time or display contrast. His talk takes place on Friday, March 27 at 1pm, in room E104.

From high dynamic range to perceptual realism

Abstract: Today’s computer graphics techniques make it possible to create imagery that is hardly distinguishable from photographs. However, a photograph is clearly no match to an actual real-world scene. I argue that the next big challenge in graphics is to achieve perceptual realism by creating artificial imagery that would be hard to distinguish from reality. This requires profound changes in the entire imaging pipeline, from acquisition and rendering to display, with the strong focus on visual perception.

In this talk I will give an brief overview of several projects related to high dynamic range imaging and the applications of visual perception. Then I will discuss in more detail a project in which we explored the “dark side” of the dynamic range in order to model how people perceived images at low luminance. We use such a model to simulate the appearance of night scenes on regular displays, or to generate compensated images that reverse the changes in vision due to low luminance levels. The method can be used in games, driving simulators, or as a compensation for displays used under varying ambient light levels.

Jörn Anemüller studied Physics at the University of Oldenburg, Germany, and Information Processing and Neural Networks at King’s College, University of London, where he received the M.Sc. in 1996. He earned the Ph.D. in Physics at the University of Oldenburg in 2001with a dissertation on “Across frequency-processing in convolutive blind source separation”. From 2001 to 2004 he conducted work on biomedical signal analysis as a post-doctoral fellow at the Salk Institute for Biological Studies and at the University of California, San Diego. Since 2004 he is member of the scientific staff at the Dept. of Physics, University of Oldenburg, currently leading the statistical signal models research group. His interests include statistical signal processing and machine learning techniques with application to acoustic, speech and biomedical signals. His talk takes place on Thursday, March 26 at 1pm, in room D0206.

Abstract: Engineers may view an auditory neuron as an unknown but (hopefully) identifiable system that transforms the acoustic stimulus input into a binary “spike” or “no-spike” output. The linear part of the neuron’s spectro-temporal transfer function is commonly refered to as the spectro-temporal receptive field (STRF). From a machine learning perspective, this setting corresponds to the binary classification problem of discriminating spike-eliciting from non-spike-eliciting stimulus examples. The classification-based receptive field (CbRF) estimation method that we proposed recently adapts a linear large-margin classifier to optimally predict experimental stimulus-response data and subsequently interprets learned classifier weights as the neuron’s receptive field filter.

Efficacy of the CbRF method is validated with simulations and for auditory spectro-temporal receptive field estimation from experimental recordings in the auditory midbrain of Mongolian gerbils. Acoustic stimulation is performed with frequency- modulated tone complexes that mimic properties of natural stimuli, specifically non-Gaussian amplitude distribution and higher-order correlations. Results demonstrate that the proposed approach successfully identifies correct underlying STRFs, even in cases where standard second-order methods based on the spike-triggered average (STA) do not.

Applied to small data samples, the method is shown to converge on smaller amounts of experimental recordings and with lower estimation variance than the generalized linear model and recent information theoretic methods. Analysis of temporal variability of receptive fields quantifies differences between processing at different stages along the auditory pathway.