|
|
|
|
|
|
|
|
|
Roberto
Rinaldo via
delle Scienze, 208 rinaldo@uniud.it Extended curriculum (Italian and English) |
|
Roberto Rinaldo obtained the "Laurea in Ingegneria
Elettronica" degree in 1987 from the University of Padova, Padova,
Italy. From 1990 to 1992, he was with the University of California at
Berkeley, where he received the MS degree in 1992. He received the Doctorate
degree in "Ingegneria Elettronica e dell'Informazione" from
the University of Padova in 1992. In 1992 he joined the Dipartimento
di Elettronica e Informatica of the University of Padova as a "ricercatore".
Starting from November 1st 1998, he was associate professor in Communications
and Signal Processing in the same Department. |
Click here for a list of National and International Projects in which Roberto Rinaldo was and is currently
involved.
>
>
>
>
>
Roberto Rinaldo's research activity is in the
general field of multidimensional signal processing and analysis, with a
particular interest in image and video coding in telecommunication systems.
He is now considering the problem of digital transmission of multimedia
signals in telecommunication networks.
A list of the main research topics he is working
on or worked on in the past is the following:
Object based video coding, composite
document coding and lossless coding
The standard
approach to video compression consists in computing a motion-compensated
prediction of the actual frame image, followed by transform coding of the
residual. More recently, object oriented video processing has received
increasing attention and in fact the most recent standards describe the
scene as a collection of objects that can be animated and coded
independently. In this project, we considered the problem of coding and
also of frame interpolation in reduced rate video transmission systems. The
objects in the scene are modelled as the projection onto the camera plane
of 3D rigid bodies undergoing three dimensional motion. A Kalman filter is
used to estimate motion parameters on the basis of the observations
relative to the 2D positions of characteristic features in the scene. The
estimated motion is used to interpolate missing frames at the receiver. Another
application is coding, where the residual error between original and motion
compensated frames is actually sent to the receiver. More recently, a
similar scheme has been proposed for the animation of virtual faces in a
model based video coder. Local motion relative to the mouth and eye
movements is also considered and estimated using a template matching
procedure.
Coding of composite documents requires, for the regions corresponding to
text or graphic art, the use of lossless coding techniques. The schemes
proposed for this application use spatially variant prediction and entropy
coding of the residual. Recently, the JPEG2000 standard for still images,
provides a scheme for progressive lossy to lossless coding of images. We
are now considering, within the European project "MetaVision",
the challenging problem of lossless compression of colour video with cinema
quality, starting from a raw bit rate of more than 3 Gbit/s.
Image and Video coding
The enormous amount of information required for the representation,
transmission or storage of multimedia signals, in particular for video and
images, requires the adoption of sophisticated source coding techniques in
the first place. Research in this field focused on the theory of subband
decomposition, and in particular of pyramidal schemes based on the wavelet
representation. The wavelet transform opearates a multiresolution
representation, which exhibits a certain amount of redundancy which can be
exploited for efficient compression. In particular, quantized zero
coefficient positions appear to be highly correlated across scales.
Moreover, a certain degree of similarity appears around edges. An image
coder which exploits redundancy of the wavelet representation was developed
for this project. Similarly to what is done in fractal coders, blocks of
coefficients in a subband are predicted from coefficients in lower
resolution subbands with the same orientation. Scalar quantization followed
by entropy coding or pyramid vector quantization is used for those
coefficient blocks which could not be effectively predicted.
Similar ideas were used for the design of a video coder for the
transmission of videoconference sequences at low bit rates (< 64
kbit/s). It exploits the redundancy of the wavelet representation both in
the 2D and 3D domain. In particular, a segmentation mask is used to
discriminate wavelet coeefficients which could not be effectively predicted
using motion compensation, and which are coded using the prediction scheme
outlined above in the 2D motion compensated prediction error image.
Statistical modeling of signals in
the subband decomposition
The signals in a subband decomposition of images can be modelled
approximately as independent vectors of independent Laplacian or
generalized Gaussian random variables. Using this model, it is possible to
evaluate the entropy of the output symbols and the distortion resulting
from the use of a given quantizer. In this project, we designed a procedure
to determine the optimal threshold quantizer for subband coding. This
quantizer has a dead-zone around the origin and is used in various
standards for video coding. The availability of a procedure to design the
optimal quantizer and compute the resulting rate-distortion curve allows
for a precise bit-rate or distortion control in video transmission systems.
A better model for images is a first order Markov process. Using this
model, it is possible to compute the covariance matrix of subband vectors,
and not just use the independence assumption. We developed an efficient
procedure in the time domain to compute the coding gain in subband
decomposition systems, both for intraframe and inter-frame coding. We also
considered the problem of optimal reconstruction filter design and of wavelet
best basis using this model.
Multiresolution analysis and wavelet
transform for fractal images
Research in this field has been carried out at the Department of Electrical
Engineering of the University of California at Berkeley, in the years
1990-1992. Fractal geometry is a useful tool to build deterministic and
stochastic models for many natural phenomena. As a consequence, fractal
theory has been used in a variety of problems, including image segmentation
and coding, or generation of virtual worlds in computers graphics. From a
general stand-point, fractal images exhibit a high degree of
self-similarity when analyzed at different scales. In other words, a
fractal image, when analyzed through a magnifying lens, is likely to
exhibit the same properties and characteristics of the original image. The
wavelet transform, which provides a multiresolution analysis of signals, is
therefore a promising tool for fractal image analysis. Moreover, the
self-similarity property of fractals suggests that a multiresolution
representation can provide a very efficient method for coding or
representation. In this project, we addressed the "inverse
problem" of determining the parameters of a class of fractal objects,
generated using iterated maps (IFS), which could approximate a given image.
The solution is obtained using the wavelet transform and the moment method.
Spectral estimation
of television sequences
Quantitative information on frequency characteristics of signals is very
useful for filter design and coding. In this project, a tool for the
spectral estimation of television signals was developed, based on the
multidimensional Periodogram and autoregressive models. The extension to
the multidimensional case of the techniques usually considered for 1D
signals is quite challenging, due to the enormous amount of data required
for estimation.
The spectrum analyzer was used for the design of 3D filters for luminance
and chrominance separation in the PAL signal. The filter design was carried
out by taking into account the 3D frequency characteristics of the signal
and using an MSE criterion. Applications to High Definition Television and
the problem of algorithm optimization were also considered.
2003 European
Project "WireNet", RTD Performers: Sintef (NO), Labor (I)
1999 "Analisi, sintesi e indicizzazione di sequenze audio/video
nella comunicazione multimediale", Coordinator: Prof. Carlo Braccini
(Univ. Genova), Financing: MURST (40%)
1999 European Project IST-1999-20859 "Metavision", Partecipanti:
Snell & Wilcox (UK), BBC (UK), INESC (P), Electrocraft (UK), France 2
(F), ARRI (D), MRT (N), Università di Padova (I)
1998 "Ambienti avanzati per comunicazioni a pacchetto," Coordinator
Prof. Enrico Gregori (CNR CNUCE Pisa), Financing: CNR
1997 "Comunicazione
multimediale: analisi di contenuto, modellamento, accesso e protezione dell'informazione",
Coordinator Prof. Carlo Braccini (Univ. Genova), Financing: MURST (40%)
1995 "Elaborazione
e codifica di segnali per sistemi multimediali di telecomunicazione",
Coordinator Prof. Carlo Braccini (Univ. Genova), Financing: MURST (40%)
1996 "Elaborazione e codifica di segnali per sistemi multimediali
di telecomunicazione", Coordinator Prof. Carlo Braccini (Univ. Genova),
Financing: MURST (40%)
1997 "Comunicazione
video in ambito radio-mobile", Coordinator Prof. Riccardo Leonardi (Univ.
Brescia), Financing: CNR
1997 "Ambienti
avanzati per comunicazioni a pacchetto", Coordinatore: Prof. Enrico Gregori
(CNR CNUCE Pisa), Financing: CNR
1996 "Progetto
di un codificatore video a basso bit rate", Coordinatore: Prof. Riccardo
Leonardi (Univ. Brescia), Financing: CNR
1995 "Progettazione
di un sistema di comunicazione video a bassissimo bit-rate", Coordinator
Prof. Riccardo Leonardi (Univ. Brescia), Financing: CNR
He teaches the courses:
Telecommunications Networks, Electrical Communications
and Telecommunication Systems II.
For a brief course description and class material (in Italian), please click
here.