ZIB

1

Book

Advances in image processing and understanding /; 52 (2002)

Bovik, Alan C. [[Hrsg.]] ; Huang, Thomas S.

Singapore [u.a.] :World Scientific,

add to mindlist on the mindlist

Details

Title: Advances in image processing and understanding /; 52

Contributer: Bovik, Alan C. , Huang, Thomas S.

Publisher: Singapore [u.a.] :World Scientific,

Year of publication: 2002

Pages: VI, 390 S.

Series Statement: Series in machine perception and artificial intelligence 52

ISBN: 981-238-091-4

Type of Medium: Book

Language: Undetermined

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

ZIB Catalog

Zuse Institute Berlin

2

Book

Facial analysis from continuous video with applications to human-computer interface (2004)

Colmenarez, Antonio J. ; Xiong, Ziyou ; Huang, Thomas S.

Boston [u.a.] :kluwer Acad. Publ.,

add to mindlist on the mindlist

Details

Title: Facial analysis from continuous video with applications to human-computer interface

Author: Colmenarez, Antonio J.

Contributer: Xiong, Ziyou , Huang, Thomas S.

Publisher: Boston [u.a.] :kluwer Acad. Publ.,

Year of publication: 2004

Pages: XXIV, 134 S.

Series Statement: Kluwer international series on biometrics

ISBN: 1-4020-7802-1

Type of Medium: Book

Language: English

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

ZIB Catalog

Zuse Institute Berlin

3

Book

3D face processing / (2004)

Wen, Zhen ; Huang, Thomas S.

Boston [u.a.] :Kluwer Acad. Publ.,

add to mindlist on the mindlist

Details

Title: 3D face processing /

Author: Wen, Zhen

Contributer: Huang, Thomas S.

Publisher: Boston [u.a.] :Kluwer Acad. Publ.,

Year of publication: 2004

Pages: XVIII, 136 S. : , Ill., graph. Darst.

Series Statement: Kluwer international series in video computing

ISBN: 1-402-08047-6

Type of Medium: Book

Language: English

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

ZIB Catalog

Zuse Institute Berlin

4

Electronic Resource

Constructing table-of-content for videos (1999)

Rui, Yong ; Huang, Thomas S. ; Mehrotra, Sharad

Springer

Multimedia systems 7 (1999), S. 359-368

add to mindlist on the mindlist

Details

ISSN: 1432-1882

Keywords: Key words:Video accessing – Scene-level ToC construction

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Notes: Abstract. A fundamental task in video analysis is to extract structures from the video to facilitate user's access (browsing and retrieval). Motivated by the important role that the table of content (ToC) plays in a book, in this paper, we introduce the concept of ToC in the video domain. Some existing approaches implicitly use the ToC, but are mainly limited to low-level entities (e.g., shots and key frames). The drawbacks are that low-level structures (1) contain too many entries to be efficiently presented to the user; and (2) do not capture the underlying semantic structure of the video based on which the user may wish to browse/retrieve. To address these limitations, in this paper, we present an effective semantic-level ToC construction technique based on intelligent unsupervised clustering. It has the characteristics of better modeling the time locality and scene structure. Experiments based on real-world movie videos validate the effectiveness of the proposed approach. Examples are given to demonstrate the usage of the scene-based ToC in facilitating user's access to the video.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1007/s005300050138

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

5

Electronic Resource

A Region-Based Representation of Images in MARS (1998)

Servetto, Sergio D. ; Rui, Yong ; Ramchandran, Kannan ; [et al.]

Springer

The journal of VLSI signal processing systems for signal, image, and video technology 20 (1998), S. 137-150

add to mindlist on the mindlist

Details

ISSN: 1573-109X

Source: Springer Online Journal Archives 1860-2000

Topics: Electrical Engineering, Measurement and Control Technology

Notes: Abstract We study the problem of representing images within a multimedia Database Management System (DBMS), in order to support fast retrieval operations without compromising storage efficiency. To achieve this goal, we propose new image coding techniques which combine a wavelet representation, embedded coding of the wavelet coefficients, and segmentation of image-domain regions in the wavelet domain. A bitstream is generated in which each image region is encoded independently of other regions, without having to explicitly store information describing the regions. Simulation results show that our proposed algorithms achieve coding performance which compares favorably, both perceptually and objectively, to that achieved using state-of-the-art image/video coding techniques while additionally providing region-based support.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1023/A:1008026508931

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

6

Electronic Resource

Animated Talking Head with Personalized 3D Head Model (1998)

Ostermann, Jörn ; Chen, Lawrence S. ; Huang, Thomas S.

Springer

The journal of VLSI signal processing systems for signal, image, and video technology 20 (1998), S. 97-105

add to mindlist on the mindlist

Details

ISSN: 1573-109X

Source: Springer Online Journal Archives 1860-2000

Topics: Electrical Engineering, Measurement and Control Technology

Notes: Abstract Natural Human-Computer Interface requires integration of realistic audio and visual information for perception and display. An example of such an interface is an animated talking head displayed on the computer screen in the form of a human-like computer agent. This system converts text to acoustic speech with synchronized animation of mouth movements. The talking head is based on a generic 3D human head model, but to improve realism, natural looking personalized models are necessary. In this paper, we report a semi-automatic method for adapting a generic head model to 3D range data of a human head obtained from a 3D-laser range scanner. This personalized model is incorporated into the talking head system. With texture mapping, the personalized model offers a more natural and realistic look than the generic model. The model created with the proposed method compares favorable to generic models.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1023/A:1008070323952

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

7

Electronic Resource

Learning Recognition and Segmentation Using the Cresceptron (1997)

Weng, John (Juyang) ; Ahuja, Narendra ; Huang, Thomas S.

Springer

International journal of computer vision 25 (1997), S. 109-143

add to mindlist on the mindlist

Details

ISSN: 1573-1405

Keywords: visual learning ; face recognition ; face detection ; object recognition ; object segmentation ; feature selection ; feature extraction ; shape representation ; self-organization ; associative memory

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Notes: Abstract This paper presents a framework called Cresceptron for view-based learning, recognition and segmentation. Specifically, it recognizes and segments image patterns that are similar to those learned, using a stochastic distortion model and view-based interpolation, allowing other view points that are moderately different from those used in learning. The learning phase is interactive. The user trains the system using a collection of training images. For each training image, the user manually draws a polygon outlining the region of interest and types in the label of its class. Then, from the directional edges of each of the segmented regions, the Cresceptron uses a hierarchical self-organization scheme to grow a sparsely connected network automatically, adaptively and incrementally during the learning phase. At each level, the system detects new image structures that need to be learned and assigns a new neural plane for each new feature. The network grows by creating new nodes and connections which memorize the new image structures and their context as they are detected. Thus, the structure of the network is a function of the training exemplars. The Cresceptron incorporates both individual learning and class learning; with the former, each training example is treated as a different individual while with the latter, each example is a sample of a class. In the performance phase, segmentation and recognition are tightly coupled. No foreground extraction is necessary, which is achieved by backtracking the response of the network down the hierarchy to the image parts contributing to recognition. Several stochastic shape distortion models are analyzed to show why multilevel matching such as that in the Cresceptron can deal with more general stochastic distortions that a single-level matching scheme cannot. The system is demonstrated using images from broadcast television and other video segments to learn faces and other objects, and then later to locate and to recognize similar, but possibly distorted, views of the same objects.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1023/A:1007967800668

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

8

Electronic Resource

New Devices for 3D Pose Estimation: Mantis Eyes, Agam Paintings, Sundials, and Other Space Fiducials (2000)

Bruckstein, Alfred M. ; Holt, Robert J. ; Huang, Thomas S. ; [et al.]

Springer

International journal of computer vision 39 (2000), S. 131-139

add to mindlist on the mindlist

Details

ISSN: 1573-1405

Keywords: Pose determination ; space fiducials

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Notes: Abstract Several unconventional ideas for viewer/camera pose estimation are discussed. The methods proposed so far advocate the use of advanced image processing for identification and precise location of calibration objects in the images acquired, and base pose recovery on the identification of the viewing dependent deformations of these objects. We propose to more fully exploit the freedom in the design of “space fiducials” or calibration objects showing that we can build objects whose images directly encode, in easily identifiable gray-level/color or temporal patterns, the pose of their viewer. We also show how to construct high-precision fiducials, which can determine a viewing direction quite accurately when it is known to lie within a relatively narrow range.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1023/A:1008123110489

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

9

Electronic Resource

Optimum Fiducials Under Weak Perspective Projection (1999)

Bruckstein, Alfred M. ; Holt, Robert J. ; Huang, Thomas S. ; [et al.]

Springer

International journal of computer vision 35 (1999), S. 223-244

add to mindlist on the mindlist

Details

ISSN: 1573-1405

Keywords: pose recovery ; weak perspective projection ; fiducial design

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Notes: Abstract We investigate how a given fixed number of points should be located in space so that the pose of a camera viewing them from unknown locations can be estimated with the greatest accuracy. We show that optimum solutions are obtained when the points form concentric complete regular polyhedra. For the case of optimal configurations we provide a worst-case error analysis and use it to analyze the effects of weak perspective approximation to true perspective viewing. Comprehensive computer simulations validate the theoretical results.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1023/A:1008156210387

Permalink

Library	Location	Call Number	Volume/Issue/Year	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext