*bibtex popups requires javascript support
Tao Chen

Tao Chen (Tao Chen)

I am a postdoctor at Department of Computer Science and Technology in Tsinghua University. Before that, I received my doctor and bachelor degree from Department of Computer Science and Technology and Department of Physics (Fundamental Science Class), Tsinghua University in 2011 and in 2005, respectively
.

My research interests include: image/video processing, editing and composition

My contact info:

Email: chent (at) cg.cs.tsinghua.edu.cn
Phone: 86-10-62797001-801 (Office)
Address: FIT Building 3-523, Tsinghua Univeristy, Beijing, P.R. China, P.C.: 100084


  Publications

object

Data-Driven Object Manipulation in Images
Chen Goldberg, Tao Chen, Fang-Lue Zhang, Ariel Shamir, Shi-Min Hu
Eurographics 2012.

We present a framework for interactively manipulating objects in a photograph using related objects obtained from internet images. Given an image, the user selects a scene-object to modify, and provides keywords to describe it. The application then retrieves and segments objects with a similar shape from online images matching the keyword, and deforms them to correspond with the selected object. By matching the candidate object and adjusting manipulation parameters, the application appropriately modifies candidate objects and composites them into the scene. Supported manipulations include transferring texture, color and shape from the matched object to the target in a seamless manner. We demonstrate the versatility of our framework using several inputs of varying complexity, showing applications to object completion, augmentation, replacement and revealing. We also present an evaluation of our results with a user study.

[ paper ] [ video ] [ user study ]


VStorylines

Visual Storylines: Semantic Visualization of Movie Sequence
Tao Chen, Aidong Lu, Shi-Min Hu
Computers & Graphics. Accept After Minor Revision.

This paper presents a video summarization approach that automatically extracts and visualizes movie storylines in a static image for the purposes of efficient representation and quick overview. A new type of video visualization, Visual Storylines, is designed to summarize video storylines in a succinct visual format while preserving the elegance of original videos. This is achieved with a series of video analysis, image synthesis, relationship quantification and geometric layout optimization techniques. Specifically, we analyze video contents and quantify video story unit relationships automatically through clustering video shots according to both visual and audio data. A multi-level storyline visualization method then organizes and synthesizes a suitable amount of representative information, including both locations and interested objects and characters, with the assistants of special visual languages, according to the relationships between video story units and temporal structure of the video sequence. Several results have demonstrated that our approach is able to abstract the storylines of professionally edited video such as commercial movies and TV series. Preliminary user studies have been performed to evaluate our approach and the results show that our approach can be used to assist viewers to grasp video contents efficiently, especially when they are familiar with the context of the video, or a text synopsis is provided.

[ paper ] [ supplemental ]


Sketch2Photo: Internet Image Montage
Tao Chen, Ming-Ming Cheng, Ping Tan, Ariel Shamir, Shi-Min Hu
Siggraph Asia 2009.

We present a system that composes a realistic picture from a simple freehand sketch annotated with text labels. The composed picture is generated by seamlessly stitching several photographs in agreement with the sketch and text labels; these are found by searching the Internet. Although online image search generates many inappropriate results, our system is able to automatically select suitable photographs to generate a high quality composition, using a filtering scheme to exclude undesirable images. We also provide a novel image blending algorithm to allow seamless image composition. Each blending result is given a numeric score, allowing us to find an optimal combination of discovered images. Experimental results show the method is very successful; we also evaluate our system using the results from two user studies.

[ paper ] [ project page ] [ bibtex ]


Vectorizing Cartoon Animations
Song-Hai Zhang, Tao Chen, Yi-Fei Zhang, Shi-Min Hu, Ralph R. Martin
IEEE Transaction on Visualization and Computer Graphics (TVCG), 2009.

We present a system for vectorizing 2D raster format carton animations. The output animations are visually flicker free, smaller in file size, and easy to edit. We identify decorative lines separately from coloured regions. We use an accurate and semantically meaningful image decomposition algorithm which supports an arbitrary color model for each region. To ensure temporal coherence in the output cartoon, we reconstruct a universal background for all frames, and separately extract foreground regions. Simple user-assistance is required to complete the background. Each region and decorative line is vectorized and stored together with their motions from frame to frame.

[ paper ] [ video ] [ bibtex ]


waterfall
Video-Based Running Water Animation in Chinese Painting Style
Song-Hai Zhang, Tao Chen, Yi-Fei Zhang, Shi-Min Hu, Ralph R. Martin
Science in China Series F: Information Sciences, 2009.

This paper presents a novel algorithm for synthesizing animations of running water, such as waterfalls and rivers, in the style of Chinese paintings, for applications such as cartoon making. All video frames are first registered in a common coordinate system, simultaneously segmenting the water from background and computing optical flow of the water. Taking artists’ advice into account, we produce a painting structure to guide painting of brush strokes. Flow lines are placed in the water following an analysis of variance of optical flow, to cause strokes to be drawn where the water is flowing smoothly, rather than in turbulent areas: this allows a few moving strokes to depict the trends of the water flows. A variety of brush strokes is then drawn using a template determined from real Chinese paintings. The novel contributions of this paper are: a method for painting structure generation for flows in videos, and a method for stroke placement, with the necessary temporal coherence.

[ paper ] [ video ] [ bibtex ]

Projects

ironman

PoseShop: Human Image Database Construction and Personalized Content Synthesis
Tao Chen, Ping Tan, Li-Qian Ma, Ming-Ming Cheng, Ariel Shamir, Shi-Min Hu
Paper Submitted to IEEE Transaction on Visualization and Computer Graphics (TVCG).

We present PoseShop -- a pipeline to construct segmented human image database with minimal manual intervention. By downloading, analyzing, and filtering massive amounts of human images from the Internet we achieve a database which contains 400 thousands human figures that are segmented out of their background. The human figures are organized based on action semantic, clothes attributes and indexed by the shape of their poses. They can be queried using either silhouette sketch or a skeleton to find a given pose. We demonstrate applications for this database for multi-frame personalized content synthesis in the form of comic-strips, where the main character is the user or his/her friends. We address the two challenges of such synthesis, namely personalization and consistency over a set of frames, by introducing head swapping and clothes swapping techniques. We also demonstrate an action correlation analysis application to show the usefulness of the database for vision application.

[ Technical Report ] [ video ] [ supplemental ]


video

Motion-Aware Gradient Domain Video Composition
Tao Chen, Jun-Yan Zhu, Ariel Shamir, Shi-Min Hu
Paper Submitted to IEEE Transaction on Image Processing (TIP).

Gradient domain composition methods like Poisson blending offer practical solutions for uncertain object boundaries and differences in illumination conditions. However, adapting Poisson image blending to videos faces new challenges due to the addition of the temporal dimension. In videos, the human eye is sensitive to small changes in the blending boundaries across frames, and slight differences in the motion of the source patch and the target video. We present a novel video blending approach that tackles these problems by merging the gradient of source and target video and optimizing consistent blending boundary according to a user provided blending trimap for the source video. We extend the mean-value coordinates interpolation to support hybrid blending with dynamic boundary while maintaining interactive performance. We also provide a user interface and source object positioning method that can efficiently deal with complex video sequences beyond the capability of alpha blending.

[ Technical Report ] [ video ]


 

Awards

  • The Netexplorateur Internet Invention Award, 2010.
  • LuZengYong” CAD&CG High Technology Award, 2009.

Academic Services

Links

My Supervisor Shi-Min Hu's group and personal homepage.

My Collaborators and Friends' homepages : Ralph R. Martin, Ariel Shamir, Ping Tan, Aidong Lu, Ming-Ming Cheng, Kun Xu, Yong Li.