Virtual Headcam: Pan/tilt Mirror-based Facial Performance Tracking

Virtual Headcam: Pan/tilt Mirror-based Facial Performance Tracking
SIGGRAPH 2009 Poster?
Xueming Yu, Shanhe Wang, Jay Busch, Thai Phan, Tracy McSheery, Mark Bolas, Paul Debevec
USC Institute for Creative Technologies PhaseSpace Inc.

Introduction:

High-end facial performance capture solutions typically use head-mounted camera systems which provide one or more close-up video streams of each actor’s performance. These provide clear views of each actor’s performance, but can be bulky, uncomfortable, get in the way of sight lines, and prevent actors from getting close to each other. To address this, we propose a virtual head-mounted camera system: an array of cameras placed around the performance capture volume which automatically track zoomed-in, sharply focused, high-resolution views of the each ac-tor’s face from a multitude of directions. The resulting imagery can be used in conjunction with body motion capture data to derive nuanced facial performances without head-mounted cameras.

System Design:

Our approach keeps the weighty camera and zoom lens in a fixed place and frames the view of the actor through a two-axis mirror pan/tilt apparatus as in [Okumura et al. 2011], and adds remote zoom & focus control of the lens. The light weight of the glass mirrors eliminates the need for heavy mechanics, and dra-matically improves the system response time and accuracy so that cameras can be quickly retasked to different actors as they move within the performance volume. Unlike [Okumura et al. 2011], we design our system to work with fast off-the-shelf video lenses with a large exit pupil, and we avoid expensive mirror galvanometers. Instead, we adapt brushless gimbal motors (Fig. 2) in the 36N42P (N for stator arm; P for magnet pole) configuration to drive mirrors, providing smooth rotation as well as constant torque over variant speed. A 10K-count optical encoder is used for motor position feed-back. With this setup, we are able to achieve rotary resolution as accurate as 0.009 ◦ for the mirror.

We use Fujinon-TV.Z 7.5mm-105mm lenses to cover a subject’s movement inside a performance capture stage. This lens includes on-board gear & motor box for remote control of zoom and focus. We designed a circuit board (Fig. 2) with a networking interface and an on-board micro-controller to control the lens box and the gimbal motors. 3D printed parts and 80/20TM aluminum framing modules are used for the mechanical construction. We use Pt. Grey Flea3 USB3 cameras running at 1280 × 1024 pixels at 60fps to record.

Capture Volume Integration:

We install the tacking cameras around the volume of a PhaseSpace motion capture system, and use 120Hz position and orientation data of lightweight active tracking markers on top of the head (out of view of the actor) to calculate the proper mirror angles and zoom/focus positions to aim each camera at each actor (Fig. 2). The resulting view of each camera is firmly locked on the face of the actor with tight framing, allowing nu-anced performances to be seen. The use of active focus allows for a wide lens aperture, minimizing the requirements for ambient light. The cameras in each unit stream the image data to storage media. We can track two actors simultaneously and switch which camera is looking to each actor in less than 0.1s. This will allow multi-ple cameras to record performances of multiple actors, dynamically switching between faces as actors turn around and walk in front of each other to ensure that each actor’s face is seen from multiple angles throughout their performance.

Ongoing Work:

We are developing 3D reconstruction techniques to convert the multi-view video data into 3D models of dynamic facial performances. We are constructing additional facial tracking cameras in order to provide sufficient views. More units will be built in order to yield a robust input data into the software pipeline for successful 3D reconstruction.

Material:

VHC 2015 Paper:

VirtualHeadcam_SIGGRAPH2015.pdf, 3.36 MB. (Adobe Acrobat)

SIGGRAPH 2009 Poster:

CosineLobeBaseRelighting _4ftx3ft_Poster.png, 5.89 MB.
VirtualHeadcam_SIGGRAPH2015.pdf, 3.36 MB. (Adobe Acrobat)

Related Projects:

Light Stage 3:
- A Lighting Reproduction Approach to Live-Action Compositing
  Polarized Spherical Gradient Illumination , EGSR 2007
Light Stage 5:
- Rapid Acquisition of Specular and Diffuse Normal Maps from
  Polarized Spherical Gradient Illumination , EGSR 2007

Surface Reflectometry:
- Estimating Specular Roughness and Anisotropy from Second Order
  Spherical Gradient Illumination , SIGGRAPH 2009 Poster