Immersive Telepresence aims at bringing immersive experience into telecommunication so people across geographically distributed sites can interact collaboratively as if they were face-to-face. This requires deep understanding of multiple disciplines. In particular, computer vision, graphics and acoustics are indispensable in capturing and rendering 3D dynamic environments in order to create the illusion that the remote participants are in the same room. The existing videoconferencing systems, whether they are available on desktop and mobile devices or in dedicated conference rooms with built-in furniture and life-sized high-definition video, leave a great deal to be desired: mutual gaze, 3D, motion parallax, spatial audio, to name a few. In this talk, I will describe various research activities that are being conducted in Microsoft Research for tele-immersive communication and collaboration. Broadly, we follow two approaches: virtual tele-immersion (projects: Avatar Kinect and Viewport) and physical tele-immersion (projects: Embodied Social Proxy and ViewMark).
Speaker's Brief Bio
Zhengyou Zhang received the B.S. degree in electronic engineering from Zhejiang University, Hangzhou, China, in 1985, the M.S. degree in computer science from the University of Nancy, Nancy, France, in 1987, the Ph.D. degree in computer science from the University of Paris XI, Paris, France, in 1990, and the Doctorate of Science (Habilitation à diriger des recherches) from the University of Paris XI, Paris, France, in 1994.
He is a Principal Researcher with Microsoft Research, Redmond, WA, USA, and the Research Manager of the “Multimedia, Interaction, and Communication” group. Before joining Microsoft Research in March 1998, he was with INRIA (French National Institute for Research in Computer Science and Control), France, for 11 years and was a Senior Research Scientist from 1991. In 1996-1997, he spent a one-year sabbatical as an Invited Researcher with the Advanced Telecommunications Research Institute International (ATR), Kyoto, Japan. He has published over 200 papers in refereed international journals and conferences, and has coauthored the following books: 3-D Dynamic Scene Analysis: A Stereo Based Approach (Springer-Verlag, 1992); Epipolar Geometry in Stereo, Motion and Object Recognition (Kluwer, 1996); Computer Vision (Chinese Academy of Sciences, 1998, 2003, in Chinese); Face Detection and Adaptation (Morgan and Claypool, 2010); and Face Geometry and Appearance Modeling (Cambridge University Press, 2011). He has given a number of keynotes in international conferences and invited talks in universities.
Dr. Zhang is a Fellow of the Institute of Electrical and Electronic Engineers (IEEE), the Founding Editor-in-Chief of the IEEE Transactions on Autonomous Mental Development, an Associate Editor of the International Journal of Computer Vision, an Associate Editor of Machine Vision and Applications, and an Area Editor of the Journal of Computer Science and Technology. He served as Associate Editor of the IEEE Transactions on Pattern Analysis and Machine Intelligence from 2000 to 2004, an Associate Editor of the IEEE Transactions on Multimedia from 2004 to 2009, an Associate Editor of the International Journal of Pattern Recognition and Artificial Intelligence from 1997 to 2009, among others. He has participated in organizing numerous international conferences in the areas of computer vision, signal processing, multimedia, human-computer interaction, and autonomous mental development. He is serving as a Chair of a new track “Technical Briefs” of the ACM SIGGRAPH Asia Conference, Nov. 28 – Dec. 1st, 2012. More information is available at http://research.microsoft.com/~zhang/