| Job Title | Contact | Location |
|---|---|---|
| Senior Lecturer | B dot Theobald at uea dot ac dot uk; Tel: +44 (0)1603 59 2574 | Science 2.09 |
Barry Theobald is part of the Speech, Language and Music Group.
My research interests focus on modelling, analysing and synthesising faces. In terms of modelling, I am interested in developing methods for analysing a set of images and automatically constructing appearance-based models without manual input. The analysis aspects of my research are concerned with computer lip-reading, and with robustly and efficiently tracking faces in video. The synthesis aspects are focussed on developing a flexible system that can re-synthesise visual speech and expression from video, voice or text. I am also interested in applying face models in a wider context. For example, in collaboration with a team of psychologists, we are developing techniques that use avatars in real-time face-to-face applications. The models allow us to manipulate particular aspects of a conversation, such as the temporal dynamics, the expressiveness of a conversant, and even their identity.
Selected Publications
Theobald, B., Matthews, I., Mangini, M., Spies, J., Brick, T., Cohn, J.F., and Boker, S. Mapping and Manipulating Facial Expression. Journal of Language and Speech. (Accepted to appear).
Theobald, B., and Cohn, J. Facial Image Synthesis. Oxford Companion to Affective Sciences: An Encyclopaedic Dictionary for the Affective Sciences. NY: Oxford University Press. (In Press).
Theobald, B., and Wilkinson, N. A Probabilistic Trajectory Synthesis System for Synthesising Visual Speech. In Proceedings of Interspeech, pp. 2310-2313, 2008.
Theobald, B., Fagel, S., Elisei, F., and Bailly, G. LIPS2008: Visual Speech Synthesis Challenge. In Proceedings of Interspeech, pp. 1875-1878, 2008.
Theobald, B., Wilkinson, N., and Matthews, I. On Evaluating Synthesised Visual Speech. In Proceedings of the International Conference on Auditory-Visual Speech Processing, pp. 7-12, 2008.
For a more complete list of my publications, see my personal page.
| Project Title | Start Date | End Date | Funding Body | Project Members |
|---|---|---|---|---|
| LILiR2 Language Independent Lip Reading | 31/5/2007 | 29/9/2010 | EPSRC | Richard Harvey, Stephen Cox, Barry Theobald |
| Creating expressive three-dimensional talking faces | 5/6/2006 | 4/10/2008 | EPSRC | N/A |
| Synthesis of the Face in Real-Time | 1/1/2006 | 31/12/2008 | National Science Foundation | N/A |
| Research & Development Support for Human Signing | 1/1/2000 | 31/12/2002 | Independent Television Commission | Barry Theobald, John Glauert, Judith Tryggvason |
Theobald, B (2012) Relating Objective and Subjective Performance Measures for AAM-Based Visual Speech Synthesis. IEEE Transactions on Audio, Speech and Language Processing. ISSN 1558-7916
Davis, L, Theobald, BJ and Bagnall, A (2012) Automated Bone Age Assessment Using Feature Extraction. Lecture Notes in Computer Science, 7435. pp. 43-51. ISSN 0302-9743
Davis, LM, Theobald, BJ, Lines, J, Toms, A and Bagnall, A (2012) On the segmentation and classification of hand radiographs. International Journal of Neural Systems, 22 (05). pp. 1250020-1250036. ISSN 0129-0657
Theobald, B and Matthews, I (2012) Relating Objective and Subjective Performance Measures for AAM-based Visual Speech Synthesizers. IEEE Transactions on Audio, Speech and Language Processing, 20 (8). p. 2378.
Boker, S, Cohn, J, Theobald, B, Matthews, I, Mangini, M, Spies, J and Ambadar, Z (2011) Something in the Way We Move: Motion, not Perceived Sex, Influences Nods in Conversation. Journal of Experimental Psychology: Human Perception and Performance, 37 (3). pp. 874-891.
Lai, S, Liu, Y, Zhang, M and Theobald, B (2011) REBoost: Probabilistic Resampling for Boosted Pedestrian Detection. Journal of Optical Engineering, 50 (12). p. 127203.
Boker, Steven M., Cohn, Jeffrey F., Theobald, Barry-John, Matthews, Iain, Brick, Timothy R. and Spies, Jeffrey R. (2009) Effects of damping head movement and facial expression in dyadic conversation using real-time facial expression tracking and synthesized avatars. Phil. Trans. R. Soc. B. pp. 3485-3495.
Fagel, Sascha, Bailly, Gérard and Theobald, Barry (2009) Animating Virtual Speakers or Singers from Audio: Lip-Synching Facial Animation (Editorial). EURASIP Journal on Audio, Speech, and Music Processing, 2009. ISSN 1687-4714
Theobald, Barry-John, Matthews, Iain, Mangini, Michael, Spies, Jeffrey R., Brick, Timothy R., Cohn, Jeffrey F. and Boker, Steven M. (2009) Mapping and manipulating facial expression. Journal of Language and Speech, 52 (2-3). pp. 369-386. ISSN 0023-8309
Theobald, B, Bangham, JA, Matthews, I and Cawley, GC (2004) Near-videorealistic synthetic talking faces: Implementation and evaluation. Speech Communication, 44 (1-4). pp. 127-140. ISSN 0167-6393
Theobald, BJ, Kruse, SM, Bangham, JA and Cawley, GC (2003) Towards a low bandwidth talking face using appearance models. Image and Vision Computing, 21 (13-14). pp. 1117-1124. ISSN 0262-8856
Theobald, Barry, Cox, Stephen, Cawley, Gavin and Milner, Ben (1999) Fast Method of Channel Equalisation for Speech Signals and its Implementation on a DSP. IEE Electronics Letters, 35 (16). pp. 1309-1311. ISSN 0013-5194
Davis, L, Theobald, B-J, Toms, A and Bagnall, A (2011) On the Extraction and Classification of Hand Outlines. In: Proceedings of the 12th International Conference on Intelligent Data Engineering and Automated Learning. Springer, pp. 92-99.
Theobald, Barry and Cohn, J. F. (2009) Facial image synthesis. In: Oxford Companion to affective sciences: An encyclopaedic dictionary for the affective sciences. NY: Oxford University Press, pp. 176-179.
Theobald, B, Bangham, JA, Kruse, S, Cawley, GC and Matthews, I (2001) Towards videorealistic synthetic visual speech. In: Uncertainty in Geometric Computations. The Kluwer International Series in Engineering and Computer Science, 704. Kluwer Academic Publishers, pp. 175-184. ISBN 978-0-7923-7309-4
Hilder, S. and Theobald, B. (2010) In pursuit of visemes. In: International Conference on Auditory-visual Speech Processing, 2011-01-01, Volterra.
Lan, Y., Theobald, B., Harvey, R. and Ong, E. (2010) Improving visual features for lip-reading. In: International Conference on Auditory-visual Speech Processing, 2011-01-01, Volterra.
Newman, Jacob, Theobald, Barry and Cox, Stephen (2010) Limitations of Visual Speech Recognition. In: Proceedings of the International Conference on Auditory-Visual Speech Processing, 2010-01-01, Hakone, Kanagawa.
Brick, T., Spies, J., Theobald, B. and Matthews, I. (2009) High-presence, low-bandwidth, apparent 3D video-conferencing with a single camera. In: Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), 2009-01-01.
Lan, Yuxuan, Harvey, Richard, Theobald, Barry-John and Ong, Eng-Jon (2009) Comparing Visual Features for Lipreading. In: Proc. of International Conference on Auditory-visual Speech Processing, 2009-01-01, Norwich.
Ong, Eng-Jon, Lan, Yuxuan, Theobald, Barry and Harvey, Richard (2009) Robust Facial Feature Tracking using Multiscale Biased Linear Predictors. In: International Conference on Computer Vision, 2009, Kyoto, 2009-01-01.
Cox, Stephen, Harvey, Richard, Lan, Yuxuan, Newman, Jacob and Theobald, Barry (2008) The Challenge of Multispeaker Lip-Reading. In: International Conference on Auditory-Visual Speech Processing, 2008-09-26 - 2008-09-29, Queensland.
Boker, S., Cohn, J.F. and Theobald, B. (2008) Dissociating facial appearance and dynamics in real time during natural conversation. In: COSYNE Workshop, 2008-01-01.
Theobald, B, Cawley, G, Bangham, A and Matthews, I (2008) Comparing text-driven and speech-driven visual speech synthesisers. In: INTERSPEECH, 2011-01-01, Italy.
Theobald, B. and Wilkinson, N. (2008) On evaluating synthesised visual speech. In: Proceedings of the International Conference on Auditory-visual Speech Processing, 2008-01-01.
Theobald, Barry-John, Fagel, Sascha, Bailly, Gérard and Elisei, Frédéric (2008) LIPS2008: Visual speech synthesis challenge. In: Proceedings of Interspeech 2008, 2008-01-01, Brisbane.
Theobald, Barry-John and Wilkinson, Nicholas (2008) A probabilistic trajectory synthesis system for synthesising visual speech. In: Proceedings of Interspeech, pp. 2310-2313.
Theobald, Barry-John (2007) Audiovisual Speech Synthesis. In: Proceedings of the International Congress on Phonetic Sciences, 2007-08-06 - 2007-08-10, Saarbrücken.
Theobald, B., Matthews, I., Wilkinson, N., Cohn, J. and Boker, S. (2007) Animating Faces Using Appearance Models. In: Proceedings of the Workshop on Vision, Video and Graphics, 2007-09-14, Warwick University.
Theobald, Barry-John, Matthews, Iain A., Cohn, Jeffrey F. and Boker, Steven M. (2007) Real-time expression cloning using appearance models. In: Proceedings of the 9th international conference on Multimodal interfaces, 2007-11-12 - 2007-11-15, Nagoya, Aichi.
Theobald, Barry-John and Wilkinson, Nicholas (2007) A Real-Time Speech-Driven Talking Head using Active Appearance Models. In: Proceedings of the International Conference on Audio-visual Speech Processing (AVSP), 2007-08-31 - 2007-09-03, Kasteel Groenendaal, Hilvarenbeek, The Netherlands.
Theobald, Barry, Harvey, Richard, Cox, Stephen, Lewis, Colin and Owens, Gari (2006) Lip-reading enhancement for law enforcement. In: Proceedings of SPIE 6402 Conference on Optics and Photonics for Counterterrorism and Crime Fighting, 2006-09-11, Stockholm.
Theobald, B., Matthews, I. and Baker, S. (2006) Evaluating Error Functions for Robust Active Appearance Models. In: Proceedings of the International Conference on Automatic Face and Gesture Recognition, 2006-04-02 - 2006-04-06, Southampton.
Glauert, J. R. W., Kennaway, J. R., Elliott, R. and Theobald, B. J. (2004) Virtual Human Signing as Expressive Animation. In: Symposium on Language, Speech and Gesture for Expressive Characters, 2004-03-29 - 2004-04-01, University of Leeds, Leeds.
Theobald, B-J, Bangham, JA, Matthews, I and Cawley, GC (2003) Evaluating talking heads based on appearance models. In: Proceedings of the International Conference on Auditory-Visual Speech Processing (AVSP-2003), 2003-09-04 - 2003-09-07, St. Jorioz.
Theobald, B, Cawley, GC, Glauert, JRW, Abider, JA and Matthews, I (2003) 2.5D Visual Speech Synthesis Using Appearance Models. In: British Machine Vision Conference, 2005-09-05 - 2005-09-08, Oxford Brookes University, Oxford.
Theobald, BJ, Cawley, GC, Matthews, I and Bangham, JA (2003) Near-videorealistic synthetic visual speech using non-rigid appearance models. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '03), 2003-04-06 - 2003-04-10, Hong Kong.
Theobald, B, Bangham, JA, Matthews, I and Cawley, GC (2002) Towards video realistic synthetic visual speech. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2002), 2002-05-13 - 2002-05-17, Orlando, Florida.
Theobald, B, Bangham, JA, Matthews, I and Cawley, GC (2001) Visual speech synthesis using statistical models of shape and appearance. In: International Conference on Auditory-Visual Speech Processing (AVSP-2001), 2001-09-07 - 2001-09-09, Aalborg.