Career History

In 1994 Ben moved to BT Laboratories in the speech recognition group, where he specialised in noise and channel robustness and front-end processing. In particular he worked on reducing error rates for services such as automated directory enquiries and speech recognition from payphones. More recent work at BT considered the effect of performing speech recognition over mobile and IP networks where problems of packet loss and codec distortion are addressed. He was also responsible for the handwriting component of BT's SmartQuill which features in the Guinness Book of Records as 'the world's most intelligent pen!'. He was also involved in work to automatically detect the internal rotting of telephone poles. He holds three patents in the areas of speech processing and handwriting recognition.

In March 2001 Ben returned to the School of Information Systems (now the School of Computing Sciences) as a Senior Lecturer.

Ben has also spent some time in oil and gas exploration working on side scan sonar and seabed profiling.

Academic Background

Ben Milner received a PhD in robust speech processing from the University of East Anglia in 1995. Prior to this he received a BEng in electronic engineering

All Publications

<- Page 1 of 5 ->

Le Cornu, T., Milner, B. P.

(2017)

Generating intelligible audio speech from visual speech,

in IEEE Transactions on Audio, Speech, and Language Processing

25

(9)

pp. 1447-1457

Full Text UEA Repository

(Article)

(Published)


Harding, P., Milner, B.

(2017)

Estimating acoustic speech features in low signal-to-noise ratios using a statistical framework,

in Computer Speech and Language

42

pp. 1–19

Full Text UEA Repository

(Article)

(Published)


Thangthai, A., Milner, B., Taylor, S.

(2016)

Visual speech synthesis using dynamic visemes, contextual features and DNNs,

in Proceedings of the Interspeech Conference 2016.

International Speech Communication Association

pp. 2458-2462

Full Text

(Conference contribution)

(Published)


Taylor, S., Kato, A., Milner, B., Matthews, I.

(2016)

Audio-to-Visual Speech Conversion using Deep Neural Networks,

in Proceedings of the Interspeech Conference 2016.

International Speech Communication Association

pp. 1482-1486

Full Text

(Conference contribution)

(Published)


Kato, A., Milner, B.

(2016)

HMM-Based Speech Enhancement Using Sub-Word Models and Noise Adaptation,

in Proceedings of the Interspeech Conference 2016.

International Speech Communication Association

pp. 3748-3752

Full Text

(Conference contribution)

(Published)


Harding, P., Milner, B.

(2015)

Reconstruction-based speech enhancement from robust acoustic features,

in Speech Communication

75

pp. 62-75

Full Text UEA Repository

(Article)

(Published)


Khan, F., Milner, B.

(2015)

Using audio and visual information for single channel speaker separation,

UEA Repository

(Paper)

(Published)


Websdale, D., Le Cornu, T., Milner, B.

(2015)

Objective measures for predicting the intelligibility of spectrally smoothed speech with artificial excitation,

UEA Repository

(Paper)

(Published)


Milner, B., Le Cornu, T.

(2015)

Reconstructing intelligible audio speech from visual speech features,

UEA Repository

(Paper)

(Published)


Le Cornu, T., Milner, B.

(2015)

Voicing classification of visual speech using convolutional neural networks,

UEA Repository

(Paper)

(Published)


Milner, B., Websdale, D.

(2015)

Analysing the importance of different visual feature coefficients,

UEA Repository

(Paper)

(Published)


Almajai, I., Milner, B.

(2011)

Visually Derived Wiener Filters for Speech Enhancement,

in IEEE Transactions on Audio, Speech, and Language Processing

19

(6)

pp. 1642-1651

Full Text UEA Repository

(Article)

(Published)


Harding, P., Milner, B.

(2011)

Speech enhancement by reconstruction from cleaned acoustic features,

UEA Repository

(Paper)

(Published)


Milner, B.

(2011)

Maximum a posteriori Estimation of Noise from Non-Acoustic Reference Signals in Very Low Signal-to-Noise Ratio Environments,

UEA Repository

(Paper)

(Published)


Pawi, S., Vaseghi, B., Milner, B., Ghorsi, S.

(2011)

Fundamental Frequency Estimation Using Modified Higher Order Moments and Multiple Windows,

UEA Repository

(Paper)

(Published)


Milner, B., Darch, J.

(2010)

Robust Acoustic Speech Feature Prediction from Noisy Mel-Frequency Cepstral Coefficients,

in IEEE Transactions on Audio, Speech, and Language Processing

19

(2)

pp. 338-347

Full Text UEA Repository

(Article)

(Published)


Pawi, A., Vaseghi, S. V., Milner, B. P.

(2010)

Pitch extraction using modified higher order moments,

Full Text UEA Repository

(Paper)

(Published)


Almajai, I., Milner, B.

(2009)

Effective visually-derived Wiener filtering for audio-visual speech processing,

UEA Repository

(Paper)

(Published)


Almajai, I., Milner, B.

(2009)

Enhancing Audio Speech using Visual Speech Features,

UEA Repository

(Paper)

(Published)


Milner, B., Darch, J., Almajai, I.

(2009)

Reconstructing clean speech from noisy MFCC vectors,

UEA Repository

(Paper)

(Published)


<- Page 1 of 5 ->

Key Research Interests

Ben Milner is part of the Speech, Language and Audio Processing

Ben's research interests continue to be in the area of robust speech processing. In particular considering the effects of mobile and IP networks on speech quality and recognition performance.

Follow this link for current PhD opportunities in Computing Sciences. But feel free to email me to discuss projects outside these areas and alternative sources of funding.

Selected Publications

Milner, B. P. and Shao, X., Prediction of Fundamental Frequency and Voicing from Mel-Frequency Cepstral Coefficients for Unconstrained Speech Reconstruction. IEEE Transactions on Audio, Speech and Language Processing, Volume 15, Issue 1, Page(s) 24-33, 2007.

Darch, J., Milner, B.P., and Saeed Vaseghi, MAP Prediction of Formant Frequencies and Voicing Class from MFCC Vectors in Noise. Speech Communication, Volume 48, Issue 11, Page(s) 1556-1572, 2006.

James, A. B. and Milner, B. P., Towards improving the robustness of distributed speech recognition in packet loss. Journal of Speech Communication, Volume 48, Issue 11, Page(s) 1402-1421, 2006.

Milner, B. P. and James, A. B., Robust speech recognition over mobile and IP networks in burst-like packet loss. IEEE Trans. Audio, Speech and Language Processing, Volume 14, Issue 1, Page(s) 223-231, 2006.

 

External Activities and Indicators of Esteem

  • Guest editor: special edition of Speech Communication and Robustness Issues for Conversation Interaction, 2006
  • Chairman: ISCA Tutorial and Research Workshop (ITRW) Robust2004-Robustness Issues for Conversation Interaction-UEA, 2004

Key Responsibilities

Associate Dean for Learning and Teaching, Faculty of Science