Module
CMPE3I07 - SOUND AND IMAGE II
- Module Code:
- CMPE3I07
- Department:
- Computing Sciences
- Credit Value:
- 20
- Level:
- 3
- Organiser:
- Dr. Barry Theobald
In lectures, handouts will be distributed for material that is difficult or lengthy to copy from the board e.g. derivations of formulae. These handouts will be available on Blackboard. However, the handouts are not comprehensive, and students are expected to make their own notes from lecturers' notes on the board. In workshops, students will be expected to tackle problems individually but with help available from the seminar leader and one other teacher. For some workshops, the class will read sections of a textbook beforehand and then analyse the material in the workshop. Laboratory work (MATLAB programming) will take place during time-tabled laboratory periods using networked personal computers. CMP teaching laboratories running MATLAB are available to CMP students during term time outside time-tabled teaching hours.
Required purchases
- W. and J. Holmes, Speech Synthesis and Recognition,CRC
- R. Gonzalez, and R. Woods, Digital Image Processing Prentice Hall
Possible alternative purchases:
Gonzalez, R. and Woods,R., Digital Image Processing Prentice Hall,
Submission:
Written coursework should be submitted by following the standard CMP practice. Students are advised to refer to the Guidelines and Hints on Written Work in CMP.
Deadlines:
If coursework is handed in after the deadline day or an agreed extension:
| Work submitted | Marks deducted |
| After 15:00 on the due date and before 15:00 on the day following the due date | 10 marks |
| After 15:00 on the second day after the due date and before 15:00 on the third day after the due date | 20 marks |
| After 15:00 on the third day after the due date and before 15:00 on the 20th day after the due date. | All the marks the work merits if submitted on time (ie no marks awarded) |
| After 20 working days | Work will not be marked and a mark of zero will be entered |
Saturdays and Sundays will NOT be taken into account for the purposes of calculation of marks deducted.
All extension requests will be managed through the LTS Hub. A request for an extension to a deadline for the submission of work for assessment should be submitted by the student to the appropriate Learning and Teaching Service Hub, prior to the deadline, on a University Extension Request Form accompanied by appropriate evidence. Extension requests will be considered by the appropriate Learning and Teaching Service Manager in those instances where (a) acceptable extenuating circumstances exist and (b) the request is submitted before the deadline. All other cases will be considered by a Coursework Coordinator in CMP.
For more details, including how to apply for an extension due to extenuating circumstances download Submission for Work Assessment (PDF, 39KB)
Plagiarism:
Plagiarism is the copying or close paraphrasing of published or unpublished work, including the work of another student; without due acknowledgement. Plagiarism is regarded a serious offence by the University, and all cases will be investigated. Possible consequences of plagiarism include deduction of marks and disciplinary action, as detailed by UEA's Policy on Plagiarism and Collusion.
Module specific:
- Understanding of the effects of a sampled representation of a continuous signal
- Knowledge of the processes involved in speech production and the fundamentals of articulatory and acoustic phonetics
- Understanding and analysis of the source-filter model of speech production and how it is applied in speech coding
- An overview of the processes involved in speech recognition, appreciation of the problems that underlie each and an appreciation of how stochastic modelling can be used
- Appreciation of the different approaches to speech synthesis and the advantages and disadvantages of each
- Understanding of signal representation in the frequency domain
- Understanding of frequency domain analysis for multi-dimensional signal
- Understanding of image filtering techniques
- Understanding of the differences between enhancement methods and restoration methods
- Understanding of frequency domain techniques for image restoration
Transferable skills:
- Enhanced MATLAB programming skills
- Skills in the manipulation of sound and image files
- Enhanced problem-solving skills
- Increased knowledge and appreciation of speech and language
Subject specific:
- In-depth understanding of DSP and its application
- Ability to design discrete-time filters and process signals with them
- Ability to use DSP techniques to process audio and video signals for use in audio and video coding and recognition systems
- Understanding of the important areas in image and speech technology and of the research issues in them
Total hours: 49
Lectures: 22, hours: 1, Content (with provisional weekly schedule)
- Introduction to module; Introduction to the speech signal
- Source/filter model of speech production I
- Source/filter model of speech production II
- Speech recognition: the front end
- Speech recognition: stochastic methods and search methods
- Speech recognition; acoustic and language modelling
- Speech synthesis I
- Speech synthesis II
- Speech dialogue systems
- Speech Coding
- Vectors and Review of Complex Numbers
- Complex Exponential Representation of Signals
- Complex Fourier Series
- Introduction to the DFT
- Frequency Domain Filtering
- Frequency Domain Filter Design
- Image Restoration
- Wiener Filtering
- Audiovisual Speech
- Audiovisual Speech Synthesis
- Revision I
- Revision II
Workshops: 10, hours: 15, Content (with provisional weekly schedule)
- Assignment I briefing
- Speech coding
- Speech recognition I
- Speech recognition II
- Speech synthesis
- Complex Exponentials
- Discrete Fourier Transforms
- Assignment II Briefing
- Frequency Domain Filtering
- Filter Design
Laboratory Work: 12, hours: 12, Content (with provisional weekly schedule)
- Front-end speech processing
- Speech Coding
- Vowel recognition
- Image processing 1
- Introduction to the DFT
- Image Restoration
Coursework


