Subject
Speech Processing
General details of the subject
- Mode
- Face-to-face degree course
- Language
- English
Description and contextualization of the subject
The objective of this subject is that the students become familiar with the speech signal and with the basic techniques used for its analysis and processing. For this, the fundamentals of human speech production and its perception by the human ear are described, to continue with the definition of the basic techniques of voice signal processing. The course includes numerous exercises and practices for students to work with these concepts.Teaching staff
Name | Institution | Category | Doctor | Teaching profile | Area | |
---|---|---|---|---|---|---|
HERNAEZ RIOJA, INMACULADA CONCEPCION | University of the Basque Country | Profesorado Catedratico De Universidad | Doctor | Not bilingual | Theory of Signals and Communications | inma.hernaez@ehu.eus |
Competencies
Name | Weight |
---|---|
Conocer e interpretar los procesos de generación y percepción de la voz humana. | 25.0 % |
Analizar y explicar el análisis temporal y espectral de señales de voz. | 25.0 % |
Manejar las herramientas informáticas básicas para el procesado y tratamiento de la señal de voz. | 25.0 % |
Comprender e interpretar los principales parámetros utilizados en la representación de la señal de voz. | 25.0 % |
Study types
Type | Face-to-face hours | Non face-to-face hours | Total hours |
---|---|---|---|
Lecture-based | 15 | 22.5 | 37.5 |
Applied computer-based groups | 30 | 45 | 75 |
Training activities
Name | Hours | Percentage of classroom teaching |
---|---|---|
Computer work practice, laboratory, site visits, field trips, external visits | 37.5 | 40 % |
Individual work and/or group work | 75.0 | 40 % |
Assessment systems
Name | Minimum weighting | Maximum weighting |
---|---|---|
Internship Report/Summary | 30.0 % | 60.0 % |
Multiple-choice examination | 40.0 % | 70.0 % |
Learning outcomes of the subject
RA1 Describe the perception generation processes of the voice signal.RA2 Demonstrate knowledge about the temporal and frequency representation of the voice signal and about the techniques used for its analysis.
RA3 Measure and interpret the main parameters that characterize the voice signal.
RA4 Handle voice signal analysis and processing tools.
Ordinary call: orientations and renunciation
The evaluation system is divided into two independent blocks: theory and laboratory practices.The theoretical part is evaluated by means of an individual test written from short questions and multiple choice. (fifty%)
The part of laboratory practices is evaluated through the memories and the work carried out in the laboratory. (fifty%)
In the ordinary call, both parts (master class and laboratory) must be passed with a minimum grade of 3 out of 10 in each. To pass the subject, a final grade of more than 5 out of 10 must be obtained once all the grades have been added and averaged. Each evaluation block is independent and once a block has been passed, the mark can be saved for the extraordinary call.
RESIGNATION:
People who cannot follow the mixed evaluation must document their causes to the teaching team, following the procedure established in current regulations. They will be able to accredit the achievement of the learning results of the subject through a final evaluation that will consist of: a written test and a laboratory test, with the same minimum conditions as in the continuous evaluation.
If a person does not take the final written test, it will be understood that that person has renounced said call.
Extraordinary call: orientations and renunciation
In the extraordinary call there will be two tests: a lecture block test and a laboratory block test. Each test will count for 50% of the final grade and both must be passed with a minimum grade of 5 out of 10.Those students who have not presented a written document justifying their non-participation in the mixed evaluation must certify the proper completion of the practices.
Temary
1. Speech production and perception2. Basic concepts about signals
3. Basic concepts about systems
4. The source filter model
5. Basic representations of the speech signal
Bibliography
Compulsory materials
Documentation provided via eGela, both for master classes and for laboratory practicesBasic bibliography
A classical: Fant: Acoustic Theory of Speech Production (1960)“The Production and Perception of Speech” Mark Tatham Katherine Morton (1997) -several versions and editions
Introduction to Digital Speech Processing, Lawrence R. Rabiner, Ronald W. Schafer
Now Publishers Inc, 2007
In-depth bibliography
Speech perception: R. Munkongand B. H. Juang, “Auditory perception and cognition” inIEEE Signal Processing Magazine, vol. 25, no. 3, pp. 98-117, May 2008.doi: 10.1109/MSP.2008.918418http://auditoryneuroscience.comfor a general understanding of sound and brain.
L. R. Rabiner and R. W. Schafer Theory and Applications of Digital Speech Processing, Prentice-Hall, 2011.
X. Huang, A. Acero, H. Hon. Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice Hall, 2001. (ISBN: 978-0130226167)
L. R. Rabiner, R. W. Schafer. Digital processing of speech signal. Prentice-Hall, 1978. (ISBN: 978-0132136037)
J. G. Proakis, D. G. Manolakis. Digital signal processing. Principles, algorithms and applications (4th edition). Pearson Prentice Hall, 2007. (ISBN: 978-0131873742)
D. Jurafsky, J. H. Martin. Speech and Language Processing (2nd edition). Prentice Hall, 2008. (ISBN: 978-0131873216)
Journals
Computer Speech and LanguageSpeech Communication
IEEE Transactions on Audio Speech and Language Processing
IEEE Transactions on Multimedia