Subject
Statistics and Mathematics for NLP
General details of the subject
- Mode
- Face-to-face degree course
- Language
- English
Description and contextualization of the subject
Introduction to basic concepts of statistics in the field of natural language processing. Inferential aspects will be worked on, including the most common statistical tests, as well as interval estimation. Furthermore, basic concepts of functions and linear algebra will be introduced for Natural Language Processing tasks with the final aim of gaining understanding of the rationale behind neural networks.Teaching staff
Name | Institution | Category | Doctor | Teaching profile | Area | |
---|---|---|---|---|---|---|
EZEIZA RAMOS, NEREA | University of the Basque Country | Profesorado Agregado | Doctor | Bilingual | Computer Languages and Systems | n.ezeiza@ehu.eus |
IRIGOYEN GARBIZU, ITZIAR | University of the Basque Country | Profesorado Agregado | Doctor | Bilingual | Science of Computation and Artificial Intelligence | itziar.irigoien@ehu.eus |
Competencies
Name | Weight |
---|---|
Capacidad de comprender y aplicar los test estadísticos adecuados según los objetivos planteados, dado un conjunto de datos. | 35.0 % |
Capacidad para comprender lenguaje matemático básico. | 25.0 % |
Capacidad de comprender la intuición subyacente en los espacios vectoriales y de realizar cálculos básicos. | 40.0 % |
Study types
Type | Face-to-face hours | Non face-to-face hours | Total hours |
---|---|---|---|
Lecture-based | 10 | 15 | 25 |
Applied computer-based groups | 20 | 30 | 50 |
Training activities
Name | Hours | Percentage of classroom teaching |
---|---|---|
Computer work practice, laboratory, site visits, field trips, external visits | 50.0 | 40 % |
Lectures | 25.0 | 40 % |
Assessment systems
Name | Minimum weighting | Maximum weighting |
---|---|---|
Attendance and participation | 10.0 % | 10.0 % |
Practical tasks | 30.0 % | 60.0 % |
Written examination | 30.0 % | 60.0 % |
Learning outcomes of the subject
Pose the correct hypotheses according to the objectives and characteristics of the data to perform statistical tests.Interpret the results of a statistical test and complement it with estimation by intervals.
Know the definition of what a function is. Know the concept of derivative. Perform basic calculations of linear algebra.
Learn to use specific software to perform calculations related to natural language processing tasks.
Ordinary call: orientations and renunciation
Final exam and practical tasks (projects) to assess the subject.Extraordinary call: orientations and renunciation
Final exam and practical tasks (projects) to assess the subject.Temary
1. Introduction to hypothesis testing: independence test, Mc Nemar test2. Real function. Concept of derivative
3. Matrix calculus
Bibliography
Basic bibliography
R.H. Baayen (2008). Analyzing Linguistic Data. A Practical Introduction to Statistics using R. Cambridge University PressG. Strang (2019). Linear Algebra and Learning from Data. Cambridge University Press
A. Trask (2019). Grokking deep learning. Manning Publications Co.