Contenido de XSL

Técnicas Avanzadas de Inteligencia Artificial

Centro: Facultad de Informática
Titulación: Grado en Ingeniería Informática
Curso académico: 2024/25
Curso: 4
Nº Créditos: 6
Idiomas: Inglés

Distribución de horas por tipo de enseñanza
Tipo de docencia	Horas de docencia presencial	Horas de actividad no presencial del alumno/a
Magistral	40	50
P. Laboratorio	20	40

ObjetivosAlternar navegación

This course provides the basic concepts of reinforcement learning. It gives students a detailed understanding of various topics: Markov Decision Processes, sample-based learning algorithms and deep reinforcement learning.

TemarioAlternar navegación

Topic 1 Introduction to the course

Topic 2 Introduction to Reinforcement Learning: definition of basic concepts such as Markov Decision Proccess and Value Functions

Topic 3 Dynamic Programming: methods to solve the problem when the model is known: policy iteration and value iteration methods

Topic 4 Monte Carlo Methods: methods to solve the problem learning from simulated experiences.

Topic 5 Temporal-Difference Learning: combination of Dynamic Programming and Monte-Carlo: SARSA, Q-Learning and variants

Topic 6 Deep Reinforcement Learning: Function approximation, Batch Learning, Deep Q-Network and Rainbow (combination of several improvements in Deep Reinforcement Learning)

MetodologíaAlternar navegación

Master classes, seminars, laboratories, assignments, practices and presentations.

The skills and competences demonstrated in all aspects of the subject make up your note: active participation, tasks, practice, presentations, etc.

Sistemas de evaluaciónAlternar navegación

The subject has two possible evaluation modes: final and continuous.

The continuous evaluation is the preferred mode. It establishes a set of activities that allows assessing the progress of each student throughout the course. Thus, the continuous evaluation is offered by default to students who should deliver the assignments of the subject in the established framework including assistance, presentations and face-to-face activities.

Students can also be evaluated through the final evaluation mode. In this case, the students on established dates (when reaching around 60% and 80% of the course) must submit to the teaching supervisors a formal resignation to the continuous evaluation. Then, the teaching supervisors will assign a mandatory practical work and a date for an oral presentation prior to the date indicated for ordinary and extraordinary examination.

The weight of the different aspects to consider in the two alternative forms of evaluation is presented below.

Continuous Evaluation

• 3 obligatory assignments (100%), 40% of the mark must be obtained in each one in order to pass subject

◦ Individual Multiple Choice Exam: 40%

◦ Oral presentation in group (3-4 people) about a Reinforcement Learning applied paper: 30%

◦ Practical work in group (3-4 people): 30%

Final Evaluation

Delivery of mandatory practical work and oral presentation prior to the written exam on the date indicated for the ordinary and extraordinary examination: 100%

Materiales de uso obligatorioAlternar navegación

• eGela
• Google Colab

BibliografíaAlternar navegación

Bibliografía básica

Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction 2nd Edition, 2018

Maxim Lapan. Deep Reinforcement Learning Hands-on. Packt Publishing Ltd., 2nd edition, 2020.

Revistas

Artificial Intelligence. Elsevier Science.

Calendario
Semanas	Lunes	Martes	Miércoles	Jueves	Viernes
1-15	14:00-15:30			15:30-17:00

Profesorado

MENDIALDUA BEITIA, IÑIGO

Calendario
Semanas	Lunes	Martes	Miércoles	Jueves	Viernes
1-15			17:00-18:30

Profesorado

MENDIALDUA BEITIA, IÑIGO

Calendario
Semanas	Lunes	Martes	Miércoles	Jueves	Viernes
1-15			14:00-15:30

Profesorado

MENDIALDUA BEITIA, IÑIGO

Fecha de última modificación: 2024-05-16

Departamento de Lenguajes y Sistemas Informáticos

Barra de búsqueda

Contenido de XSL

Técnicas Avanzadas de Inteligencia Artificial

DocenciaAlternar navegación

Guía docenteAlternar navegación

ObjetivosAlternar navegación

TemarioAlternar navegación

MetodologíaAlternar navegación

Sistemas de evaluaciónAlternar navegación

Materiales de uso obligatorioAlternar navegación

BibliografíaAlternar navegación

Bibliografía básica

Revistas

GruposAlternar navegación

66 Teórico (Inglés - Tarde)Mostrar/ocultar subpáginas

Profesorado

66 P. Laboratorio-1 (Inglés - Tarde)Mostrar/ocultar subpáginas

Profesorado

66 P. Laboratorio-2 (Inglés - Tarde)Mostrar/ocultar subpáginas

Profesorado