gabriel – Page 2 – Gabriel Turinici

Statistical Learning, M1 Math 2024+

janvier 6, 2025janvier 15, 2025gabriel

Instructor: Gabriel TURINICI

Preamble: this course is just but an introduction, in a limited amount of time, to Statistical and Machine learning. This will prepare for the next year’s courses (some of them on my www page cf. « Deep Learning » and « Reinforcement Learning »).

1/ Introduction to statistical learning : supervised, non-supervised and reinforcement learning, general learning procedure, model evaluation, under and overfitting

2/ K-nearest neighbors and the « curse of the dimensionality »

3/ Regression in high dimensions, variable selection and model regularization (ridge, lasso)

4/ Stochastic gradient descent, mini-batch

5/ Neural networks: introduction, operator, datasets, training, examples, implementations

6/ K-means clustering

Main document for the theoretical presentations: (no distribution autoried without WRITTEN consent from the author): see your « teams » group.

Exercices, implementations: see « teams » group.

Analyse numérique: évolution (M1 Math, Université Paris Dauphine – PSL, 2005-11, 2019-2025

janvier 5, 2025janvier 8, 2026gabriel

Responsable de cours: Gabriel TURINICI
Contenu:
1 Introduction
2 EDO
3 Calcul de dérivée et contrôle
4 EDS
Bibliographie: poly distribué

Documents de support de cours, autres documents

NOTA BENE: Tous des documents sont soumis au droit d’auteur, et ne peuvent pas être distribués sauf accord préalable ÉCRIT de l’auteur.

Supports de cours:

livre en anglais « Numerical simulations of time-dependent problems : applied to epidemiology, artificial intelligence and finance »
POLY en français (mis à jour 11/03/2024). Ne pas diffuser sauf accord ÉCRIT de l’auteur. Attention: seulement les cours en amphi font foi, le poly est juste pour aider.
poly annoté 2024/25 lors des séances de cours: cf. « teams/fichiers »
Notes manuscrites prises lors des séances de cours 2024/25: cf « teams/fichiers »

Implementations TP:

EDO: exo sur la précision, exo stabilité , SIR(EE+H+RK4), order for the EE/H schemes/SIR

SIR (version controle, adjoint / backward); (version 2023 here)

EDS version 2025 : implémenter :

1/ simulation brownien

2/ calcul intégrale de W par/rapport à W

3/ calcul par Euler-Maruyama pour équation d’Ornstein-Uhlenbeck

4/ calcul par Euler-Maruyama faible pour modèle log-normal (Black-Scholes)

Version anciennes

2022/23:

Autres notes manuscrites : séances de cours 2022/23 (document 1/2) , 2022/23(document 2/2) (version 2021/22 ici)
Cours du 07/03/2023 (version confinement 2021!!): video Youtube chapitre 3 (motivation calcul backward) : visioner ICI

2020/21:

Supports de cours: livre « Simulations numériques des problèmes dépendant du temps: appliquées à l’épidémiologie, l’intelligence artificielle et les finances«
Poly avec les annotations pendant le cours: ici.
Notes manuscrites 2020/21 prises lors des séances de cours.
Notes manuscrites séance de TD (groupe 1): séance 1: ICI, séance 2, séance 3, séance 4, séance 5, séance 6,séance 7 (EDS).
Séances de TP (groupe 1) : code d’exploration de la précision, (version courte ici), code stabilité Euler explicite, code SIR (début), code contrôle SIR, code génération brownien, code Euler-Maruyama et Monte Carlo,
Cours 1 du 19/1/2021: video Youtube chapitre 1 (motivation/EDO) : visioner ICI , ensuite revenir sur la séance Teams pour commentaires et questions.; video Youtube chapitre 1 (motivation/EDS) : visioner ICI , ensuite revenir sur la séance Teams pour commentaires et questions.; video Youtube chapitre 1 (motivation/backward) : visioner ICI , ensuite revenir sur la séance Teams pour commentaires et questions.
Cours 2: voir le poly annoté; une video Youtube partielle chapitre 2 (EDO/existence) : visioner ICI

« Convergence of a L2 regularized Policy Gradient Algorithm for the Multi Armed Bandit » at ICPR 2024

décembre 3, 2024décembre 5, 2024gabriel

This joint work with Stefana-Lucia ANITA has been presented at the at the 27th International Conference on Pattern Recognition (ICPR) 2024 held in Kolkata, India, Dec 1st through 5th 2024.

Talk materials:

Abstract : Although Multi Armed Bandit (MAB) on one hand and the policy gradient approach on the other hand are among the most used frameworks of Reinforcement Learning, the theoretical properties of the policy gradient algorithm used for MAB have not been given enough attention. We investigate in this work the convergence of such a procedure for the situation when a L2 regularization term is present jointly with the ‘softmax’ parametrization. We prove convergence under appropriate technical hypotheses and test numerically the procedure including situations beyond the theoretical setting. The tests show that a time dependent regularized procedure can improve over the canonical approach especially when the initial guess is far from the solution.

« Optimal time sampling in physics-informed neural networks » at ICPR 2024

décembre 3, 2024décembre 3, 2024gabriel

This talk has been presented at the at the 27th International Conference on Pattern Recognition (ICPR) 2024 held in Kolkata, India, Dec 1st through 5th 2024.

Talk materials:

Abtract : Physics-informed neural networks (PINN) is a extremely powerful paradigm used to solve equations encountered in scientific computing applications. An important part of the procedure is the minimization of the equation residual which includes, when the equation is time-dependent, a time sampling. It was argued in the literature that the sampling need not be uniform but should overweight initial time instants, but no rigorous explanation was provided for this choice. In the present work we take some prototypical examples and, under standard hypothesis concerning the neural network convergence, we show that the optimal time sampling follows a (truncated) exponential distribution. In particular we explain when is best to use uniform time sampling and when one should not. The findings are illustrated with numerical examples on linear equation, Burgers’ equation and the Lorenz system.

« Time density in PINNs (Physics-Informed Neural Networks) » presented at NANMAT nov 2024

novembre 6, 2024novembre 15, 2024gabriel

This is a talk presented at Nanmat conference held Nov 4-7 2024 at ICTP, Cluj.

Talk materials: the slides of the presentation. and here the Youtube video version.

General chair of the conference FAAI24 « Foundations and applications of artificial intelligence », Iasi, October 28-30, 2024

octobre 30, 2024novembre 6, 2024gabriel

General chair with C. Lefter and A. Zalinescu of the conference FAAI24 « Foundations and applications of artificial intelligence » Iasi Oct 28-30 2024. At the conference I also serve as tutorial presenter.

LLM and time series at the « 6th J.P. Morgan Global Machine Learning Conference », Paris, Oct 18th, 2024

octobre 18, 2024octobre 25, 2024gabriel

Invited joint talk « Using LLMs techniques for time series prediction » with Pierre Brugiere presented at the 6th JP Morgan Global Machine Learning conference held in Paris, Oct 18th 2024

Talk materials: slides(click here) and here a link to the associated paper.

Co-organizer of the « Numerical Analysis » sessions at « Le 16ème Colloque Franco-Roumain, Bucarest, Roumanie »

août 28, 2024gabriel

Co-organizer with E. Catinas of the « Numerical Analysis » sessions at « Le 16ème Colloque Franco-Roumain, Bucarest, Roumanie »

Conference flyer:

Interview with radio « France Culture » on the ethics of generative AI

juin 5, 2024juin 5, 2024gabriel

A short interview with Celine Loozen from ‘France Culture’ radio station within a radio program concerning AI and GAFAM ethics.

Link for the full radio broadcast

Interview with Celine Loozen : here (local version if necessary here)

Reinforcement Learning, M2 ISF App, 2021-2025

février 8, 2024mai 11, 2025gabriel

Instructor: Gabriel TURINICI

1/ Introduction to reinforcement learning
2/ Theoretical formalism: Markov decision processes (MDP), value function ( Belman and Hamilton- Jacobi – Bellman equations) etc.
3/ Common strategies, building from the example of « multi-armed bandit »
4/ Strategies in deep learning: Q-learning and DQN
5/ Strategies in deep learning: SARSA and variants
6/ Strategies in deep learning: Actor-Critic and variants
7/ During the course: various Python and gym/gymnasium implementations
8/ Perspectives.

Principal document for the theoretical presentations: (no distribution autoried without WRITTEN consent from the author)

Multi Armed Bandit codes (MAB) : play MAB, solve MAB , solve MAB v2., policy grad from chatGPT to correct., policy grad corrected.

Bellman iterations: code to correct here, solution code here

Gym: play Frozen Lake (v2023) (version 2022)

Q-Learning : with Frozen Lake, python version or notebook version

-play with gym/Atari-Breakout: python version or notebook version

Deep Q Learning (DQN) : Learn with gym/Atari-Breakout: notebook 2024 and its version with smaller NN and play with result

Policy gradients on Pong adapted from Karpathy, 2024 version (correct to get it working!) python or notebook

You can also load from HERE a converged version (rename as necessary) pg_pong_converged_turinici24

Notebook to use it: here (please send me yours if mean reward above 15!).

Projets : cf. Teams