Interview with radio « France Culture » on the ethics of generative AI

A short interview with Celine Loozen of the radio station ‘France Culture’, as part of a radio program on AI and the ethics of the GAFAM companies.

Link for the full radio broadcast

Interview with Celine Loozen: here (local version, if necessary, here)

Reinforcement Learning, M2 ISF App, 2021-2025

Instructor: Gabriel TURINICI


1/ Introduction to reinforcement learning
2/ Theoretical formalism: Markov decision processes (MDP), value function (Bellman and Hamilton-Jacobi-Bellman equations) etc.
3/ Common strategies, building on the « multi-armed bandit » example
4/ Strategies in deep learning: Q-learning and DQN
5/ Strategies in deep learning: SARSA and variants
6/ Strategies in deep learning: Actor-Critic and variants
7/ During the course: various Python and gym/gymnasium implementations
8/ Perspectives.


Principal document for the theoretical presentations: (no distribution authorized without WRITTEN consent from the author)

Multi-Armed Bandit (MAB) codes: play MAB, solve MAB, solve MAB v2, policy grad from ChatGPT to correct, policy grad corrected.
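For readers without access to the linked codes, here is a minimal sketch of one common MAB strategy, epsilon-greedy. It is not the course's code: the Gaussian reward model, the arm means, the function name `run_epsilon_greedy` and all parameter values are assumptions chosen for illustration.

```python
import random

def run_epsilon_greedy(true_means, n_steps=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy on a Gaussian bandit (hypothetical setup, unit variance)."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    total_reward = 0.0
    for _ in range(n_steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: uniform random arm
        else:
            # exploit: arm with the highest current estimate
            arm = max(range(n_arms), key=lambda a: estimates[a])
        reward = rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
        # incremental update of the sample mean
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward
    return estimates, counts, total_reward

# Assumed arm means, for illustration only.
estimates, counts, total_reward = run_epsilon_greedy([0.0, 0.5, 1.5])
best_arm = max(range(len(counts)), key=lambda a: counts[a])
print("estimated means:", [round(q, 2) for q in estimates])
print("pull counts:", counts, "most pulled arm:", best_arm)
```

The constant `epsilon` keeps exploring forever; a decaying schedule or an upper-confidence-bound rule are usual alternatives.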

Bellman iterations: code to correct here, solution code here
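As a rough illustration of what Bellman iterations compute (this is not the linked code), the sketch below runs value iteration on a hypothetical 4-state corridor: the agent moves left or right, the last state is terminal, and entering it pays a reward of 1. The environment, the discount factor `GAMMA = 0.9` and all names are assumptions.

```python
GAMMA = 0.9          # assumed discount factor
N_STATES = 4         # states 0..3; state 3 is terminal
ACTIONS = (-1, +1)   # move left or right

def step(state, action):
    """Deterministic transition: returns (next_state, reward)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward

def value_iteration(tol=1e-10, max_iter=1000):
    """Iterate V(s) <- max_a [ r(s,a) + GAMMA * V(s') ] until convergence."""
    values = [0.0] * N_STATES
    for _ in range(max_iter):
        new_values = values[:]
        for s in range(N_STATES - 1):  # the terminal state keeps value 0
            new_values[s] = max(
                r + GAMMA * values[s2]
                for s2, r in (step(s, a) for a in ACTIONS)
            )
        delta = max(abs(n - v) for n, v in zip(new_values, values))
        values = new_values
        if delta < tol:
            break
    return values

values = value_iteration()
print([round(v, 4) for v in values])  # → [0.81, 0.9, 1.0, 0.0]
```

Each state's value is the reward of 1 discounted by the number of steps left before the final move, which is why the values grow by a factor of 1/0.9 toward the goal.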

Gym: play Frozen Lake (v2023) (version 2022)

Q-Learning : with Frozen Lake, python version or notebook version
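The sketch below shows the tabular Q-learning update on a dependency-free stand-in for Frozen Lake; it is not the linked python or notebook version. It assumes the standard 4x4 map in its non-slippery form, reimplemented by hand so that no gym install is needed, and the hyperparameters (`alpha`, `gamma`, `epsilon`, number of episodes) are illustrative guesses.

```python
import random

# Standard 4x4 Frozen Lake layout: S start, F frozen, H hole, G goal.
GRID = ["SFFF", "FHFH", "FFFH", "HFFG"]
SIZE = 4
MOVES = [(0, -1), (1, 0), (0, 1), (-1, 0)]  # left, down, right, up

def step(state, action):
    """Non-slippery transition: returns (next_state, reward, done)."""
    row, col = divmod(state, SIZE)
    d_row, d_col = MOVES[action]
    row = min(max(row + d_row, 0), SIZE - 1)
    col = min(max(col + d_col, 0), SIZE - 1)
    cell = GRID[row][col]
    return row * SIZE + col, float(cell == "G"), cell in "HG"

def greedy(q_row, rng):
    """Argmax over actions with random tie-breaking."""
    best = max(q_row)
    return rng.choice([a for a, q in enumerate(q_row) if q == best])

def train(episodes=3000, alpha=0.5, gamma=0.95, epsilon=0.3, seed=0):
    rng = random.Random(seed)
    q = [[0.0] * 4 for _ in range(SIZE * SIZE)]
    for _ in range(episodes):
        state = 0
        for _t in range(100):  # cap on episode length
            if rng.random() < epsilon:
                action = rng.randrange(4)          # explore
            else:
                action = greedy(q[state], rng)     # exploit
            next_state, reward, done = step(state, action)
            # Q-learning target: bootstrap from the best next action
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q

q_table = train()
# Roll out the learned greedy policy from the start state.
state, path, done = 0, [0], False
rollout_rng = random.Random(1)
while not done and len(path) < 20:
    state, reward, done = step(state, greedy(q_table[state], rollout_rng))
    path.append(state)
print("greedy path:", path)
```

With the slippery dynamics that gym enables by default, the same update applies, but a smaller `alpha` and more episodes are typically needed.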

Play with gym/Atari-Breakout: python version or notebook version

Deep Q-Learning (DQN): learn with gym/Atari-Breakout: 2024 notebook, its version with a smaller NN, and play with the result

Policy gradients on Pong adapted from Karpathy, 2024 version (correct to get it working!) python or notebook

You can also load from HERE a converged version (rename as necessary) pg_pong_converged_turinici24

Notebook to use it: here (please send me yours if its mean reward is above 15!).

Projects: cf. Teams



« Reinforcement learning in finance: online portfolio allocation and policy gradient approaches, the Onflow algorithm », NANMATH nov 2023

This is a talk presented at the NANMATH conference held Nov 6-9 2023 at ICTP, Cluj.

Talk materials (click to open or download): the Slides of the presentation, the ArXiv preprint and the Youtube VIDEO.

« Reinforcement learning in finance: portfolio allocation, value functions and policy gradients flows », ACDSDE conference sept 2023

This is a talk presented at the ACDSDE conference held Sept 28-30 2023 at the Romanian Academy (Iasi station), Octav Mayer Institute of Mathematics.

Talk materials: the slides of the presentation.

Generative AI and society, M1 management, Université Paris Dauphine – PSL, Sept 2023

Lecture « Generative AI and society », given on Sept 21, 2023 to the 2023/24 M1 class, Université Paris Dauphine – PSL, coordinator Aurore HAAS.

MAIN DOCUMENT: the slides in pdf, follow the link.

Other documents:

  • for the 2 videos, see the links in the pdf document, or go directly to the 2 YouTube links: video 1, video 2
  • for the GIF, use this link.

« Reduced representation of non-linear manifolds », at « The Tenth Congress of the Romanian Mathematicians », Pitesti, June 30th - July 5th, 2023

Invited joint talk « Reduced representations of non linear manifolds: from reduced basis to (conditional) vector quantization of measures » at the Tenth Congress of the Romanian Mathematicians, held in Pitesti, Romania, June 30 to July 5, 2023.

Talk materials: slides.