Statistique

Description du programme

La statistique concerne l’élaboration et l’emploi de méthodes mathématiques et informatiques pour la collecte, l’analyse et l’interprétation de données visant à soutenir la recherche scientifique, la prise de décision éclairée et la gestion des risques. Elle fait appel à un large éventail d’outils, allant de la théorie des probabilités aux techniques de calcul intensif sur ordinateur. Parmi les principaux domaines de recherche des statisticiens du réseau de l’ISM, notons

la modélisation de la dépendance et l’analyse multivariée
la modélisation des données de grande dimension
l’analyse de survie
l’analyse de valeurs extrêmes
l’apprentissage machine
l’apprentissage statistique
la statistique directionnelle
la statistique non paramétrique
la théorie des processus empiriques
le calcul statistique
les séries chronologiques
les techniques d’enquête
l’inférence bayésienne et méthodes MCMC
l’inférence causale

La recherche statistique est motivée en grande partie par des collaborations interdisciplinaires. Elle trouve des applications dans de nombreux domaines tels la biologie, les sciences de l’environnement, la finance et l’assurance, les sciences de la santé, l’hydrologie, le marketing et les sciences sociales. Avec l’abondance d’ensembles de données complexes et de grande taille émanant entre autres des médias sociaux et des processus numériques, des transactions financières, de l’astronomie, de la génomique, de la météorologie ou de la mégascience comme le grand collisionneur de hadrons, le traitement et l’analyse de données volumineuses est un enjeu majeur de la statistique moderne.

Membres du programme

Jean-François Angers (UdeM)
Masoud Asgharian (McGill)
Juli Atherton (UQAM)
Cédric Beaulac (UQAM)
Mylène Bédard (UdeM)
Martin Bilodeau (UdeM)
Guillaume Blanchet (Sherbrooke)
Taoufik Bouezmarni (Sherbrooke)
Félix Camirand Lemyre (Sherbrooke)
Anne-Sophie Charest (Laval)
Yogendra P. Chaubey (Concordia)
Ting Huei Chen (Laval)
Janie Coulombe (UdeM)
Mehdi Dagdoug (McGill)
Marie-Hélène Descary (UQAM)
Pierre Duchesne (UdeM)
Thierry Duchesne (Laval)
Debbie Dupuis (HEC)
Sorana Froda (UQAM)
Christian Genest (McGill)
Nadia Ghazzali (UQTR)
Frédéric Godin (Concordia)
Simon Guillotte (UQAM)
Klaus Herrmann (Sherbrooke)
Lisa Kakinami (Concordia)
Khader Khadraoui (Laval)
Abbas Khalili (McGill)
Eric Kolaczyk (McGill)
Lajmi Lakhal Chaieb (Laval)
Michaël Lalancette (UQAM)
Geneviève Lefebvre (UQAM)
Christian Léger (UdeM)
Florian Maire (UdeM)
Éric Marchand (Sherbrooke)
Mhamed Mesfioui (UQTR)
Alejandro Murua (UdeM)
Kirill Neklyudov (UdeM)
Johanna Nešlehová (McGill)
Karim Oualkacha (UQAM)
François Perron (UdeM)
Jean-François Quessy (UQTR)
Louis-Paul Rivest (Laval)
Arusharka Sen (Concordia)
Russell Steele (McGill)
David Stephens (McGill)
Audrey-Anne Vallée (Laval)
Yi Yang (McGill)
Mamadou Yauck (UQAM)

Formation

Le programme de statistique fournit aux étudiants gradués l'occasion d'étudier dans ces deux domaines importants de la statistique moderne. Les cours offerts dans ce programme permettront aux étudiants de 2e et 3e cycles de bien se familiariser avec les bases de la statistique mathématique, de la théorie de la décision et la statistique appliquée. De plus, quelques cours sont offerts pour initier les étudiants à des sujets de pointe dans ces domaines.

Ce programme est ouvert à tous les étudiants ayant une base solide en calcul différentiel et intégral, statistique mathématique, analyse numérique ainsi qu'en probabilité (le tout au niveau du 1er cycle). Pour acquérir une bonne formation en théorie de la décision et en statistique mathématique, nous pensons que les étudiants devraient prendre un cours de base en mesure et intégration (pour les étudiants au 3e cycle) et au moins trois cours aux niveaux intermédiaires et avancés.

Cours 2026-27

Automne

Statistical Inference 1

This course is an introduction to statistical inference for parametric models. The following topics will be covered:

1. Distribution of functions of several random variables (distribution function and change of variable techniques), sampling distribution of mean and variance of a sample from Normal distribution.

2. Distribution of order statistics and sample quantiles.

3. Estimation: unbiasedness, CramÈr-Rao lower bound and efficiency, method of moments and maximum likelihood estimation, consistency, limiting distributions, delta-method.

4. Sufficiency, minimal sufficiency, completeness, UMVUE, Rao-Blackwell and Lehman-Scheffe theorems.

5. Hypothesis-testing: likelihood-ratio tests.

6. Elements of Bayesian estimation and hypothesis-testing.

Prof. Arusharka Sen

MAST 672/2 sec. C / MAST 881C

Institution: Concordia University

Statistical Consultation and Data Analysis

Statistical software (R) will be used for the analysis of real‑life data sets. Topics may include techniques from generalized linear models, model selection, log‑linear models for categorical data, logistic regression, survival models.

Prof. Lisa Kakinami

MAST 678, sec. AA/ 881

Institution: Concordia University

Multivariate Statistics

This course introduces multivariate statistical analysis, both theory and methods, with focus on the multivariate Normal distribution. It can be seen as a preparatory course, although not a formal prerequisite, for Statistical Learning. Topics covered include:

Matrix Algebra & Random Vectors
The Multivariate Normal Distribution
Inferences about a Mean Vector
Comparisons of Several Multivariate Means
Principal Components
Factor Analysis and Inference for structured covariance matrices (time permitting)
Canonical Correlation Analysis (time permitting)
Discrimination and Classification

Text: Applied Multivariate Statistical Analysis, 6th Edition, by R. A. Johnson and D. W. Wichern, Pearson Prentice Hall (2007).

Recommended reading: Linear Statistical Inference and Its Applications, 2nd Edition, by C. R. Rao, Wiley (1973).

Evaluation: Assignments (4), Midterm exam, Final exam

Prof. Arusharka Sen

MAST 679 sec. K / 881K

Institution: Concordia University

Reinforcement Learning

This course is an introduction to reinforcement learning techniques. It requires extensive programming with the R language. Topics covered include: Multi-armed bandit problem, Markov Decision Problems, Dynamic Programming, Monte-Carlo solution methods, Temporal difference methods, Multi-period Approximation methods, Policy gradient.

Prof. Frédéric Godin

MAST 679, sec. L / 881

Institution: Concordia University

Regression and Analysis of Variance

Multivariate normal and chi-squared distributions; quadratic forms. Multiple linear regression estimators and their properties. General linear hypothesis tests. Prediction and confidence intervals. Asymptotic properties of least squares estimators. Weighted least squares. Variable selection and regularization. Selected advanced topics in regression. Applications to experimental and observational data.

Prof. Abbas Khalili Mahmoudabadi

MATH 533

Institution: Université McGill

Mathematical Statistics 1

Distribution theory, stochastic models and multivariate transformations. Families of distributions including location-scale families, exponential families, convolution families, exponential dispersion models and hierarchical models. Concentration inequalities. Characteristic functions. Convergence in probability, almost surely, in Lp and in distribution. Laws of large numbers and Central Limit Theorem. Stochastic simulation.

Prof. Johanna Neslehova

MATH 556

Institution: Université McGill

Bayesian Theory and Methods

Subjective probability, Bayesian statistical inference and decision making, de Finetti’s representation. Bayesian parametric methods, optimal decisions, conjugate models, methods of prior specification and elicitation, approximation methods. Hierarchical models. Computational approaches to inference, Markov chain Monte Carlo methods, Metropolis—Hastings. Nonparametric Bayesian inference.

Prof. David Stephens

MATH 559

Institution: Université McGill

Topics in Probability and Statistics / Advanced Topics in Statistics 1: Asymptotic Statistics

The course will develop the probabilistic foundations of large-sample theory and applies them to classical and modern problems in statistics. Emphasis will be placed on stochastic convergence, asymptotic expansions, uniform convergence, rates of convergence, and optimality. The final part of the course will introduce minimax rates in nonparametric regression and some semiparametric ideas such as influence functions and orthogonality. The course will be split into three main parts: (A) Mathematical and probabilistic foundations; (B) Classical large-sample theory; (C) Introduction to minimax and semiparametric theory.

In part (A), the focus will be on mathematical reminders, stochastic convergence, uniform integrability, laws of large numbers, central limit theorems, triangular arrays, and asymptotic linearity.

Part (B) will focus on studying the asymptotic properties of various classical estimators, such as the empirical distribution function and sample quantiles, maximum likelihood, M-estimation, Z-estimation, with various applications to estimation and testing.

Part (C) will be an introduction to selected modern topics such as minimax upper/lower bounds for nonparametric regression and semiparametric theory, as time permits.

Prof. Mohamed Mehdi Dagdoug

MATH 598, sec. 1 / MATH 782

Institution: Université McGill

Topics in Probability and Statistics / Advanced Topics in Statistics 1: Introduction to the Conformal Prediction Framework

This course provides a formal introduction to the conformal prediction framework, a model-agnostic uncertainty quantification method applicable to machine learning and artificial intelligence systems. The curriculum examines the mathematical mechanisms of distribution-free inference and their computational implementation. Students will derive finite-sample, model-agnostic coverage bounds utilizing exchangeability and permutation tests. The course connects theoretical foundations with applied methodology, analyzing the use of conformal inference in high-dimensional settings, trustworthy AI pipelines, computational biology, and drug discovery.

Prof. Archer Y. Yang

McGill MATH 598, sec. 2 / MATH 782

Institution: Université McGill

Topics in Probability and Statistics / Advanced Topics in Statistics 1: Model Selection and Sparsity in Statistical Learning

Description: This course introduces the fundamental concepts, methodologies, and computational tools of sparse statistical learning and related areas, with an emphasis on supervised learning problems in both low- and (ultra-)high-dimensional settings. Topics include:

1. An overview of model selection, with a particular focus on feature selection (sparsity) in supervised learning models;
2. Information-theoretic criteria for model and feature selection, together with their theoretical properties;
3. Continuous regularization methods and their theoretical foundations for feature selection in high- and ultra-high-dimensional supervised learning models;
4. An introduction to post-model-selection statistical inference and the challenges arising from data-driven model selection;
4. Sparse neural networks and methods to inducing sparsity in neural network architectures;
5. A brief introduction to distributed statistical learning methods for large-scale and big-data applications.

Numerical algorithms and computational aspects will be discussed throughout the course as they naturally arise in the development of the material.

Method of evaluation: Assignments (40%), final term project (40%), final presentation (20%)

Prof. Abbas Khalili Mahmoudabadi

MATH 598, sec. 3 / MATH 782

Institution: Université McGill

Computation Intensive Statistics

General introduction to computational methods in statistics; optimization methods; EM algorithm; random number generation and simulations; bootstrap, jackknife, cross-validation, resampling and permutation; Monte Carlo methods: Markov chain Monte Carlo and sequential Monte Carlo; computation in the R language.

Prof. Archer Yi Yang

MATH 680

Institution: Université McGill

Statistical Inference

Conditional probability and Bayes’ Theorem, discrete and continuous univariate and multivariate distributions, conditional distributions, moments, independence of random variables. Modes of convergence, weak law of large numbers, central limit theorem. Point and interval estimation. Likelihood inference. Bayesian estimation and inference. Hypothesis testing.

Prof.

MATH 682

Institution: Université McGill

Données catégorielles

Tableaux de contingence à plusieurs dimensions. Mesures d'association. Risque relatif, rapport de cote. Tests exacts et asymptotiques. Régression logistique, de Poisson, multinomiale, logistique cumulative. Modèles log-linéaires. Modèles graphiques.

Prof. Alejandro Murua

STT 6516

Institution: Université de Montréal

Inférence statistique

Principes d'inférence : estimation ponctuelle, distribution des estimateurs, test d’hypothèse, région de confiance. Approche bayésienne. Méthodes de rééchantillonnage. Estimation non paramétrique. Applications modernes de la statistique.

Prof. Mylène Bédard

STT 6700

Institution: Université de Montréal

Statistique mathématique

Fonctions de variables aléatoires, fonction génératrice des moments, quelques inégalités et identités en probabilité, familles de distributions dont la famille exponentielle, vecteurs aléatoires, loi multinormale, espérances conditionnelles, mélanges et modèles hiérarchiques. Théorèmes de convergence, méthodes de simulation, statistiques d'ordre, exhaustivité, vraisemblance. Estimation ponctuelle et par intervalles : construction d'estimateurs et critères d'évaluation, méthodes bayésiennes. Normalité asymptotique et efficacité relative asymptotique.

Prof. Éric Marchand

STT 751

Institution: Université de Sherbrooke

Inférence statistique 1

Espérance conditionnelle. Prédiction. Modèles statistiques, familles exponentielles, exhaustivité. Méthodes d'estimation: maximum de vraisemblance, moindres carrés etc. Optimalité: estimateurs sans biais à variance minimum, inégalité de l'information. Propriétés asymptotiques des estimateurs. Intervalles de confiance et précision. Éléments de base de la théorie des tests. Probabilité critique, puissance en relation avec la taille d'échantillon. Relation entre tests et intervalles de confiance. Tests pour des données discrètes.

Prof. Simon Guillotte

MAT 7081

Institution: Université du Québec à Montréal

Analyse statistique multivariée

Étude des distributions échantillonnales classiques: T2 de Hotelling; loi de Wishart; distribution des valeurs et des vecteurs propres; distribution des coefficients de corrélation. Analyse de variance multivariée. Test d'indépendance de plusieurs sous-vecteurs. Test de l'égalité de matrices de covariance. Sujets spéciaux.

Prof. Karim Oualkacha

MAT 8081

Institution: Université du Québec à Montréal

Principes de simulation

Nombre aléatoire. Simulation de lois classiques. Méthodes d'inversion et de rejet. Algorithmes spécifiques. Simulation des chaines de Markov à temps discret et continu. Solution numérique des équations différentielles ordinaires et stochastiques. Méthode numérique d'Euler et de Runge-Kutta. Formule de Feynman-Kac. Discrétisation. Approximation faible et forte, explicite et implicite. Réduction de la variance. Analyse des données simulées. Sujets spéciaux.

Prof. Simon Guillotte

MAT8780

Institution: Université du Québec à Montréal

Modélisation statistique de la dépendance stochastique

Rappel sur les principales notions de statistique mathématique et sur la statistique asymptotique. Introduction à la théorie des copules. Description des modèles de dépendance bidimensionnels et multidimensionnels les plus populaires et exploration exhaustive des propriétés de ces copules. Inférence statistique dans les modèles de copules : estimation de paramètres, copule empirique, tests d'adéquation et tests d'hypothèses composites. La méthode delta fonctionnelle et ses nombreuses applications, notamment en inférence de copules. Survol des avancées récentes, incluant les tests de rupture, l'étude de la dépendance conditionnelle, la modélisation de la dépendance spatiale et l'utilisation de la fonction caractéristique. Les objectifs spécifiques de ce cours sont : de maîtriser la théorie des copules, de connaître les principales méthodes d'inférence concernant les copules, d'être au fait des principaux développements récents, de bien connaître la littérature sur les copules, d'être capable de mettre en oeuvre les méthodes statistiques avec le logiciel Matlab (estimation de la puissance de tests, analyse de jeux de données).

Prof. Jean-François Quessy

MAP6022

Institution: Université du Québec à Trois-Rivières

Hiver

Time Series and Forecasting

This course introduces the theory and practice of time series analysis. Both time and frequency domain methods will be discussed. The objective of this course is to learn and apply statistical methods for the analysis of data that have been observed over time. The Analyses will be performed using the freely available package ITSM, which accompanies the textbook. Topics covered include:

Introduction to Time Series;
Stationary Processes;
ARMA Models;
Spectral Analysis;
Modeling and Forecasting with ARMA Processes.

Prof. J. Zhang

MAST 677/ sec. J / 881

Institution: Concordia University

Topics in Statistics and Probability: Neural Networks

This course is an introduction to the theory of prediction with neural networks. Several applications of neural networks to common problems faced in practice are finally explored. Students will also be exposed to the implementation of methods seen in class; programming assignments use the Python or R programming languages.

Topics covered: Review of predictive analytics and numerical computation concepts: Supervised learning, cross-validation, hyperparameters; Overflow and underflow; Feed-forward neural networks, Motivation, Non-linear predictions, Universality property.

Classification versus regression problems: Architecture specification, Parameter estimation, Objective function; Steepest gradient descent; Backpropagation, saturation, Hessian computation; Parameter initialization strategies.

Advanced estimation topics: Adaptive learning rates: Regularization, Dataset augmentation and noise injection, Alternative neural network types, Recurrent neural networks (RNN), Long-short term (LSTM) neural networks, Convolutional neural networks, Implementations and Applications.

Prof. Cody Hyndman

MAST 679, sec. G/ 881

Institution: Concordia University

Topics in Statistics and Probability: Statistical Learning

This course is an introduction to statistical learning techniques. Topics covered include: cross- validation, regression methods, classification methods, tree-based methods, introduction to neural networks, unsupervised learning.

Prof. Simone Brugiapaglia

Concordia MAST 679, sec. H/ 881

Institution: Concordia University

Topics in Statistics and Probability: Large Sample Statistics

This course will cover selected topics from asymptotic theory of statistical inference, i.e., properties of statistical inference procedures when sample-size is large. Needless to say, these properties are obtained via taking limit as sample-size goes to infinity. Even in moderately complex statistical models the large-sample properties, such as variance of an estimator, are less cumbersome than the exact ones, i.e., those for a fixed sample-size. Both parametric and non-parametric framework will be considered. Topics to be covered include:
Functional Delta-method, U-statistics, M-estimators, Rank statistics, Local asymptotic normality (LAN).

Texts: Serfling, 'Approximation theorems of mathematical statistics' (John Wiley, 1980), van der Vaart, 'Asymptotic statistics' (Cambridge University Press, 1998) and journal articles.

Evaluation: Assignments (3) and class presentations of assigned journal articles.

Prof. Arusharka Sen

Concordia MAST 679, sec. I/ 881

Institution: Concordia University

Analysis of Categorical Data

Probability distributions for categorical data, Analysis of 2X2 contingency tables, Multiway contingency tables, The Logistic regression, Logistic regression for categorical predictors, Logit models for nominal and ordinal responses, Log-linear models and modelling ordinal associations in contingency tables, Unsupervised learning techniques for categorical data, Non Linear Principal component analysis, Applications of unsupervised learning techniques using R, Item Response Theory, Rasch model. Some topics may be included or excluded as the time permits.

Prof. Alia Sajjad

MATH 511

Institution: Université McGill

Generalized Linear Models

Exponential families, link functions. Inference and parameter estimation for generalized linear models; model selection using analysis of deviance. Residuals. Contingency table analysis, logistic regression, multinomial regression, Poisson regression, log-linear models. Multinomial models. Overdispersion and Quasilikelihood. Applications to experimental and observational data.

Prof. Johanna Neslehova

MATH 523

Institution: Université McGill

Lifetime Data Analysis

In-depth study of survival analysis, covering foundational concepts and advanced techniques in time-to-event data analysis. Exploration of censoring and truncation, survival and hazard functions, and nonparametric methods like the Kaplan-Meier estimator. Core topics include hypothesis testing for survival distributions, parametric and semiparametric modeling, and covariate inclusion through the Cox proportional hazards model. Emphasis is placed on model diagnostics, validation, and variable selection techniques, including best-subset selection, LASSO, and nonconcave penalized likelihood approaches. Practical applications and hands-on analysis using R for survival data in research and applied settings.

Prof. Masoud Asgharian-Dastenaei

MATH 526

Institution: Université McGill

Introduction to Time Series Analysis

Stationary processes; estimation and forecasting of ARMA models; non-stationary and seasonal models; state-space models; financial time series models; multivariate time series models; introduction to spectral analysis; long memory models.

Prof. Johanna Neslehova

MATH 545

Institution: Université McGill

Mathematical Statistics 2

Sufficiency, minimal and complete sufficiency, ancillarity. Fisher and Kullback-Leibler information. Elements of decision theory. Theory of estimation and hypothesis testing from the Bayesian and frequentist perspective. Elements of asymptotic statistics including large-sample behaviour of maximum likelihood estimators, likelihood-ratio tests, and chi-squared goodness-of-fit tests.

Prof. Masoud Asgharian-Dastenaei

MATH 557

Institution: Université McGill

Design of Experiments (McGill)

Introduction to concepts in statistically designed experiments. Randomization and replication. Completely randomized designs. Simple linear model and analysis of variance. Introduction to blocking. Orthogonal block designs. Models and analysis for block designs. Factorial designs and their analysis. Row-column designs. Latin squares. Model and analysis for fixed row and column effects. Split-plot designs, model and analysis. Relations and operations on factors. Orthogonal factors. Orthogonal decomposition. Orthogonal plot structures. Hasse diagrams. Applications to real data and ethical issues.

Prof. Alia Sajjad

MATH 558

Institution: Université McGill

Méthodes de statistique bayésienne

Principes de l’analyse bayésienne; loi à priori et à postériori, inférence statistique et théorie de la décision. Méthodes computationnelles; méthodes de Monte Carlo par chaînes de Markov. Applications.

Prof. Dirk Douwes-Schultz

STT6215

Institution: Université de Montréal

Régression

Rappels sur les modèles linéaires généralisés (inférence, tests, validation, choix de modèle). Géométrie de la régression. Étude asymptotique des estimateurs et réduction de variance. Régression robuste. Régression non paramétrique.

Prof. Florian Maire

STT 6415

Institution: Université de Montréal

Analyse des données multivariées

Distributions elliptiques. Estimateurs de localisation et dispersion. Estimateur robuste. Corrélations multiple, partielle, canonique. Tests paramétriques, de permutation, du bootstrap. Classification. Analyse en composantes principales. Prévision.

Prof. Martin Bilodeau

STT6515

Institution: Université de Montréal

Statistique-informatique

Prof. Kirill Neklyudov

STT6715U

Institution: Université de Montréal

Analyse de données

Analyse en composantes principales. Analyse des corrélations canoniques et régression multidimensionnelle. Analyse des correspondances. Discrimination. Classification. Analyse factorielle d'opérateurs.

Prof. Taoufik Bouezmarni

STT 707

Institution: Université de Sherbrooke

Méthodes en régression – Théorie et pratique

Régression linéaire, modèles mixtes, modèles linéaires généralisés, régression de copule, méthodes non paramétriques, sélection de modèle, erreurs de mesure, données manquantes. Utilisation du logiciel R.

Prof.

STT 753

Institution: Université de Sherbrooke

Modèles de régression

Théorie des modèles linéaires généraux. Théorie des modèles linéaires généralisés. Régression logistique. Modèles log-linéaires.

Prof.

MAT 7381

Institution: Université du Québec à Montréal

Théorie de l'apprentissage statistique

Classes d'hypothèse. Fonctions de perte et de risque. Décomposition biais-complexité. Complexité algorithmique. Régularisation, stabilité et surapprentissage. Optimisation convexe. Révision des modèles d'apprentissage statistique classiques, tels les réseaux de neurones, à travers cette nouvelle perspective. Programmation dans un langage tel que R ou Python.