Alejandro de la Concha Duarte

Doctoral Researcher in Statistical Machine Learning

Contact: alejandro.d.delaconcha(at)gmail.com

I am Alejandro de la Concha, a PhD researcher in statistical machine learning at the École Normale Supérieure Paris-Saclay under the supervision of Prof. Nicolas Vayatis and Argyris Kalogeratos.

My thesis focuses on developing graph-based machine learning techniques for analyzing data generated in complex systems such as transport systems, financial networks, and sensor networks, to name a few. My work includes contributions to the subfields of: 1) Kernel methods and hypothesis testing. 2) Density/likelihood-ratio estimation. 3) Graph-signal processing.

Professionally, I have held positions in public and private institutions, where I have mainly focused on applying machine learning to risk management. I held a research data scientist position at Financial Network Analytics (FNA) and a financial risk analyst position at the Central Bank of Mexico (BANXICO) within the Directorate of Risk Management.

Research Interests

• Kernel Methods
• Non-Parametric Statistics
• Time Series Analysis
• Graph-Signal Processing
• Federated Learning
• Machine Learning
• Risk Management

Education

OCT.2019 - Present

Gif-sur-Yvette, France
PhD in Applied Mathematics

École Normale Supérieure Paris-Saclay
SEP.2018 - SEP.2019

Gif-sur-Yvette, France
Msc in Applied Mathematics

École Normale Supérieure Paris-Saclay

Master Year 2 Mathematics, Vision, Learning (MVA)

GPA: 17/20
AUG.2017 - AUG.2018

Évry, France
Msc in Applied Mathematics

Paris-Saclay University/ University of Évry Val d'Essonne

Master Year 1 Mathematics and applications

GPA: 17/20, Class rank: 1.
AUG.2011 - AUG.2015

Mexico city, Mexico
BSc in Actuarial Sciences

National Autonomous University of Mexico

Semester abroad at University of Rouen Normandy, France, SEP-2014 - JAN-2015

GPA: 9.97/10, Class rank: 1.

Publications

Collaborative likelihood-ratio estimation over graphs

Alejandro de la Concha Duarte, Nicolas Vayatis, and Argyris Kalogeratos

2024

Assuming we have i.i.d observations from two unknown probability density functions (pdfs), p and q, the density/likelihood-ratio estimation (LRE) is an elegant approach to compare the two pdfs only by relying on the available data. In this paper, we introduce the first — to the best of our knowledge — graph-based extension of this problem, which reads as follows: Suppose each node v of a fixed graph has access to observations coming from two unknown node-specific pdfs, p_v and q_v, and the goal is to estimate for each node the likelihood-ratio between both pdfs by also taking into account the information provided by the graph structure. The node-level estimation tasks are supposed to exhibit similarities conveyed by the graph, which suggests that the nodes could collaborate to solve them more efficiently. We develop this idea in a concrete non-parametric method that we call Graph-based Relative Unconstrained Least-squares Importance Fitting (GRULSIF). We derive convergence rates for our collaborative approach that highlights the role played by variables such as the number of available observations per node, the size of the graph, and how accurately the graph structure encodes the similarity between tasks. These theoretical results explicit the situations where collaborative estimation effectively leads to an improvement in performance compared to solving each problem independently.

arXiv Code
Collaborative non-parametric two-sample testing

Alejandro David De Concha Duarte, Nicolas Vayatis, and Argyris Kalogeratos

In Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, 03–05 may 2025

Multiple two-sample test problem in a graph-structured setting is a common scenario in fields such as Spatial Statistics and Neuroscience. Each node v in fixed graph deals with a two-sample testing problem between two node-specific probability density functions, p_v and q_v. The goal is to identify nodes where the null hypothesis p_v = q_v should be rejected, under the assumption that connected nodes would yield similar test outcomes. We propose the non-parametric collaborative two-sample testing (CTST) framework that efficiently leverages the graph structure and minimizes the assumptions over p_v and q_v. CTST integrates elements from f-div estimation, Kernel Methods, and Multitask Learning.

PDF Code
Online non-parametric likelihood-ratio estimation by Pearson-divergence functional minimization

Alejandro de la Concha Duarte, Nicolas Vayatis, and Argyris Kalogeratos

In Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, 03–05 may 2024

The density/likelihood-ratio estimation problem between two pdfs p and q, has been investigated mainly for the offline case. This paper contributes by introducing a new framework for online non-parametric density/likelihood-ratio estimation for the setting where pairs of i.i.d observations (x_t ∼p, x’_t ∼q) are observed over time. Our online estimator named OLRE capitalizes on recent advances in Kernel Methods and functional minimization. The non-parametric and online nature of OLRE makes it a significant step in the development of sequential change-point detectors based on the likelihood-ratio.

PDF Code
Offline detection of change-points in the mean for stationary graph signals

Alejandro de la Concha Duarte, Nicolas Vayatis, and Argyris Kalogeratos

In Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, 03–05 may 2021

This paper addresses the problem of segmenting a stream of graph signals: we aim to detect changes in the mean of a multivariate signal defined over the nodes of a known graph. We propose an offline change-point detection method that relies on the concept of graph stationarity. This approach allows the problem to be translated from the original vertex domain to the spectral domain induced by the Graph Fourier Transform, making the change-point detection problem much easier to solve. Our change-point detection method adopts a model selection approach that assumes the mean of the graph signals at each segment admits a sparse spectral representation and determines automatically the number of change-points.

PDF Code
Multiplex Financial Networks: Revealing the Level of Interconnectedness in the Banking System

Alejandro de la Concha Duarte, Serafin Martinez-Jaramillo, and Christian Carmona

In Complex Networks & Their Applications VI, 03–05 may 2018

Complex network models have proven to be useful tools for studying systemic risk, particularly in characterizing and describing the level of interconnectedness in a financial system. Nevertheless, most existing work has focused primarily on a single phenomenon: interbank (exposure) networks. In this paper, we study the Mexican banking system using a comprehensive set of market interactions, including transactions in securities markets, payment system flows, interbank loans, foreign exchange, and derivatives exposures. To the best of our knowledge, this is the first attempt to describe so comprehensively the complexity and interconnectedness of a banking system.

PDF
Modelos de ecuaciones estructurales vía mínimos cuadrados parciales con R y SmartP

Alejandro de la Concha

In Aspectos metodológicos para la investigación social: Modelos de ecuaciones estructurales, 03–05 may 2019

This book chapter describes the Structural Equation Modeling approach using Partial Least Squares (SEM-PLS). Different software alternatives for implementing this type of model are presented. Finally, a practical case is solved step by step to better illustrate both the theory and the use of the software.

PDF

Work Experience

FEB.2020 - FEB.2022

London, UK
Research Data Scientist

Financial Network Analytics
- Developed machine learning algorithms for market and systemic risk management
- Designed models to enhance network analysis
- Built tools for natural language processing
APR.2019 - SEP.2019

Gif-sur-Yvette, France
Master's Internship

École Normale Supérieure Paris-Saclay

Thesis: Segmentation of multivariate signals observed over graphs
- Reviewed the state of the art in change-point detection and graph-signal processing
APR.2018 - JUN.2018

London, UK
Data Scientist (Master's Internship)

Financial Network Analytics

Thesis: Tools from Network Theory applied to Systemic Risk
- Developed graph-based machine learning techniques for market and systemic risk management
APR.2016 - SEP.2017

Mexico City, Mexico
Risk Analyst

Central Bank of Mexico, Risk Management Directorate
- Financial risk management for Mexico’s international reserve portfolio
- Conducted research on systemic risk in the Mexican banking system
- Performed actuarial analysis of the central bank’s private pension plan
OCT.2015 - MAR.2016

Mexico City, Mexico
Risk Analyst (Bacherlor's Internship)

Central Bank of Mexico, Risk Management Directorate
- Created in-house computational tools for estimating the maximum limitation of liability of insurance contracts subscribed by BANXICO
JUN.2015 - SEP.2015

Toronto, Canada
Research Internship

University of Toronto

Insurance Loss Modelling with Erlang-based Mixture Models
- Conducted a literature review on copula modeling and mixture models applied to insurance