Alejandro de la Concha Duarte

Doctoral Researcher in Statistical Machine Learning

foto.jpg

Contact: alejandro.d.delaconcha(at)gmail.com

I am Alejandro de la Concha, a PhD researcher in statistical machine learning at the École Normale Supérieure Paris-Saclay under the supervision of Prof. Nicolas Vayatis and Argyris Kalogeratos.

My thesis focuses on developing graph-based machine learning techniques for analyzing data generated in complex systems such as transport systems, financial networks, and sensor networks, to name a few. My work includes contributions to the subfields of: 1) Kernel methods and hypothesis testing. 2) Density/likelihood-ratio estimation. 3) Graph-signal processing.

Professionally, I have held positions in public and private institutions, where I have mainly focused on applying machine learning to risk management. I held a research data scientist position at Financial Network Analytics (FNA) and a financial risk analyst position at the Central Bank of Mexico (BANXICO) within the Directorate of Risk Management.

Research Interests

Education

  • OCT.2019 - Present

    Gif-sur-Yvette, France

    PhD in Applied Mathematics
    École Normale Supérieure Paris-Saclay
  • SEP.2018 - SEP.2019

    Gif-sur-Yvette, France

    Msc in Applied Mathematics
    École Normale Supérieure Paris-Saclay
    Master Year 2 Mathematics, Vision, Learning (MVA)
    GPA: 17/20
  • AUG.2017 - AUG.2018

    Évry, France

    Msc in Applied Mathematics
    Paris-Saclay University/ University of Évry Val d'Essonne
    Master Year 1 Mathematics and applications
    GPA: 17/20, Class rank: 1.
  • AUG.2011 - AUG.2015

    Mexico city, Mexico

    BSc in Actuarial Sciences
    National Autonomous University of Mexico
    Semester abroad at University of Rouen Normandy, France, SEP-2014 - JAN-2015
    GPA: 9.97/10, Class rank: 1.

Publications

  1. Collaborative likelihood-ratio estimation over graphs
    Alejandro de la Concha Duarte, Nicolas Vayatis, and Argyris Kalogeratos
    2024

    Assuming we have i.i.d observations from two unknown probability density functions (pdfs), p and q, the density/likelihood-ratio estimation (LRE) is an elegant approach to compare the two pdfs only by relying on the available data. In this paper, we introduce the first — to the best of our knowledge — graph-based extension of this problem, which reads as follows: Suppose each node v of a fixed graph has access to observations coming from two unknown node-specific pdfs, p_v and q_v, and the goal is to estimate for each node the likelihood-ratio between both pdfs by also taking into account the information provided by the graph structure. The node-level estimation tasks are supposed to exhibit similarities conveyed by the graph, which suggests that the nodes could collaborate to solve them more efficiently. We develop this idea in a concrete non-parametric method that we call Graph-based Relative Unconstrained Least-squares Importance Fitting (GRULSIF). We derive convergence rates for our collaborative approach that highlights the role played by variables such as the number of available observations per node, the size of the graph, and how accurately the graph structure encodes the similarity between tasks. These theoretical results explicit the situations where collaborative estimation effectively leads to an improvement in performance compared to solving each problem independently.

    data_to_ratio.jpg
  2. Collaborative non-parametric two-sample testing
    Alejandro David De Concha Duarte, Nicolas Vayatis, and Argyris Kalogeratos
    In Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, 03–05 may 2025

    Multiple two-sample test problem in a graph-structured setting is a common scenario in fields such as Spatial Statistics and Neuroscience. Each node v in fixed graph deals with a two-sample testing problem between two node-specific probability density functions, p_v and q_v. The goal is to identify nodes where the null hypothesis p_v = q_v should be rejected, under the assumption that connected nodes would yield similar test outcomes. We propose the non-parametric collaborative two-sample testing (CTST) framework that efficiently leverages the graph structure and minimizes the assumptions over p_v and q_v. CTST integrates elements from f-div estimation, Kernel Methods, and Multitask Learning.

    collaborative_ctst.png
  3. Online non-parametric likelihood-ratio estimation by Pearson-divergence functional minimization
    Alejandro de la Concha Duarte, Nicolas Vayatis, and Argyris Kalogeratos
    In Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, 03–05 may 2024

    The density/likelihood-ratio estimation problem between two pdfs p and q, has been investigated mainly for the offline case. This paper contributes by introducing a new framework for online non-parametric density/likelihood-ratio estimation for the setting where pairs of i.i.d observations (x_t ∼p, x’_t ∼q) are observed over time. Our online estimator named OLRE capitalizes on recent advances in Kernel Methods and functional minimization. The non-parametric and online nature of OLRE makes it a significant step in the development of sequential change-point detectors based on the likelihood-ratio.

    time_series_to_ratio.png
  4. Offline detection of change-points in the mean for stationary graph signals
    Alejandro de la Concha Duarte, Nicolas Vayatis, and Argyris Kalogeratos
    In Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, 03–05 may 2021

    This paper addresses the problem of segmenting a stream of graph signals: we aim to detect changes in the mean of a multivariate signal defined over the nodes of a known graph. We propose an offline change-point detection method that relies on the concept of graph stationarity. This approach allows the problem to be translated from the original vertex domain to the spectral domain induced by the Graph Fourier Transform, making the change-point detection problem much easier to solve. Our change-point detection method adopts a model selection approach that assumes the mean of the graph signals at each segment admits a sparse spectral representation and determines automatically the number of change-points.

    offline_mean.jpg
  5. Multiplex Financial Networks: Revealing the Level of Interconnectedness in the Banking System
    Alejandro de la Concha Duarte, Serafin Martinez-Jaramillo, and Christian Carmona
    In Complex Networks & Their Applications VI, 03–05 may 2018

    Complex network models have proven to be useful tools for studying systemic risk, particularly in characterizing and describing the level of interconnectedness in a financial system. Nevertheless, most existing work has focused primarily on a single phenomenon: interbank (exposure) networks. In this paper, we study the Mexican banking system using a comprehensive set of market interactions, including transactions in securities markets, payment system flows, interbank loans, foreign exchange, and derivatives exposures. To the best of our knowledge, this is the first attempt to describe so comprehensively the complexity and interconnectedness of a banking system.

    multiplex.png
  6. Modelos de ecuaciones estructurales vía mínimos cuadrados parciales con R y SmartP
    Alejandro de la Concha
    In Aspectos metodológicos para la investigación social: Modelos de ecuaciones estructurales, 03–05 may 2019

    This book chapter describes the Structural Equation Modeling approach using Partial Least Squares (SEM-PLS). Different software alternatives for implementing this type of model are presented. Finally, a practical case is solved step by step to better illustrate both the theory and the use of the software.

    SEM.png

Work Experience

  • FEB.2020 - FEB.2022

    London, UK

    Research Data Scientist
    Financial Network Analytics
    • Developed machine learning algorithms for market and systemic risk management
    • Designed models to enhance network analysis
    • Built tools for natural language processing
  • APR.2019 - SEP.2019

    Gif-sur-Yvette, France

    Master's Internship
    École Normale Supérieure Paris-Saclay
    Thesis: Segmentation of multivariate signals observed over graphs
    • Reviewed the state of the art in change-point detection and graph-signal processing
  • APR.2018 - JUN.2018

    London, UK

    Data Scientist (Master's Internship)
    Financial Network Analytics
    Thesis: Tools from Network Theory applied to Systemic Risk
    • Developed graph-based machine learning techniques for market and systemic risk management
  • APR.2016 - SEP.2017

    Mexico City, Mexico

    Risk Analyst
    Central Bank of Mexico, Risk Management Directorate
    • Financial risk management for Mexico’s international reserve portfolio
    • Conducted research on systemic risk in the Mexican banking system
    • Performed actuarial analysis of the central bank’s private pension plan
  • OCT.2015 - MAR.2016

    Mexico City, Mexico

    Risk Analyst (Bacherlor's Internship)
    Central Bank of Mexico, Risk Management Directorate
    • Created in-house computational tools for estimating the maximum limitation of liability of insurance contracts subscribed by BANXICO
  • JUN.2015 - SEP.2015

    Toronto, Canada

    Research Internship
    University of Toronto
    Insurance Loss Modelling with Erlang-based Mixture Models
    • Conducted a literature review on copula modeling and mixture models applied to insurance

Software

Invited Talks