Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Posts

publications

A Local Existence Theorem for a Parabolic Blow-Up Inverse Problem.

Published in Pure Mathematics, 2017

Recommended citation: Yu Pan, Xuran Meng and Wuqing Ning, "A Local Existence Theorem for a Parabolic Blow-Up Inverse Problem." Pure Mathematics, 2017.
Download Paper

High frequency algorithm and its back-testing results based on GAN.

Published in JUSTC, 2020

Recommended citation: Xuran Meng, Xiuchun Bi and Shuguang Zhang, "High frequency algorithm and its back-testing results based on GAN." JUSTC 50, 2020.
Download Paper

l1–2 minimisation for compressed sensing with partially known signal support.

Published in Electronics Letters, 2020

Recommended citation: Jing Zhang, Shuguang Zhang and Xuran Meng, "l1–2 minimisation for compressed sensing with partially known signal support." Electronics Letters 56, 2020.
Download Paper

Impact of classification difficulty on the weight matrices spectra in Deep Learning and application to early-stopping.

Published in Journal of Machine Learning Research, 2023

Recommended citation: Xuran Meng and Jianfeng Yao, "Impact of classification difficulty on the weight matrices spectra in Deep Learning and application to early-stopping." JMLR 24, 2023.
Download Paper | Download Slides

Multiple Descent in the Multiple Random Feature Model.

Published in Journal of Machine Learning Research, 2024

Recommended citation: Xuran Meng, Jianfeng Yao and Yuan Cao, "Multiple Descent in the Multiple Random Feature Model." JMLR 25, 2024.
Download Paper | Download Slides

Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks for XOR Data.

Published in International Conference on Machine Learning, 2024

Recommended citation: Xuran Meng, Difan Zou and Yuan Cao, "Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks for XOR Data." ICML, 2024.
Download Paper

Initialization Matters: On the Benign Overfitting of Two-Layer ReLU CNN with Fully Trainable Layers.

Published in Arxiv, 2024

Submitted to JMLR

Recommended citation: Shuning Shang, Xuran Meng, Yuan Cao and Difan Zou, "Initialization Matters: On the Benign Overfitting of Two-Layer ReLU CNN with Fully Trainable Layers." arxiv: 2410.19139, 2024.
Download Paper

Per-Example Gradient Regularization Improves Learning Signals from Noisy Data.

Published in Machine Learning, 2025

Machine Learning

Recommended citation: Xuran Meng, Yuan Cao and Difan Zou, "Per-Example Gradient Regularization Improves Learning Signals from Noisy Data." Machine Learning, 2025.
Download Paper

Transformer learns optimal variable selection in group-sparse classification.

Published in International Conference on Learning Representations, 2025

Recommended citation: Chenyang Zhang, Xuran Meng and Yuan Cao, "Transformer learns optimal variable selection in group-sparse classification." ICLR, 2025.
Download Paper

Estimation of Out-of-Sample Sharpe Ratio for High Dimensional Portfolio Optimization.

Published in Journal of the American Statistical Association, 2025

JASA (Theory and Methodology)

Recommended citation: Xuran Meng, Yuan Cao and Weichen Wang, "Estimation of Out-of-Sample Sharpe Ratio for High Dimensional Portfolio Optimization." JASA (Theory and Methodology), 2025.
Download Paper | Download Slides

Towards Understanding Feature Learning in Parameter Transfer.

Published in Arxiv, 2025

Submitted to ICLR

Recommended citation: Hua Yuan, Xuran Meng et. al., "Towards Understanding Feature Learning in Parameter Transfer." arxiv: 2509.22056, 2025.
Download Paper

Xuran Meng and Yi Li’s contribution to the Discussion of “On optimal linear prediction” by I. Helland.

Published in Scandinavian Journal of Statistics, 2025

Recommended citation: Xuran Meng and Yi Li, "Xuran Meng and Yi Li’s contribution to the Discussion of “On optimal linear prediction” by I. Helland." Scandinavian Journal of Statistics, 2025.
Download Paper

Statistical Inference on High Dimensional Gaussian Graphical Regression Models.

Published in Biometrics, 2025

Recommended citation: Xuran Meng, Jingfei Zhang and Yi Li, "Statistical Inference on High Dimensional Gaussian Graphical Regression Models." Biometrics, 2025.
Download Paper

Two-part Statistical Model for Identifying Baseline Predictors of Chronic Postsurgical Pain.

Published in Anesthesiology, 2026

A substantial proportion of patients report no pain after surgery, resulting in an excess of zero values that pose challenges for analysis using traditional statistical models. The present study was designed to test the hypothesis that a two-part model, commonly used in healthcare expenditures research, would demonstrate superior performance in predicting postsurgical pain when compared to traditional models, and would secondarily better identify predictors of this clinically important outcome.

Recommended citation: Stephan G Frangakis, Xuran Meng, Mark C Bicket, Vidhya Gunaseelan, Sawsan As Sanie, Andrew Urquhart, Yi Li and Chad M Brummett,
Download Paper

Inference for Deep Neural Network Estimators in Generalized Nonparametric Models.

Published in Journal of the American Statistical Association, 2026

JASA (Theory and Methodology)

Recommended citation: Xuran Meng and Yi Li, "Inference for Deep Neural Network Estimators in Generalized Nonparametric Models.." JASA (Theory and Methodology), 2026+.
Download Paper

Beyond Consistency: Inference for the Relative Risk Functional in Deep Nonparametric Cox Models.

Published in Arxiv, 2026

There remain theoretical gaps in deep neural network estimators for the nonparametric Cox proportional hazards model. In particular, it is unclear how gradient-based optimization error propagates to population risk under partial likelihood, how pointwise bias can be controlled to permit valid inference, and how ensemble-based uncertainty quantification behaves under realistic variance decay regimes. We develop an asymptotic distribution theory for deep Cox estimators that addresses these issues. First, we establish nonasymptotic oracle inequalities for general trained networks that link in-sample optimization error to population risk without requiring the exact empirical risk optimizer. We then construct a structured neural parameterization that achieves infinity-norm approximation rates compatible with the oracle bound, yielding control of the pointwise bias. Under these conditions and using the Hajek–Hoeffding projection, we prove pointwise and multivariate asymptotic normality for subsampled ensemble estimators. We derive a range of subsample sizes that balances bias correction with the requirement that the Hajek–Hoeffding projection remain dominant. This range accommodates decay conditions on the single-overlap covariance, which measures how strongly a single shared observation influences the estimator, and is weaker than those imposed in the subsampling literature. An infinitesimal jackknife representation provides analytic covariance estimation and valid Wald-type inference for relative risk contrasts such as log-hazard ratios. Finally, we illustrate the finite-sample implications of the theory through simulations and a real data application.

Recommended citation: Sattwik Ghosal, Xuran Meng and Yi Li,
Download Paper

Temporal Self-Rewarding Language Models: Decoupling Chosen-Rejected via Past-Future.

Published in International Conference on Machine Learning, 2026

Self-Rewarding Language Models propose an architecture in which the Large Language Models(LLMs) both generates responses and evaluates its own outputs via LLM-as-a-Judge prompting, dynamically improving its generative capabilities through iterative Direct Preference Optimization (DPO). However, our analysis reveals a critical limitation in existing Self-Rewarding paradigms: the synchronized improvement of chosen and rejected responses progressively narrows the representational difference between contrasting samples, undermining effective preference learning. We propose \textbf{Temporal Self-Rewarding Language Models} that strategically coordinate past, present, and future model generations to sustain learning signals. Our dual-phase framework introduces: (1) \textit{Anchored Rejection} - fixing rejected responses using the past initial model’s outputs and (2) \textit{Future-Guided Chosen} - dynamically curating chosen samples using next-generation model predictions. Extensive experiments across three model families (Llama, Qwen, Mistral) and different model sizes (Llama3B/8B/70B) demonstrate significant improvements when trained with our method compared to Self-Rewarding using same computation resources. For example, Llama3.1-8B reaches a 29.44 win rate on AlpacaEval 2.0 with our method, outperforming the Self-Rewarding baseline (19.69) by 9.75. Notably, our method also demonstrates superior out-of-distribution generalization across mathematical reasoning (GSM8K), knowledge-based QA (ARC, TruthfulQA), and code generation (HumanEval) tasks, even though we do not specifically collect such training data.

Recommended citation: Yidong Wang et. al., "Temporal Self-Rewarding Language Models: Decoupling Chosen-Rejected via Past-Future." arxiv: 2508.06026, 2025.
Download Paper

teaching

Tutor from 2020-2024

Undergraduate/Postgraduate course, University of Hong Kong, Department of Statistics and Actuarial Science, 2020

Stochastic Process, Financial Economics, Bayesian Learning

Xuran Meng

Sitemap

Pages

Posts

publications

talks

teaching