Publications

You can also find my articles on my Google Scholar profile.
  1. On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
    NeurIPS, 2025. ★ Spotlight
    Shaocong Ma, Heng Huang.
  2. Robust Reinforcement Learning in Finance: Modeling Market Impact with Elliptic Uncertainty Sets
    NeurIPS, 2025.
    Shaocong Ma, Heng Huang.
  3. Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations
    ICLR, 2025. ★ Spotlight
    Shaocong Ma, Heng Huang.
  4. Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
    TMLR, 2025.
    Shaocong Ma, Ziyi Chen, Yi Zhou, Heng Huang.
  5. Deep Learning of PDE Correction and Mesh Adaption without Automatic Differentiation
    Machine Learning, 2025.
    Shaocong Ma, James Diffenderfer, Bhavya Kailkhura, Yi Zhou.
  6. Stochastic Optimization Methods for Policy Evaluation in Reinforcement Learning
    Foundations and Trends® in Optimization, 2024.
    Yi Zhou, Shaocong Ma.
  7. Decentralized Robust V-Learning for Solving Markov Games with Model Uncertainty
    JMLR, 2023.
    Shaocong Ma, Ziyi Chen, Shaofeng Zou, Yi Zhou.
  8. End-to-End Mesh Optimization of a Hybrid Deep Learning Black-Box PDE Solver
    NeurIPS, 2023 (ML4PS Workshop).
    Shaocong Ma, James Diffenderfer, Bhavya Kailkhura, Yi Zhou.
  9. Finding Correlated Equilibrium of Constrained Markov Game: A Primal-Dual Approach
    NeurIPS, 2022.
    Ziyi Chen, Shaocong Ma, Yi Zhou.
  10. Data Sampling Affects the Complexity of Online SGD over Dependent Data
    UAI, 2022.
    Shaocong Ma, Ziyi Chen, Yi Zhou, Kaiyi Ji, Yingbin Liang.
  11. Accelerated Proximal Alternating Gradient-Descent-Ascent for Nonconvex Minimax Machine Learning
    IEEE ISIT, 2022.
    Ziyi Chen, Shaocong Ma, Yi Zhou.
  12. Sample Efficient Stochastic Policy Extragradient Algorithm for Zero-Sum Markov Game
    ICLR, 2022.
    Ziyi Chen, Shaocong Ma, Yi Zhou.
  13. Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity
    ICLR, 2021.
    Shaocong Ma, Ziyi Chen, Yi Zhou, Shaofeng Zou.
  14. Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis
    NeurIPS, 2020.
    Shaocong Ma, Yi Zhou, Shaofeng Zou.
  15. Understanding the Impact of Model Incoherence on Convergence of Incremental SGD with Random Reshuffle
    ICML, 2020.
    Shaocong Ma, Yi Zhou.