工作经历
2018.08至今,幸运飞艇,教授
2013.08-2018.08,幸运飞艇,副教授
2012.07-2013.08,幸运飞艇,讲师
研究方向
统计模型在生物医学、决策预测、市场研究、教育心理领域的方法与应用研究,相关型数据分析,潜变量建模,特征选择方法与可重复性评价,文本挖掘,函数型数据分析
荣誉奖励
1. 《统计研究》优秀论文奖,三等奖,2021
2. 北京市统计科研优秀成果奖,二等奖,2014
3. 中华中医药学会科学技术奖,二等奖,2013
科研项目
生物医学统计
1. 长寿风险与肺癌保险实务案例研究,2025.
2. 基于数据科学方法的医疗保险研究,2024.
3. 患者自答结局(PRO)的Estimand设计及统计方法,2023-2025.(项目负责人)
4. 数字健康的统计测度与分析方法研究,2022-2025.(项目负责人,教育部人文社会科学重点研究基地重大项目)
5. 基于风险识别与评估的中药临床试验关键环节质量控制研究,2021-2024.
6. 重大传染病理论及模型研究,2021-2022.(项目负责人)
7. 儿童青少年远视储备检验方法和现状研究,2021-2021.(项目负责人)
8. 大规模流行病调查的分块加权估计方法,2020.(项目负责人,中国人民大学特设项目)
9. 混合研究方法评价中西医患共建模型的应用研究,2020-2021.(项目负责人)
10. 枳术宽中胶囊国际临床研究,2020-2020.(项目负责人)
11. 贝叶斯置信度增殖神经网络识别模型的建立,2019-2020.(项目负责人)
12. 中医医患共建式叙事医学诊疗决策模式的建立,2019-2020.(项目负责人)
13. 医患共建平行病历体现中医临床真实疗效的整体评价方法构建,2018-2021.(主要参与人,国家自然科学基金面上项目)
14. 基于症状指标与微观指标的亚健康状态潜变量证候量化诊断模型研究,2018-2021.(参与人,国家自然科学基金面上项目)
15. 研究平台建设与社区人群、心房颤动、大血管疾病队列建设,2016-2020.(项目负责人,国家重点研发计划精准医学研究重点专项子课题)
16. 生物医学大数据的统计方法基础研究,2015-2020.(项目负责人,中国人民大学品牌研究计划)
17. 北京地区流感特征监测及中医预警体系建设,2018.(项目负责人)
18. 真实世界注册研究基础上的晚期消化道恶性肿瘤幸存者中医优势人群特征分析,2016-2018.
19. 亚健康单证量化诊断的统计方法,2016-2017.(项目负责人)
20. 临床试验中的创新设计及统计方法,2014-2018.(参与人,国家自然科学基金面上项目)
21. 中医复杂干预试验设计与综合评价研究,2011-2014.(主要参与人,中医行业公益专项子课题)
22. 以心血管病为示范的中药新药临床评价研究技术平台,2014-2015.
23. 城乡居民医疗消费状况与社会保障政策评价研究,2014.
24. 北京市地区季节性流感发病与中医药预防情况回顾性调查研究,2013-2014.(项目负责人)
25. 我国养老与医疗保障政策评价,2012-2015.
26. 构建表征证候动态变化纵向结局评价指标的示范研究,2009-2012.(参与人,国家自然科学基金面上项目)
27. 名老中医临床经验数据统计分析、整理与挖掘方法学研究,2007-2008.
28. 中医临床疗效评价技术探索,2006-2009.(主要参与人,十一五科技支撑计划)
29. 利用个体化诊疗平台评价中医药延缓糖尿病血管并发症疗效的研究,2006-2008.
30. 二型糖尿病中医证候及疗效评价指标体系建立研究,2006-2007.
决策与预测
1. 动态复杂场景下多模态大模型时序鲁棒预测方法研究,2026-2027.(项目负责人)
2. 网络结构数据的因果推断研究,2026-2029.(项目负责人,国家自然科学基金面上项目)
3. 服务国家治理的政府数据开发方法与应用,2024-2027.(项目负责人)
4. 中国式现代化统计评价与动态监测研究,2024-2026.
5. 变量选择不确定性评价方法及其在管理科学中的应用,2023-2026.(项目负责人,国家自然科学基金面上项目)
6. 面向大规模网络结构数据的统计模型理论与应用研究,2022-2024.
7. 工业大数据统计测度理论及应用,2022-2016.(子课题负责人,国家社会科学基金重大项目)
8. 模型置信界的理论与方法研究,2021-2024.(项目负责人)
9. 突发事件的网络内容传播与治理研究,2021-2021.
10. 健康中国2030背景下的健康老龄化体系优化研究,2020-2022.(子课题负责人,教育部哲学社会科学研究重大课题攻关项目)
11. 新冠肺炎和工作迁徙,2020-2021.(联合负责人,世界大学联盟科研专项)
12. 大数据背景下的整合调查设计与分析技术,2020-2021.(项目负责人,全国统计科学研究项目重大项目)
13. 管理科学中变量选择模型的评价方法,2019.(项目负责人,中国人民大学星火计划)
14. 问卷分割设计与分析技术研究,2018-2021.(项目负责人,国家自然科学基金面上项目)
15. 基于整合分析的多源异质性数据变量选择方法,2017-2018.(项目负责人)
16. 农业统计工作方法与技术,2019-2020.(项目负责人)
17. 百度舆情风险预测,2016-2017.(项目负责人)
18. 基于GIS系统的经济决策分析系统建设,2015.
19. 预测模型的结构化变量选择方法研究,2014-2016.(项目负责人,国家自然科学基金青年项目)
20. 纵向数据模型的理论与应用,2014-2016.(项目负责人,教育部留学回国人员科研启动基金)
21. 高维异质性数据的特征选择方法研究,2014-2015.(项目负责人,教育部重点研究基地自主项目)
22. 复杂空间和时空数据的统计模型研究,2014-2016.(参与人,国际自然科学基金青年项目)
23. 纵向边际化随机效应模型应用研究,2013-2015.(项目负责人,中国人民大学科学研究基金)
24. 乡村旅游统计监测方法研究,2013-2014.
25. 中国税收动态预测研究,2009-2013.
26. 中国纵向数据分析的方法及其应用研究,2009-2012.(主要参与人,教育部重点研究基地重大项目)
27. 中国医疗保险现状宏观分析研究,2010-2013.
市场研究
1. 成品油市场精准营销博弈算法求解研发,2025-2026.(项目负责人)
2. 广播电视与网络视听节目收视综合评价大数据系统扩容,2023-2024. (项目负责人)
3. 咪咕元宇宙基础理论研究,2023-2024.
4. 现代视听全产业链统计分类研究,2023-2024.
5. 电视剧影响力综合评价算法研究,2021-2021.(项目负责人)
6. 有责任消费指标体系与影响因素研究,2019-2020.(项目负责人)
7. 加油站地理信息数据分析模型,2019.(项目负责人)
8. 基于互联网文本数据的央视春晚内容创新与融合传播研究,2017.(项目负责人)
9. 基于交易大数据与空间信息的加油站运行管理研究,2016-2017.(项目负责人)
10. 互联网代驾APP的潜在市场与单量预测,2016.
11. 基于文本模型的互联网时尚指数研究,2015.
12. 热门剧目流量预测研究,2015.
13. 体育赛事商业价值研究,2015
14. 广州市加油站顾客消费行为研究,2014-2015.
15. 中关村企业区域对外合作需求研究,2014-2015.
16. 银行客户理财产品推荐模型研究,2014.(项目负责人)
17. 网络文本的情感分类研究,2014.
18. 中国人身保险行业客户满意度研究,2014.
19. 互联网行业品牌价值与竞争力分析研究,2013-2014.
20. 餐饮企业展店选址研究,2013-2014.
21. 中国汽车行业地域优势量化研究,2012-2014.
22. 电子产品消费行为态度习惯研究,2008.
23. 中远物流、泰康人寿客户满意度测评研究,2005-2008.
24. 北京中端房地产用户区隔及市场定位研究,2006.
教育与心理
1. 高校领导班子和领导干部考核评价研究,2022-2024.
2. 数据科学时代的调查与商业分析融合教学,2020-2021.(项目负责人,教育部产学合作协同育人项目)
3. 面向世界一流大学的本科人才选拔机制研究,2016-2018.(主要参与人,中国人民大学重大项目)
4. 高校学生政治认同相关问题的调查研究,2016.
5. 首都高校大学生网络信息消费现状及对策研究,2015-2016.(主要参与人,中国人民大学社会调查项目)
6. 大学生道德的综合评价与影响因素研究,2014-2015.(项目负责人,中国人民大学人文社会科学国际期刊发表培育项目)
7. 社会心态测量及对政策扭曲影响实证研究,2014-2015.
8. 学生军训效果评估研究,2014-2015.
9. 当代大学生舆情动态调查方法与应用研究,2014.
10. 精细化管理模式下的校园满意度研究,2014.
11. 北京青年政治学院教工满意度研究,2013-2014.
12. 大学校园卡数据挖掘与可视化研究,2013.
13. 北京市大学生就业意向调查研究,2013.
14. 大学生生活满意度追踪调查研究,2012.
15. 中国人民大学校园满意度测评研究,2007-2017.
16. 奥运会志愿者成果转化调研,2008.
17. 首都大学生社会主义荣辱观研究,2007.
科研发表
统计方法与应用论文
1. Yang, H., Qin, Y., Li, Y*. (2026) Design strategies for networked experiments via interference balancing. Journal of the Business and Economic Statistics, Published Online.
2. Mei, H., Xu, X., Yang, H., Wang, F., Li, Y*. (2026) A practical review of response-adaptive randomization: under-explored challenges and potential directions. Statistical Methods in Medical Research, Published Online.
3. Li, R., Qin, Y., Li, Y*. (2026) Assessing estimation uncertainty under model misspecification. Scandinavian Journal of Statistics, 53(1), 268-290.
4. Wu, M., Li, Y., Wu, M*. (2025) Joint identification of spatially variable genes via a network-assisted bayesian regularization approach. Annals of Applied Statistics, 19(4), 2705-2723.
5. Peng, J., Li, Y*., Yang, Y*. (2025) On optimality of mallows model averaging. Journal of the American Statistical Association, 120(550), 1152-1163.
6. Mei, H., Xiao, H., Shia, B-C., Qiao, G., Li, Y*. (2025) Interconnections of multimorbidity-related clinical outcomes: analysis of health administrative claims data with a dynamic network approach. Statistics in Medicine, Published Online.
7. Li, R., Qin, Y., Li, Y*. (2025) Uncertainty assessment for generalized linear models via local bootstrapping. Science China Mathematics (in Chinese), Published Online.
8. Li, R., Xu, S., Li, Y., Tang, Z., Feng, D., Cai, J., Ma, S*. (2024) Incorporating prior information in gene expression network-based cancer heterogeneity analysis. Biostatistics, Published Online.
9. Zhai, G., Li, Y., Zhang, L., Hu, F*. (2024) Group response-adaptive randomization with delayed and missing responses. Statistics in Medicine, 43(27), 5047-5059.
10. Wang, F., Jia, K., Li, Y*. (2024) Integrative deep learning with prior assisted feature selection. Statistics in Medicine, 42(20), 3972-3814.
11. Yang, H., Qin, Y., Li, Y*., Hu, F. (2024) Sequential covariate-adjusted randomization via hierarchically minimizing mahalanobis distance and marginal imbalance. Biometrics, 80(2).
12. Wang, F., Xu, S., Shen, Y., Qin, Y., Li, Y*. (2024) Sparse clustering for customer segmentation with high-dimensional mixed type data. The Annals of Applied Statistics, 18(3), 2382-2402.
13. Li, Y., Wu, M., Wu, M*., Ma, S*. (2024)Identification of influencing factors on self-reported count data with multiple potential inflated values. The Annals of Applied Statistics, 18(2), 991-1009.
14. Qin, Y., Li, Y., Ma, W., Yang, H., Hu, F*. (2023) Adaptive randomization via Mahalanobis distance. Statistica Sinica, 34(1), 353-375.
15. Wang, F., Duan, C., Li, Y., Huang, H*., Shia, B-C. (2023) Spatiotemporal varying coefficient model for respiratory disease mapping in Taiwan. Biostatistics, 25(1), 40-56.
16. Li, Y., Wu, M., Ma, S., Wu, M*. (2023) ZINBMM: a general mixture model for simultaneous clustering and gene selection using single-cell transcriptomic data. Genome Biology, 24, 208.
17. Mei, H., Xie, J., Qin, Y., Li, Y*. (2023) Network and covariate adjusted response-adaptive design for binary response. Statistics in Medicine, 42(29), 5396-5388.
18. Li, Y., Qi, L., Qin, Y., Lin, C*., Yang, Y. (2023) Block weighted least squares estimation for nonlinear cost-based split questionnaire design. Journal of Official Statistics, 39(4), 459-487.
19. Li, Y., Ren, X., Yu, H., Sun, T*., Ma, S. (2023) PIN: A penalized integrative deep neural network for variable selection among multiple omics datasets. The 21th Asia Pacific Bioinformatics Conference, Published Online.
20. Zhao, J., Li, Y. (2023) Binary multi-view sparse subspace clustering. Neural Computing and Application, 35, 21751-21770.
21. Wang, F., Liang, D., Li, Y*., Ma, S*. (2023) Prior information assisted integrative analysis of multiple genetic datasets. Bioinformatics, 39(8), 1-9.
22. Wu, M., Wang, F., Ma, S., Li, Y*. (2023) Bi-level structured functional analysis for genome-wide association studies. Biometrics, 79(4), 3359-3373.
23. Li, Y., Yang, H., Yu, H., Huang, H., Shen, Y*. (2023) Penalized weighted least squares estimate for variable selection on correlated multiply-imputed data. Journal of the Royal Statistical Society Series C, 72(3), 703-717.
24. Wang, F., Li, H., Wang, H., Li, Y*. (2023) Spatial correlated incidence modeling with zero-inflation. Biometrical Journal, 65(4), 1-20.
25. Li, Y., Yang, H., Yu, H., Huang, H., Shen, Y*. (2023) Penalized estimating equations for generalized linear models with multiple imputation. The Annals of Applied Statistics, 17(3), 2345-2363.
26. Zhan, Z., Li, Y., Lin, C*., Yang, Y. (2023) Model averaging for semiparametric varying coefficient quantile regression models. Annals of the Institute of Statistical Mathematics, 75, 649-681.
27. Li, Y., Liang, D., Ma, S., Ma, C*. (2023) Spatio-temporally smoothed deep survival neural network. Journal of Biomedical Informatics, 137, 1-9.
28. Yang, H., Qin, Y., Wang, F., Li, Y*., Hu, F. (2023) Balancing covariates in multi-arm trials via adaptive randomization. Computational Statistics & Data Analysis, 179, 1-15.
29. Lin, C., Peng, J., Qin, Y., Li, Y*., Yang, Y. (2023) Optimal integrating learning for split questionnaire design type data. Journal of Computational and Graphical Statistics, 32(3), 1009-1023.
30. Qin, Y., Wang, L., Li, Y., Li, R*. (2023) Visualization and assessment of model selection uncertainty. Computational Statistics & Data Analysis, 178, 1-20.
31. Zhang, Z., He, Z., Qin, Y., Shen, Y., Shia, B-C., Li, Y*. (2023) Variable selection with scalable bootstrapping in generalized linear model for massive data. Journal of Data Science, 21(1), 87-105.
32. Chen, Y., Li, Y., Smith, M.L., Li, C., Shen, Y*. (2023) Analyzing evidence-based falls prevention data with significant missing information using variable selection after multiple imputation. Journal of Applied Statistics, 50(3), 724-743.
33. Lin, C., Qiao, N., Zhang, W., Li, Y*., Ma, S. (2022) Default risk prediction and feature extraction using a penalized deep neural network. Statistics and Computing, 32, 1-17.
34. Li, Y., Wang, F., Shen, Y., Qin, Y., Si, J*. (2022) Selection of mixed copula for association modeling with tied observations, 31, 1127-1180.
35. Zhang, J., Fan, X*., Li, Y*., Ma, S. (2022) Heterogeneous graphical model for non-negative and non-gaussian PM2.5 data. Journal of the Royal Statistical Society Series C, 71(5), 1303-1329.
36. Chen, J., Bie, R., Qin, Y., Li, Y*., Ma, S. (2022) Lq-based robust analytics on ultrahigh and high dimensional data. Statisitcs in Medicine, 41(26), 5220-5241.
37. Li, Y., Xu, S., Ma, S., Wu, M*. (2022) Network-based cancer heterogeneity analysis incorporating multi-view of prior information. Bioinformatics, 38(10), 2855-2862.
38. Li, Y., Wang, F., Wu, M*., Ma, S. (2022) Integrative functional linear model for genome-wide association studies with multiple traits. Biostatistics, 23(2), 574-590.
39. Qin, Y., Yu, L., Li, Y*. (2021) Iterative connecting probability estimation for networks. NeurIPS 2021, Published Online.
40. Li, Y., Li, R., Qin, Y., Lin, C*., Yang, Y. (2021) Robust group variable screening based on maximum Lq-likelihood estimation. Statistics in Medicine, Published Online.
41. Wang, L., Qin, Y*., Li, Y. (2021) Confidence graphs for graphical model selection. Statistics and Computing, 31, 1-21.
42. Li, Y., Ma, W*., Qin, Y., Hu, F. (2021) Testing for treatment effect in covariate-adaptive randomized trials with generalized linear models and omitted covariate. Statistical Methods in Medical Research, 30(9), 2148-2164.
43. Wang, F., Yang, Y., Tso, G.K.F., Li, Y*. (2021) The determinants of social promotion success: A case study of crowdfunding projects. Applied Stochastic Models in Business and Industry, 37(2), 251-265.
44. Li, Y., Wang, F., Li, R., Sun, Y*. (2020) Semiparametric integrative interaction analysis for non-small-cell lung cancer. Statistical Methods in Medical Research, 29(10), 2865-2880.
45. Ma, C., Li, Y., Shia, B-C., Ma, S*. (2020) Human disease cost network analysis. Statistics in Medicine, 39(9), 1237-1249.
46. Li, Y., Li, Y*., Qin, Y., Yan, J. (2020) Copula modeling for data with ties. Statistics and Its Interface, 13, 103-117.
47. Ma, W., Qin, Y., Li, Y., Hu, F*. (2020) Statistical Inference for Covariate-Adaptive Randomization Procedures. Journal of American Statistical Association, 115(531), 1488-1497.
48. Wang, F., Yang, Y., Tso, G.K.F., Li, Y*. (2019) Analysis of launch strategy in cross-border e-commerce market via topic modeling of consumer reviews. Electronic Commerce Research, 19, 863-884.
49. Sun, Y., Sun, Z., Jiang, Y., Li, Y., Ma, S*. (2019) An integrative sparse boosting analysis of cancer genomic commonality and difference. Statistical Methods in Medical Research, Published Online.
50. Li, Y., Li, R., Qin, Y., Wu, M., Ma, S*. (2019) Integrative interaction analysis using threshold gradient directed regularization. Applied Stochastic Models in Business and Industry, 35(2), 354-375.
51. Sun, Y., Jiang, Y., Li, Y., Ma, S*. (2019) Identification of cancer omics commonality and difference via community fusion. Statistics in Medicine, 38(7), 1200-1212.
52. Li, Y., Li, R., Qin, Y., Lin, C*., Ma, S*. (2019) Penalized integrative semi-parametric interaction analysis for multiple genetic datasets. Statistics in Medicine. Published Online.
53. Li, Q., Tso, G.K.F., Qin, Y., Lovejoy, T. I., Heckman, T. G., and Li, Y*. (2019) Penalized multiple inflated values selection method with application to SAFER data. Statistical Methods in Medical Research, 28(10-11), 3205-3225.
54. Li, Y., Luo, Y., Ferrari, D., Hu, X., Qin, Y*. (2019) Model confidence bounds for variable selection. Biometrics, 75, 392-403.
55. Li, Y., Luo, Y., Ferrari, D., Hu, X., Qin, Y*. (2019) Rejoinder to discussion on model confidence bounds for variable selection. Biometrics, 75, 411-413.
56. Li, Y., Bie, R., Teranhidalgo, S., Qin, Y., Wu, M*., Ma, S*. (2018) Assisted gene expression-based clustering with AWNCut. Statistics in Medicine, 37, 4386-4403.
57. Li, L., Li, Y*., Qin, Y., Wang, L., Chen, J., Yi, D. (2016) Adaptive stochastic gradient boosting tree with composite criterion. Journal of Statistical Computation and Simulation, 86(10), 1901-1911.
58. Li, Y., Qin, Y., Wang, L., Chen, J., Ma, S*. (2016) Grouped variable selection using area under ROC with adjustable predicted threshold. Communication in Statistics: Computation and Simulation, 45(4), 1268-1280.
59. Li, Y., Yu, C., Qin, Y., Wang, L., Chen, J., Yi, D., Shia, B-C., Ma, S*.(2015) Regularized receiver operating characteristic based logistic regression for grouped variable selection with composite criterion. Journal of Statistical Computation and Simulation, 85(13), 2582-2595.
60. Zeng, X., Ma, S., Qin,Y., Li, Y*. (2015) Variable selection in semiparametric models for the strong hierarchical longitudinal data. Statistics and Its Interface, 8(3):355-365.
61. Li, J., Yi, D., Qin, Y., Shen, Y., Li, Y*. (2014) Feature selection for support vector machine in the study of financial early warning system. Quality and Reliability Engineering International, 30(6), 867-877.
62. Zhao, Q., Shi, X., Huang, J., Liu, J., Li, Y., Ma, S*. (2014) Integrative analysis of "-omics" data using penalty functions. WIREs Computational Statistics, 7, 99-108.
63. Li, Y., Qin, Y., Xie, Y., Tian, F. (2013) Grouped penalization estimation of osteoporosis data in Traditional Chinese Medicine. Journal of Applied Statistics , 40(4), 699-711.
64. Li, Y., Yi, D., Zhang, H., Qin, Y. (2012) Syndrome evaluation in Traditional Chinese Medicine using second-order latent variable model. Statistics in Medicine, 31(7), 672-680.
65. Li, Y., Allore, H.G., Lin, H. (2011) Modeling longitudinal multinomial response in presence of death. The Gerontologist, Peer-Reviewed Abstract, 51(2), 555.
66. Ma, S., Shi, M., Li, Y., Yi, D., Shia, B.-C. (2010) Incorporating gene coexpression network in identification of cancer prognosis markers. BMC Bioinformatics, 11, 271–279.
67. Ma, S., Huang, J., Shi, M., Li, Y., Shia, B.-C. (2010) Semiparametric prognosismodels in genomics studies. Briengs in Bioinformatics, 11(4), 385–393.
68. 李嵘,秦祎辰,李扬*. (2025) 基于局部自助法的广义线性模型不确定性度量. 中国科学:数学,在线发表.
69. 林存洁,祁乐,李扬*. (2025) 基于简约化模型平均方法的收入不平等问题预测研究. 系统科学与数学,在线发表.
70. 孔艳蕾,秦祎辰,李扬*. (2025) 基于阈值距离加权损失的股票收益预测. 计量经济学报,5(1), 35-51.
71. 汤惠云,李扬,王菲菲*. (2024) 基于隐私保护的多源数据分析. 系统科学与数学,在线发表.
72. 徐少东,李扬,边策*. (2024) 高维混合型数据的异质性分析. 系统科学与数学, 44(8), 2429-2457.
73. 林存洁,熊照,李扬*. (2024) 先验中介分析及其应用. 系统科学与数学, 44(7), 2122-2145.
74. 王菲菲,贾珂,张开宇,李扬*. (2024) 融合先验信息的整合财务预警模型研究. 统计研究, 41(5), 137-149.
75. 李扬,谢嘉欣,秦祎辰,杨昊宇*. (2023) 经济试验中的社交网络调整自适应随机化方法. 统计研究, 40(10), 151-160.
76. 陈芳露,李扬,秦祎辰,杨昊宇*. (2023) 经济试验中的协变量平衡自适应设计. 计量经济学报, 3(4), 936-947.
77. 杨昊宇,秦祎辰,李扬*. (2022) 问卷分割设计的成组序贯子问卷分配方法. 系统科学与数学, 42(1), 17-34.
78. 李嵘,张文丽,李扬,林存洁*. (2022) 基于深度学习的大规模肿瘤数据生存分析. 中国卫生统计, 39(1), 84-87.
79. 史兴杰,王赛旎,李扬*. (2020) 高维数据的稳健二分类方法. 统计研究, 37(9), 95-105.
80. 李扬*,祁乐,聂佩芸. (2020) 大规模数据的随机森林算法. 统计与信息论坛, 35(6), 24-33.
81. 边策,祁乐,李扬. (2020) 大数据时代的整合调查设计:以校园满意度调查为例. 调研世界,4, 56-61.
82. 袁卫,李扬. (2019) 我国首位有国际重大影响的统计学家:吴定良. 兰州财经大学学报,35(5), 1-6.
83. 李扬*,张长,朱建平. (2018) 融合统计思想的大数据算法. 统计研究,35(7), 125-128.
84. 李扬,许文甫,马双鸽. (2018) 污染数据的稳健稀疏成组变量选择方法研究. 统计与信息论坛,33(6), 26-34.
85. 斯介生,李扬,谢邦昌. (2017) 基于异质性数据的Logit变量选择模型研究. 统计研究,34(12),110-118.
86. 刘艳,李扬,刘罡,张育铭. (2017) 纵向有序数据的临床评价方法研究. 中国卫生统计,34(1), 1-5.
87. 李扬,孔雯婧,谢邦昌. (2016) 基于主题模型的半监督网络文本情感分类研究. 数理统计与管理,35(6),961-971.
88. 李扬,李竟翔,马双鸽. (2016) 不平衡数据的企业财务预警模型研究. 数理统计与管理,35(5),893-906.
89. 林存洁,李扬. (2016) 大数据分析仍需要统计思想-以ARGO模型为例. 统计研究,33(11):109-112.
90. 李扬,赵青,马双鸽. (2016) 生物统计的研究进展与挑战. 统计研究,33(6), 3-12.
91. 斯介生,宋大我,李扬. (2016) 大数据背景下的谷歌翻译:现状与挑战. 统计研究,33(5): 109-112.
92. 蔺顺锋,易丹辉,李扬,李京. (2016) 稀疏函数型回归模型变量选择及实证分析. 数学的实践与认识,46(11):171-177.
93. 于强,耿曙,李扬. (2015) 两岸交流对台湾民众的影响研究:利益、认同与投票行为. 理论月刊,2, 111-117.
94. 李扬,李竟翔,王园萍. (2015) 基于AUC回归的特征选择模型研究. 统计与信息论坛,30(5), 10-16.
95. 李扬,孙泽烨,胡镜清. (2015) 改进的神经网络综合评价模型研究. 数学的实践与认识,10, 59-68.
96. 李扬,曾宪斌. (2014) 面板数据模型的惩罚似然变量选择方法研究. 统计研究,31(3),83-89.
97. 李淞淋,李扬,易丹辉. (2014) 有监督Group MCP变量选择方法的稳健性研究. 统计与信息论坛,29(6),11-17.
98. 李扬,朱建锋,谢邦昌. (2013) 变量选择方法及其在健康食品市场研究中的应用探索. 统计与信息论坛,28(10), 17-25.
99. 谢邦昌, 李扬. (2008) 数据挖掘与商业智能的现状及未来发展. 统计与信息论坛, 23(5), 94–96.
医学、流行病学论文
1. Wu, S., Feng, Y., Cheng, H., Huang, H., Li, Y., Ling, F., Ma, P*., Zhong, W*., Shen, Y*. (2025) Personalized risk score prediction and testing policy adaptations of a COVID-19 population-based contact tracing network. Epidemiology and Infection, Published Online.
2. Zhang, Q., Wang, F., Feng, H., Xing, J., Zhu, S., Zhang, H., Li, Y*., Wei, W*., Zhang, S*. (2024) Modifiable risk factors for esophageal cancer in endoscopic screening population. Chinese Medical Journal, 137(3), 350-352.
3. Tan, J., Ge, Y., Martinez, L., Sun, J., Li, C., Westbrook, A., Chen, E., Pan, J., Li, Y., Cheng, W., Ling, F., Chen, Z., Shen, Y*., Huang, H*. (2022) Transmission roles of symptomatic and asymptomatic COVID-19 cases: A modelling study. Epidemiology and Infection, Published Online.
4. Li, Y., Kong, Y., Ebell, M., Martinez, L., Cai, X., Lennon, R., Tarn, D., Mainous, A., Zgierska, A., Barrett, B., Tuan, W-J., Maloy, K., Goyal, M., Krist, A., Sung, M-H., Li, C., Jin, Y., Shen, Y*. (2022) Development and validation of a two-step predictive risk stratification model for coronavirus disease 2019 In-hospital mortality. Frontiers in Medicine, Published Online.
5. Ge, Y., Chen, Z., Handel, A., Martinez, L., Xiao, Q., Li, C., Chen, E., Pan, J., Li, Y*., Ling, F*., Shen, Y*. (2021) The impact of social distancing, contact tracing, and case isolation interventions to suppress the COVID-19 epidemic: A modeling study. Epidemics, Published Online.
6. Wang, F., Chen, T., Chang, Q., Kao, Y-W., Li, J., Chen, M., Li, Y*., Shia, B-C. (2021) Respiratory disease are positively associated with PM2.5 concentrations in different areas of Taiwan. PLOS One, Published Online.
7. Shen, Y., Li, C., Dong, H., Wang, Z., Martinez, L., Sun, Z., Handel, A., Chen, Z., Chen, E., Ebell, M., Wang, F., Yi, B., Wang, H., Wang, X., Wang, A., Chen, B., Qi, Y., Liang, L., Li, Y., Ling, F*., Chen, J., Xu, G*. (2020) Community outbreak investigation of SARS-CoV-2 transmission among bus riders in eastern China. Journal of the American Medical Association Internal Medicine, 180(12), 1665-1671.
8. Ma, C., Zhang, Y., Li, Y., Wang, Y., Jiang, Y., Wang, X., Ma, S*. (2020) Healthcare, insurance, and medical expenditure of the floating population in Beijing. Frontiers in Public Health, Published Online.
9. Li, Y., Lu, Q., Wang, Y*., Ma, S. (2020) Racial difference in testicular cancer in the United States: descriptive epidemiology. BMC Cancer, Published Online.
10. Ma, C., Li, Y., Wang, X., Ma, S. (2020) The survey approach: outcome evaluation research on health insurance and health care in China. SAGE Research Methods Cases, Published Online.
11. Li, Y., Zhang, Z., Feng, Q., Yi, D., Lu, F*. (2019) An efficacy evaluation method for non-normal outcomes in randomized controlled trials. Scientific Reports, Published Online.
12. Wang, Y., Zhang, Y., Ma, C., Jiang, Y., Li, Y., Wang, X., Ma, S. (2019) Limited effects of the comprehensive pricing healthcare reform in China. Public Health, 175, 4-7.
13. Feng, S., Chen, L., Tian, G., Hu, J., Ding, Y., Du, Z., Zhang, H., Li, B*., Li, Y. (2019) Validity and reliability of patient section of evidence-based medical records about doctor-patient building through integrated therapy of traditional Chinese and western medicine (DPEBMR-P) in patients with gastrointestinal diseases. Annals of Translational Medicine, 7(6), 121-126.
14. Ma, C., Jiang, Y., Li, Y., Zhang, Y., Wang, X., Ma, S., Wang, Y. (2019) Healthcare underutilization in middle-aged and elderly adults in China. Public Health, 166, 65-68.
15. Wang, Y., Chang, Q., Li, Y*. (2018) Racial differences in Urinary Bladder Cancer in the United States. Scientific Reports. Published Online.
16. Li, Y., Zhao, Y., Yi, D., Wang, X., Jiang, Y., Wang, Y., Liu, X., Ma, S. (2017) Differences still exist across insurance schemes in China post-consolidation. PLOS One. Published Online.
17. Li, Y., Sun, Y., Zhang, Y., Yi, D., Ma, C., Ma, S. (2016) Rural–urban disparity in health care: observations from Suzhou, China. Public Health, 138, 164-167.
18. Chu, Y., Dai, L., Qi, S., Huang, H., Li, Y., Shen, Y. (2016) Challenges from Ethnic Heterogeneity across Regions in Cost Effectiveness Analysis in MRCT. Frontiers in Pharmacology, Published Online DOI:10.3389/fphar.2016.00371.
19. Jiang, Y., Wang, Y., Li, Y., Zhang, Y., Zhao, Y., Wang, X., Ma, C., Ma, S. (2016) Inpatient treatment for the middle-aged and elderly: observations from Henan, China. Public Health. Published Online.
20. Jiang, Y., Wang, Y., Li, Y., Wang, X., Ma, S. (2015) Prevalence, characteristics, and cost of self-treatment in the middle-aged and elderly: observations from Henan, China. Public Health, 129(5), 597-600.
21. Li, Y., Wang, Y., Wang, Z., Yi, D., Ma, S. (2015) Racial differences in three major NHL subtypes: descriptive epidemiology. Cancer Epidemiology, 39, 8-13.
22. Li, P., Chen, Y., Liu, J., Hong, J., Deng, Y., Yang, F., Jin, X., Gao, J., Li, J., Fang, H., Liu, G., Shi, L., Du, J., Li, Y., Yan, M., Wen, Y., Yang, W. (2015) Efficacy and safety of Tangshen formula on patients with type 2 diabetic kidney disease: A multicenter double-blinded randomized placebo-controlled trial. PLOS One, Published Online.
23. Shen, Y*., Anderson, A., Sinha, R., Li, Y. (2014) Joint modeling tumor burdenand time to event in oncology trials. Pharmaceutical Statistics, 13(5):286-293.
24. Yi, D., Li, Y., Shao, S., Xie, Y., Yuwen, Y. (2013) Evaluate the conjoint efficacy in the Traditional Chinese Medicine with the longitudinal latent variable linear mixed model. Chinese Journal of Integrative Medicine, 19(8), 629-635.
25. Jiang, Y., Wang, Y., Zhang, L., Li, Y., Wang, X., Ma, S. (2013) Access to healthcare and medical expenditure for the middle-aged and elderly: observations from China. PLOS One, Published Online.
26. Wang, Y., Jiang, Y., Li, Y., Wang, X., Ma, S. (2013) Health Insurance Utilization and its Impact: Observations from the Middle-aged and Elderly in China. PLOS One, Published Online.
27. Xu, J., Li, Y., Lin, H., Sinha, R., Potenza, M.N. (2013) Body Mass Index correlates negatively with white matter integrity in the Fornix and Corpus Callosum: a diffusion tensor imaging study. Human Brain Mapping, 34, 1044-1052.
28. Wang, L., Zhao, X., Wu, X., Li, Y., Yi, D., Cui, H., Chen, J. (2012) Diagnosis analysis of 4 TCM patterns in sub-optimal health status: a structural equation modeling approach. Evidence-Based Complementary and Alternative Medicine, Published Online.
29. Han, X., Li, Y., Huang, J., Zhang, Y., Holford, T., Lan, Q., Rothman, N., Zheng, T., Kosorok, M.R., Ma, S. (2010) Identification of predictive pathways for non-hodgkin lymphoma prognosis. Cancer Informatics, 9, 281–292.
30. Cai, J., Feng, Y., Cheng, H., Ge, Y., Guo, Z., Huang, H., Li, C., Li, Y., Ling, F., Martinez, L., Ma, P*., Zhong, W*. (2026) Pseudo-time reconstruction for analyzing transmission direction in Covid-19 contact-tracing data. Epidemics, Published Online.
31. Wu, S., Feng, Y., Cheng, H., Huang, H., Li, Y., Ling, F., Ma, P*., Zhong, W*., Shen, Y*. (2025) Personalized risk score prediction and testing policy adaptations of a COVID-19 population-based contact tracing network. Epidemiology and Infection, Published Online.
32. 蒋妍,张育铭,朱彦頔,李扬,王晓军,马双鸽,王瑜. (2021) 宁夏回族自治区中老年居民医保使用和医疗消费现状调查研究, 中国预防医学杂志,21(7): 760-768.
33. 贾萍萍,李扬. (2019) 不平衡数据的关键因素筛选方法. 世界科学技术-中医药现代化,21(3): 389-394.
34. 陆芳,李淞淋,李扬,易丹辉,唐旭东. (2018) 联合模型在中药多类型结局疗效评价中的应用探讨,中国新药杂志,27(13): 1483-1490.
35. 马晨瑾,王瑜,蒋妍,李扬,张育铭,王晓军,马双鸽. (2018) 河南省中老年人群医疗保险使用和医疗费用支出情况调查,中国公共卫生,34(5): 655-659.
36. 王瑜,蒋妍,李扬,张育铭,王晓军,马双鸽. (2016) 河南省中老年居民门诊治疗费用及就医选择调查,现代预防医学,43(18): 3347-3352.
37. 王瑜,蒋妍,王晓军,李扬,马双鸽. (2014) 中国老年人口医疗保险使用现状调查研究,保险研究,5, 116-127.
38. 王利敏,曲彬彬,赵歆,陈家旭,李扬,易丹辉,崔华廷. (2011) 基于结构方程模型的亚健康状态中医证候量化分析研究,中华中医药杂志,26(5), 1033-1036.
39. 武常生,谢雁鸣,李扬,周平,易丹辉,翁维良,王永炎. (2011) 应用结构方程模型评估中医方案优化指标体系,世界科学技术-中医药现代化,13(4), 629-633.
40. 武常生,谢雁鸣,周平,李扬,易丹辉,翁维良,王永炎. (2011) 中医临床治疗性研究方案优化指标体系信度的评估,中国中医药信息杂志,18(2), 16-21.
软件包开发
1. mcb:Model Confidence Bounds, //cran.r-project.org/web/packages/mcb
2. censorcopula: Estimate Parameter of Bivariate Copula, //cran.r-project.org/web/packages/censorcopula
3. AsthmaNHANES: Data Sets and Examples from National Health and Nutritional Examination Survey (NHANES), //cran.r-project.org/web/packages/AsthmaNHANES
4. HGMND: Heterogeneous Graphical Model for Non-Negative Data, //cran.r-project.org/web/packages/HGMND
5. LqG: Robust Group Variable Screening based on Maximum Lq-Likelihood Estimation, //cran.r-project.org/web/packages/LqG
6. CARM: Covariate-Adjusted Adaptive Randomization via Mahalanobis-Distance, //cran.r-project.org/web/packages/CARM
7. minb: Multiple-Inflated Negative Binomial Model, //cran.r-project.org/web/packages/minb
8. vsmi: Variable Selection for Multiple Imputed Data, //cran.r-project.org/web/packages/vsmi
著作教材
1. 李扬、林存洁编著,《回归分析》,中国人民大学出版社,2024,ISBN: 9787300332635.
2. 李扬、李舰编著,《数据科学概论》,中国人民大学出版社,2021,SBN: 9787300290607.
3. 易丹辉、李扬译,《商务与经济统计学》,中国人民大学出版社,2015,ISBN: 9787300206783.
4. 李扬著,《复杂调查设计与建模》,科学出版社,2024,ISBN: 9787030783066.
5. 易丹辉、李扬著,《中医研究中的统计方法》,清华大学出版社,2020,ISBN: 9787302567332.
6. 谢邦昌、李扬,、匡宏波编著,《从数据采集到数据挖掘》,中国统计出版社,2009,ISBN:9787503756436.
教学课程
本科生课程:统计学、数据科学概论、时间序列分析、回归分析
研究生课程:统计咨询,生物统计,主文献研读
社会兼职
1. 2026 - 至今,副主任委员,教育部高等学校统计学类专业教学指导委员会
2. 2026 - 至今,Editorial Board Member,Briefings in Bioinformatics
3. 2026 - 至今,编委,《统计与信息论坛》
4. 2025 - 至今,编委,《统计与决策》
5. 2025 - 至今,会士,中国商业统计学会
6. 2025 - 至今,理事,IMS-China
7. 2025 - 至今,副主任委员,中国现场统计研究会医药与生物统计分会
8. 2025 - 至今,学术委员会委员,自然资源部自然资源市场监测分析与智能决策重点实验室
9. 2025 - 至今,学术委员会委员,教育部数字政府与国家治理实验室
10. 2024 - 至今,副主任,中国人民大学交叉科学学术委员会
11. 2024 - 至今,编委,《统计与精算》
12. 2024 - 至今,副会长,中国统计学会残障统计分会
13. 2023 - 至今,常务理事,中国统计学会
14. 2022 - 至今,委员,北京市科协第十届常务委员会决策咨询专门委员会
15. 2021 - 2025,Representative Council Member, International Biometrical Society
16. 2021 - 至今,常务理事,中国现场统计研究会
17. 2021 - 至今,副主任委员,世界中医药学会联合会临床疗效评价专业委员会
18. 2021 - 2024,副理事长,北京企业评价协会
19. 2020 - 至今,副会长,中国商业统计学会
20. 2020 - 至今,副会长,中国商业统计学会市场调查与教学研究分会
21. 2020 - 至今,成员,《中国前列腺癌筛查与早诊早治指南》专家组
22. 2020 - 2025,委员,北京癌症防治学会精准预防专业委员会
23. 2020 - 至今,Associate Editor,Biostatistics and Epidemiology
24. 2020 - 至今,Associate Editor,Journal of Data Science
25. 2019 - 2023,学术委员会委员,临床流行病学北京市重点实验室
26. 2018 - 至今,监事长,北京生物医学统计与数据管理研究会
27. 2018 - 2022,常务理事,全国工业统计学教学研究会
28. 2017,客座副教授,耶鲁大学生物统计系
29. 2017 - 2019,副书记,共青团中国人民大学委员会
30. 2015 - 2018,副秘书长,北京生物医学统计与数据管理研究会
31. 2013 - 至今,推选会员,国际统计学会
