参考文献/References:
[1] 冯兴杰, 贺阳. 基于节点性能的Hadoop作业调度算法改进[J]. 计算机应用与软件, 2017, 34(5): 223-228.
[2] 简琤峰, 平靖, 张美玉. 面向边缘计算的Storm边缘节点调度优化方法[J]. 计算机科学, 2020, 47(5): 277-283.
[3] ZAHARIA M, CHOWDHURY M, FRANKLIN M J, et al. Spark: cluster computing with working sets[C]//Proceedings of the 2nd USENIX conference on hot topics in cloud computing. New York: ACM, 2010: 10.
[4] WIKTORSKI T. Data-intensive systems: principles and fundamentals using Hadoop and Spark[M]. Cham: Springer International Publishing, 2019.
[5] 郑晓薇, 项明, 张大为, 等. 基于节点能力的Hadoop集群任务自适应调度方法[J]. 计算机研究与发展, 2014, 51(3): 618-626.
[6] XU X L, CAO L L, WANG X H. Adaptive task scheduling strategy based on dynamic workload adjustment for heterogeneous hadoop clusters[J]. IEEE Systems Journal, 2016, 10(2): 471-482.
[7] YONG M, GAREGRAT N, MOHAN S. Towards a resource aware scheduler in Hadoop[C]//Proc of the 7th IEEE International Conference on Web Services.[S.l.]: IEEE, 2009: 102-109.
[8] 徐佳俊, 刘功申, 苏波, 等. 基于Spark的异构集群调度策略研究[J]. 计算机科学与应用, 2016(11): 692-704.
[9] 胡亚红, 盛夏, 毛家发. 资源不均衡Spark环境任务调度优化算法研究[J]. 计算机工程与科学, 2020, 42(2): 203-209.
[10] CHAMBERS B, ZAHARIA M. Spark: the definitive guide[M]. 张岩峰, 王方京, 陈晶晶, 译. 北京: 中国电力出版社, 2020.
[11] KOTOULAS S, OREN E, VAN HARMELEN F. Mind the data skew: distributed inferencing by speed dating in elastic regions[C]//Proceedings of the 19th international conference on World wide web. New York: ACM, 2010: 531-540.
[12] DAVIDSON A, OR A. Optimizing shuffle performance in Spark[EB/OL].[2018-11-25]. https://people.eecs.berkeley.edu/~kubitron/courses/cs262a-F13/projects/reports/project16_report.pdf.
[13] 詹剑锋, 高婉铃, 王磊, 等. Big Data Bench: 开源的大数据系统评测基准[J]. 计算机学报, 2016, 39(1): 196-211.