Optimization of collaborative filtering algorithm based on DAG Spark scheduling
返回论文页
|更新时间:2023-12-11
|
Optimization of collaborative filtering algorithm based on DAG Spark scheduling
Acta Scientiarum Naturalium Universitatis SunYatseniVol. 56, Issue 3, Pages: 46-56(2017)
作者机构:
1. 新疆财经大学统计与信息学院,新疆,乌鲁木齐,830012
2.
3. 新疆大学信息科学与工程学院,新疆,乌鲁木齐,830008
作者简介:
基金信息:
DOI:
CLC:
Published:2017,
Published Online:25 May 2017,
扫 描 看 全 文
LIAO Bin, ZHANG Tao, YU Jiong, et al. Optimization of collaborative filtering algorithm based on DAG Spark scheduling. [J]. Acta Scientiarum Naturalium Universitatis SunYatseni 56(3):46-56(2017)
DOI:
LIAO Bin, ZHANG Tao, YU Jiong, et al. Optimization of collaborative filtering algorithm based on DAG Spark scheduling. [J]. Acta Scientiarum Naturalium Universitatis SunYatseni 56(3):46-56(2017)DOI:
Optimization of collaborative filtering algorithm based on DAG Spark scheduling
The scale effect of big data has brought great challenges to data storage
management and analysis. And the high efficiency and low cost big data processing technology has become a hotspot research in academia and industry. In order to improve the efficiency of collaborative filtering algorithms
the implementation of the algorithm under the MapReduce architecture is decomposed in order to analysis the defects of the algorithm. For the Spark suitable for the iterative and interactive tasks
this paper presents the methods to improve the execution efficiency from the MapReduce platform to the Spark platform. The implementation flow of the algorithm in Spark is designed
and efficiency is improved by parameter adjustment and memory optimization. Experimental results show that: based on spark DAG scheduling
the algorithm can reduce more than 65% HDFS I/O operations and enforce the efficiency and energy efficiency were increased by nearly 200% and 50%.