Car accidents prediction based on cloud computing in data analysis

Conference: ISCTT 2022 - 7th International Conference on Information Science, Computer Technology and Transportation
05/27/2022 - 05/29/2022 at Xishuangbanna, China

Proceedings: ISCTT 2022

Pages: 6Language: englishTyp: PDF

Authors:
Yang, Chengke (Gansu Finance and Trade Vocational College, Lanzhou, China)

Abstract:
By using cloud computing and machine learning to analyze traffic accident data in the United States, this paper explores the practicability of cloud computing in data analysis and the possibility of using this model for traffic accident early warning. The experiment first stores the data in Hadoop File System and reads it out as Spark dataframe, and then uses Spark SQL for preliminary analysis and preprocessing. Then, traffic accident predictions are made using four machine learning algorithms in Spark MLlib. After comparing with the original data, logistic regression, random forest, and Gradient Boosted Trees all performed well, and the accuracy rate can reach more than 78% although the data itself is biased slightly, confirming that this model can be used to predict traffic accidents.