Analyzing the heterogenous effects of factors on high-range speeding likelihood of taxi speeders: Does explainable deep learning provides more insights than random parameter approach?

Researchers

Chuanyun Fu Haiyue Liu Xinguo Jiang Yue Zhou

Journal

Modalities

Models

Convolutional Neural Network (CNN)Generalized Linear Model (GLM)Self-attention mechanism XGBoost

Abstract

The random parameters Generalized Linear Model (GLM) is frequently used to model speeding characteristics and capture the heterogenous effects of factors. However, this statistical approach is seldom employed for prediction and generalization due to the challenge of transferring its predefined errors. Recently, the emergence of explainable AI techniques has illuminated a new path for analyzing factors associated with risky driving behaviors. Despite this, there remains a gap that comparing results from machine and deep learning (ML/DL) approaches with those from random parameters GLM. This study aims to apply the random parameter GLM and explainable deep learning to evaluate the heterogenous effects of factors on the taxis’ high-range speeding likelihood. Initially, a Beta GLM with random parameters (BGLM-RP) is developed to model the high-range speeding likelihood among taxi drivers. Additionally, XGBoost, a simple convolutional neural network (Simple-CNN), a deeper CNN (DCNN), and a deeper CNN with self-attention (DCNN-SA) are developed. The quantified explanations and illustrations of the factors’ heterogenous effects from ML/DL models are derived from pseudo coefficients by decomposing factors’ SHapley Additive exPlanations (SHAP) values. All the developed statistical, ML, and DL models are compared in terms of mean absolute errors and mean square errors on testing and full data. Results show that DCNN-SA excels in prediction on testing data, indicating its superior generalization capabilities, while BGLM-RP outperforms other models on full data. The DCNN-SA can reveal the heterogenous effects of factors for both in-sample and out-of-sample data, which is not possible for the random parameter GLM. However, BGLM-RP can reveal larger magnitudes of the factors’ heterogenous effects for in-sample data. The signs and significances are identical between the varying coefficients from BGLM-RP and the pseudo coefficients from the ML/DL models, demonstrating the validity and rationale of using the proposed explanation framework to quantify the factors’ effects in ML/DL models. The study also discusses the contributions of various factors to the high-range speeding likelihood of taxi drivers.Copyright © 2024 Elsevier Ltd. All rights reserved.

Show Full Text

Analyzing the heterogenous effects of factors on high-range speeding likelihood of taxi speeders: Does explainable deep learning provides more insights than random parameter approach?

Researchers

Journal

Modalities

Models

Abstract

Accurate predictions of aqueous solubility of drug molecules the multilevel graph convolutional network (MGCN) and SchNet architectures.

EXPRESS: Clinical Predictions of COVID-19 Patients Using Deep Stacking Neural Network.

Identifying Structure-Property Relationships through SMILES Syntax Analysis with Self-Attention Mechanism.

A Hybrid Model with New Word Weighting for Fast Filtering Spam Short Texts.

A Robust Framework for Data Generative and Heart Disease Prediction Based on Efficient Deep Learning Models.

Using Machine Learning and Deep Learning Methods to Predict the Complexity of Breast Cancer Cases.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply