Traffic sign recognition using local directional histogram of oriented gradients
DOI:
https://doi.org/10.51867/ajernet.6.3.54Keywords:
Histogram of Oriented Gradients, Local Directional Pattern, Local Directional Histogram of Oriented Gradients, Traffic Sign RecognitionAbstract
Histogram of Oriented Gradients (HOG) describes an image gradient by calculating vertical and horizontal gradient magnitudes and directions. HOG uses a one-dimensional (1D) centered derivative mask [−1, 0, +1] for horizontal gradient and its rotations at 90o for vertical gradient. This technique only considers four neighboring pixels while calculating image gradient at a particular pixel. Every pixel in an image carries subtle information and therefore all pixels should be considered when deriving image gradient. Therefore, given a pixel pi, all its N = 2(d+1) neighbors should be considered when calculating the gradient at distance d from pi. This paper proposes Local Directional Histogram of Oriented Gradients (LD-HOG), which, given pixel pi, it calculates the gradient at distance d = 1 from pi by considering all the eight neighbors of pi. The proposed operator calculates the image gradient at 0o, 45o, 90o and 1350. These image gradients are used to generate two HOG histograms. Maximum pooling techniques were applied to combine the two histograms. Experimental results on the German traffic sign detection benchmark (GTSDB) dataset show that LD-HOG (average precision = 0.90, average recall = 0.90 and average F1-score =0.90) out performs HOG (average precision = 0.84, average recall = 0.82 and average F1-score = 0.83) in traffic sign recognition. The averages of the two extractors (HOG and LD-HOG) were calculated from experimental results after applying Support Vector Machine (SVM), Random Forest (RF) and Decision Tree (DT) machine learning classifiers. Stratified K-Fold Cross-Validation was done on the proposed LD-HOG using SVM, RF and DT. Validation results show that SVM performed better with 99 percent, followed by RF with 96 percent. DT was had 76 percent.
Downloads
References
An, F., Wang, J., & Liu, R. (2024). Road traffic sign recognition algorithm based on cascade attention-modulation fusion mechanism. IEEE Transactions on Intelligent Transportation Systems, 25(11), 17841-17851. https://doi.org/10.1109/TITS.2024.3439699 DOI: https://doi.org/10.1109/TITS.2024.3439699
Asha, J., Giridhran, R., Agalya, K., & Sathya, R. (2022). Traffic sign detection using HOG and GLCM with decision tree and random forest. In Proceedings of the 2022 International Conference on Automation, Computing and Renewable Systems (ICACRS) (pp. 879-885). IEEE. https://doi.org/10.1109/ICACRS55517.2022.10029118 DOI: https://doi.org/10.1109/ICACRS55517.2022.10029118
Aziz, S., & Youssef, F. (2018). Traffic sign recognition based on multi-feature fusion and ELM classifier. Procedia Computer Science, 127, 146-153. https://doi.org/10.1016/j.procs.2018.01.109
Buda, M., Maki, A., & Mazurowski, M. A. (2018). A systematic study of the class imbalance problem in convolutional neural networks. Neural Networks, 106, 249-259. https://doi.org/10.1016/j.neunet.2018.07.011 DOI: https://doi.org/10.1016/j.neunet.2018.07.011
Chen, C., Li, B., Zhang, H., Zhao, M., Liang, Z., Li, K., & An, X. (2025). Performance enhancement of deep learning model with attention mechanism and FCN model in flood forecasting. Journal of Hydrology, 658, 133221. https://doi.org/10.1016/j.jhydrol.2025.133221 DOI: https://doi.org/10.1016/j.jhydrol.2025.133221
Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR) (Vol. 1, pp. 886-893). https://doi.org/10.1109/CVPR.2005.177
Dalal, N., & Triggs, B. (2010). Histograms of oriented gradients for human detection. In Proceedings of 2005 Institute of Electrical and Electronics Engineers Computer Society Conference on Computer Vision and Pattern Recognition (Vol. 1, pp. 886-893). https://doi.org/10.1109/CVPR.2005.177 DOI: https://doi.org/10.1109/CVPR.2005.177
Halder, R. K., Uddin, M. N., Uddin, M. A., Aryal, S., & Khraisat, A. (2024). Enhancing k-nearest neighbor algorithm: A comprehensive review and performance analysis of modifications. Journal of Big Data, 11(1), 34. DOI: https://doi.org/10.1186/s40537-024-00973-y
https://doi.org/10.1186/s40537-024-00973-y
Hechri, A., & Mtibaa, A. (2020). Two-stage traffic sign detection and recognition based on SVM and convolutional neural networks. IET Image Processing, 14(4), 342-350. https://doi.org/10.1049/iet-ipr.2019.0634 DOI: https://doi.org/10.1049/iet-ipr.2019.0634
https://doi.org/10.1049/iet-ipr.2019.0634
Hechri, A., Hmida, R., Abdelali, A. B., et al. (2014). Real-time road lane markers detection for intelligent vehicles. Advances in Environmental Biology, 8(7), 2266-2272.
Jabid, T., Kabir, M. H., & Chae, O. (2010b, August). Gender classification using local directional pattern (LDP). In Proceedings of 2010 20th International Conference on Pattern Recognition (pp. 2162-2165). IEEE. https://doi.org/10.1109/ICPR.2010.373 DOI: https://doi.org/10.1109/ICPR.2010.373
Jabid, T., Kabir, M., & Chae, O. (2010a, January). Local directional pattern (LDP) for face recognition. In Proceedings of 2010 Digest of Technical Papers International Conference on Consumer Electronics (ICCE). IEEE. https://doi.org/10.1109/ICCE.2010.5418801 DOI: https://doi.org/10.1109/ICCE.2010.5418801
Jain, A., Moparthi, N. R., Swathi, A., Sharma, Y. K., Mittal, N., Alhussen, A., Alzamil, S., & Haq, M. A. (2023). Deep learning-based mask identification system using ResNet transfer learning architecture. Computer Systems Science and Engineering. https://doi.org/10.32604/csse.2023.036973 DOI: https://doi.org/10.32604/csse.2023.036973
Jayaprakasha, A., & KeziSelvaVijilab, C. (2019). Feature selection using ant colony optimization (ACO) and road sign detection and recognition (RSDR) system. Cognitive Systems Research, 58, 123-133. DOI: https://doi.org/10.1016/j.cogsys.2019.04.002
https://doi.org/10.1016/j.cogsys.2019.04.002
Kerim, A., & Efe, M. (2021, April). Recognition of traffic signs with artificial neural networks: A novel dataset and algorithm. In Proceedings of the 2021 International Conference on Artificial Intelligence in Information and Communication (ICAIIC) (pp. 171-176). IEEE. https://doi.org/10.1109/ICAIIC51459.2021.9415238 DOI: https://doi.org/10.1109/ICAIIC51459.2021.9415238
Khalifa, A. A., Alayed, W. M., Elbadawy, H. M., & Sadek, R. A. (2024). Real-time navigation roads: Lightweight and efficient convolutional neural network (LE-CNN) for Arabic traffic sign recognition in intelligent transportation systems (ITS). Applied Sciences, 14(9), 3903. https://doi.org/10.3390/app14093903 DOI: https://doi.org/10.3390/app14093903
Kirsch, R. A. (1971, June). Computer determination of the constituent structure of biological images. Computers and Biomedical Research, 4(3), 315-328. https://doi.org/10.1016/0010-4809(71)90034-6 DOI: https://doi.org/10.1016/0010-4809(71)90034-6
LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998, November). Gradient based learning applied to document recognition. Proceedings of the IEEE, 86(11), 2278-2324. https://doi.org/10.1109/5.726791 DOI: https://doi.org/10.1109/5.726791
Lee, J., Kim, K., & Lee, K. (2024). Multi-sensor image classification using the random forest algorithm in Google Earth Engine with KOMPSAT-3/5 and CAS500-1 images. Remote Sensing, 16(24), 4622. https://doi.org/10.3390/rs16244622 DOI: https://doi.org/10.3390/rs16244622
Lee, S.-W. (1996, June). Off-line recognition of totally unconstrained handwritten numerals using multilayer cluster neural network. IEEE Transactions on Pattern Analysis and Machine Intelligence, 18(6), 648-652. DOI: https://doi.org/10.1109/34.506416
https://doi.org/10.1109/34.506416
Li, W., Song, H., & Wang, P. (2022). Finely crafted feature features for traffic sign recognition. International Journal of Circuits, Systems and Signal Processing, 16, 185-193. https://doi.org/10.46300/9106.2022.16.20 DOI: https://doi.org/10.46300/9106.2022.16.20
Madani, A., & Yusof, R. (2018). Traffic sign recognition based on color, shape, and pictogram classification using support vector machines. Neural Computing and Applications, 30, 2807-2817. DOI: https://doi.org/10.1007/s00521-017-2887-x
https://doi.org/10.1007/s00521-017-2887-x
Maenpaa, T., & Pietikainen, M. (2005). Texture analysis with local binary patterns. In Handbook of Pattern Recognition and Computer Vision (pp. xx-xx). World Scientific. DOI: https://doi.org/10.1142/9789812775320_0011
https://doi.org/10.1142/9789812775320_0011
Namyang, N., & Phimoltares, S. (2020, October). Thai traffic sign classification and recognition system based on histogram of gradients, color layout descriptor, and normalized correlation coefficient. In Proceedings of the 2020 5th International Conference on Information Technology (INCIT) (pp. 270-275). IEEE. DOI: https://doi.org/10.1109/InCIT50588.2020.9310778
https://doi.org/10.1109/InCIT50588.2020.9310778
Ojala, T., Pietikainen, M., & Harwood, D. (1996). A comparative study of texture measures with classification based on featured distribution. Pattern Recognition, 29, 51-59. https://doi.org/10.1016/0031-3203(95)00067-4 DOI: https://doi.org/10.1016/0031-3203(95)00067-4
Ojala, T., Pietikainen, M., & Maenpaa, T. (2002). Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 971-987. DOI: https://doi.org/10.1109/TPAMI.2002.1017623
https://doi.org/10.1109/TPAMI.2002.1017623
Panis, G., Lanitis, A., Tsapatsoulis, N., & Cootes, T. F. (2016). Overview of research on facial ageing using the FG-NET ageing database. IET Biometrics, 5, 37-46. https://doi.org/10.1049/iet-bmt.2014.0053 DOI: https://doi.org/10.1049/iet-bmt.2014.0053
Patle, A., & Chouhan, D. S. (2013). SVM kernel functions for classification. In 2013 International Conference on Advances in Technology and Engineering (ICATE) (pp. 1-9). IEEE. https://doi.org/10.1109/ICAdTE.2013.6524743 DOI: https://doi.org/10.1109/ICAdTE.2013.6524743
Pratt, W. K. (1978). Digital image processing. New York: Wiley.
Prewitt, J. M. S. (1970). Object enhancement and extraction. In B. Lipkin & A. Rosenfeld (Eds.), Picture processing and psychopictorics (pp. 75–149). Academic Press.
Razavi, M., Mavaddati, S., & Koohi, H. (2024). ResNet deep models and transfer learning technique for classification and quality detection of rice cultivars. Expert Systems with Applications, 247, 123276. DOI: https://doi.org/10.1016/j.eswa.2024.123276
https://doi.org/10.1016/j.eswa.2024.123276
Saouli, A., El Aroussi, M., & Fakhri, Y. (2018). Traffic sign recognition based on multi-feature fusion and ELM classifier. Procedia Computer Science, 127, 146-153. https://doi.org/10.1016/j.procs.2018.01.109 DOI: https://doi.org/10.1016/j.procs.2018.01.109
Sapijaszko, G., Alobaidi, T., & Mikhael, W. (2019, August). Traffic sign recognition based on multilayer perceptron using DWT and DCT. In Proceedings of the 2019 IEEE 62nd International Midwest Symposium on Circuits and Systems (MWSCAS) (pp. 440-443). IEEE. https://doi.org/10.1109/MWSCAS.2019.8884897 DOI: https://doi.org/10.1109/MWSCAS.2019.8884897
Satpathy, A., Jiang, X., & Eng, H.-L. (2010). Extended histogram of gradients feature for human detection. In Proceedings of 2010 17th IEEE Conference on Image Processing (ICIP). IEEE. https://doi.org/10.1109/ICIP.2010.5650070 DOI: https://doi.org/10.1109/ICIP.2010.5650070
Sedaghatjoo, Z., Hosseinzadeh, H., & Bigham, B. S. (2024). Local binary pattern (LBP) optimization for feature extraction. arXiv. https://arxiv.org/abs/2407.18665
Shi, X., Hao, Z., & Yu, Z. (2024, June). SpikingResFormer: Bridging ResNet and Vision Transformer in spiking neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 5610-5619). https://doi.org/10.1109/CVPR52733.2024.00536 DOI: https://doi.org/10.1109/CVPR52733.2024.00536
Sobel, I., & Feldman, G. (1968). A 3 x 3 isotropic gradient operator for image processing. In Presented at the Stanford Artificial Intelligence Project (SAIL).
Soni, D., Chaurasiya, R., & Agrawal, S. (2019, January). Improving the classification accuracy of accurate traffic sign detection and recognition system using HOG and LBP features and PCA-based dimension reduction. In Proceedings of the International Conference on Sustainable Computing in Science, Technology and Management (SUSCOM). Jaipur, India. https://doi.org/10.2139/ssrn.3358756 DOI: https://doi.org/10.2139/ssrn.3358756
Stallkamp, J., Schlipsing, M., Salmen, J., & Igel, C. (2012, August). 2012 special issue: Man vs. computer: Benchmarking machine learning algorithms for traffic sign recognition. Neural Networks, 32, 323-332. DOI: https://doi.org/10.1016/j.neunet.2012.02.016
https://doi.org/10.1016/j.neunet.2012.02.016
Tan, H., Ma, Z., & Yang, B. (2014, June). Face recognition based on the fusion of global and local HOG features of face images. IET Computer Vision, 8(3), 224-234. https://doi.org/10.1049/iet-cvi.2012.0302 DOI: https://doi.org/10.1049/iet-cvi.2012.0302
Wang, B. (2022). Research on the optimal machine learning classifier for traffic signs. In SHS Web of Conferences. EDP Sciences. https://doi.org/10.1051/shsconf/202214403014 DOI: https://doi.org/10.1051/shsconf/202214403014
Weng, H., & Chiu, C. (2018, April). Resource efficient hardware implementation for real-time traffic sign recognition. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1120-1124). IEEE. https://doi.org/10.1109/ICASSP.2018.8462298 DOI: https://doi.org/10.1109/ICASSP.2018.8462298
WHO. (2018). Global status report on road safety. World Health Organization.
Yang, Y., Luo, H., Xu, H., et al. (2016). Towards real-time traffic sign detection and classification. IEEE Transactions on Intelligent Transportation Systems, 17(7), 2022-2031. https://doi.org/10.1109/TITS.2015.2509281 DOI: https://doi.org/10.1109/TITS.2015.2482461
Zhang, K., Guo, Y., Wang, X., Yuan, J., & Ding, Q. (2019). Multiple feature reweight DenseNet for image classification. IEEE Access, 7, 9872-9880. https://doi.org/10.1109/ACCESS.2018.2890127 DOI: https://doi.org/10.1109/ACCESS.2018.2890127
Zhu, Y., & Newsam, S. (2017). DenseNet for dense flow. In 2017 IEEE International Conference on Image Processing (ICIP) (pp. 790-794). IEEE. https://doi.org/10.1109/ICIP.2017.8296389 DOI: https://doi.org/10.1109/ICIP.2017.8296389
Zhu, Y., & Yan, W. Q. (2022). Traffic sign recognition based on deep learning. Multimedia Tools and Applications, 81(17), 17779-17791. https://doi.org/10.1007/s11042-022-12163-0 DOI: https://doi.org/10.1007/s11042-022-12163-0
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Prestone Simiyu, Dr. Raphael Angulu, Dr. Daniel Otanga

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.













