THERMAL SCIENCE

International Scientific Journal

Thermal Science - Online First

Revisiting distance metrics in k-nearest neighbors algorithms: Implications for sovereign country credit rating assessments

ABSTRACT
The k-Nearest Neighbors algorithm, a fundamental machine learning technique, typically employs the Euclidean distance metric for proximity-based data classification. This research focuses on the Feature Importance Infused k-Nearest Neighbors model, an advanced form of k-Nearest Neighbors. Diverging from the uniformly weighted Euclidean distance of the traditional algorithm, Feature Importance Infused k-Nearest Neighbors introduces a specialized distance weighting system. This system emphasizes critical features while reducing the impact of less important ones, thereby enhancing classification accuracy. Empirical studies indicate a 1.7% average accuracy improvement of the proposed model over the conventional model, attributed to its effective handling of feature importance in distance calculations. Notably, a significant positive correlation was observed between the disparity in feature importance levels and the model's accuracy, highlighting the proposed model's proficiency in handling variables with limited explanatory power. These findings suggest the proposed model's potential and open avenues for future research, particularly in refining its feature importance weighting mechanism, broadening its applicability to other datasets, and examining its compatibility with different distance metrics.
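The idea of weighting the distance by feature importance can be illustrated with a minimal sketch. The abstract does not specify how the paper derives or applies its weights, so everything below is an assumption made for illustration: the function name `weighted_knn_predict`, the weighted Euclidean form d(x, q) = sqrt(sum_j w_j (x_j - q_j)^2), the hand-picked weights, and the toy data are all hypothetical and are not taken from the paper.

```python
import numpy as np
from collections import Counter


def weighted_knn_predict(X_train, y_train, x_query, weights, k=3):
    """Majority-vote k-NN using a feature-weighted Euclidean distance.

    Hypothetical illustration only; not the paper's implementation.
    """
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train)
    x_query = np.asarray(x_query, dtype=float)
    w = np.asarray(weights, dtype=float)

    # d(x, q) = sqrt(sum_j w_j * (x_j - q_j)^2); equal weights recover
    # the ordinary Euclidean ordering of neighbors.
    dists = np.sqrt((((X_train - x_query) ** 2) * w).sum(axis=1))

    # Indices of the k closest training points.
    nearest = np.argsort(dists, kind="stable")[:k]

    # Majority vote among the k neighbors.
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Toy data (assumed): feature 0 separates the classes, feature 1 is noise.
X = np.array([
    [0.0, 5.0], [0.2, -4.0], [0.1, 0.0],   # class 0
    [1.0, 0.3], [0.9, 4.8], [1.1, -3.9],   # class 1
])
y = np.array([0, 0, 0, 1, 1, 1])
query = [0.3, 4.5]

# Uniform weights: the noisy feature dominates the neighbor ranking.
uniform = weighted_knn_predict(X, y, query, weights=[1.0, 1.0], k=3)

# Importance-style weights (assumed values) down-weight the noisy feature.
informed = weighted_knn_predict(X, y, query, weights=[0.95, 0.05], k=3)

print(uniform, informed)
```

On this toy data the uniform weighting assigns the query to class 1 because the uninformative second feature pulls in two class-1 neighbors, while the importance-style weighting assigns it to class 0, in line with its value on the informative first feature. In practice the weights would come from some feature importance estimate rather than being set by hand.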
KEYWORDS
PAPER SUBMITTED: 2023-11-11
PAPER REVISED: 2024-01-30
PAPER ACCEPTED: 2024-02-01
PUBLISHED ONLINE: 2024-02-18
DOI REFERENCE: https://doi.org/10.2298/TSCI231111008C