Published 2025-04-17
Keywords
- Financial Fraud,
- Fraud Detection,
- Unsupervised Learning,
- K-means Clustering,
- Anomalous Patterns
How to Cite
Abstract
Financial fraud is a prevalent issue that poses significant economic threats globally. With the rapid advancements in technology, traditional fraud detection methods are becoming inadequate, necessitating the development of more effective and efficient approaches. The current research landscape in financial fraud detection predominantly relies on supervised learning techniques, facing challenges such as imbalanced datasets and limited scalability. To address these limitations, this paper proposes a novel approach utilizing K-means clustering-based unsupervised learning for financial fraud detection. The innovative framework aims to enhance detection accuracy and scalability while reducing false positives. By leveraging unsupervised learning, the model can detect anomalous patterns without labeled training data, thereby improving fraud detection performance and adaptability in dynamic financial environments.
References
- J. Hartigan and M. A. Wong, "A k-means clustering algorithm," 1979.
- T. Kanungo et al., "An Efficient k-Means Clustering Algorithm: Analysis and Implementation," in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002.
- K. Wagstaff et al., "Constrained K-means Clustering with Background Knowledge," in International Conference on Machine Learning, 2001.
- Z. Huang et al., "Application of Machine Learning-Based K-means Clustering for Financial Fraud Detection," in Academic Journal of Science and Technology, 2024.
- K. P. Sinaga and M.-S. Yang, "Unsupervised K-Means Clustering Algorithm," in IEEE Access, 2020.
- L. Azzahra et al., "Metode K-Means Clustering Dalam Pengelompokan Penjualan Produk Frozen Food," in Jurnal Ilmu Komputer dan Sistem Informasi, 2024.
- A. K. Jain, "Data clustering: 50 years beyond K-means," in Pattern Recognition Letters, 2008.
- A. M. Ikotun et al., "K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data," in Information Sciences, 2022.
- F. Nie et al., "An Effective and Efficient Algorithm for K-Means Clustering With New Formulation," in IEEE Transactions on Knowledge and Data Engineering, 2023.
- A. Likas et al., "The global k-means clustering algorithm," in Pattern Recognition, 2003.
- Z. Huang et al., "Application of Machine Learning-Based K-means Clustering for Financial Fraud Detection," Academic Journal of Science and Technology, 2024.
- S. Islam et al., "A rule-based machine learning model for financial fraud detection," International Journal of Electrical and Computer Engineering, 2024.
- P. Kamuangu, "A Review on Financial Fraud Detection using AI and Machine Learning," Journal of Economics, Finance and Accounting Studies, 2024.
- P. O. Shoetan et al., "REVIEWING THE ROLE OF BIG DATA ANALYTICS IN FINANCIAL FRAUD DETECTION," Finance & Accounting Research Journal, 2024.
- Y. Cheng et al., "Advanced Financial Fraud Detection Using GNN-CL Model," International Conferences on Computers, Information Processing, and Advanced Education, 2024.
- N. Innan et al., "QFNN-FFD: Quantum Federated Neural Network for Financial Fraud Detection," arXiv.org, 2024.
- D. Cheng et al., "Graph Neural Networks for Financial Fraud Detection: A Review," Frontiers Comput. Sci., 2024.
- A. Adewumi et al., "Enhancing financial fraud detection using adaptive machine learning models and business analytics," International Journal of Scientific Research Updates, 2024.
- M. M. Ismail et al., "Enhancing Enterprise Financial Fraud Detection Using Machine Learning," Engineering, Technology & Applied Science Research, 2024.
- E. Galla et al., "Enhancing Performance of Financial Fraud Detection Through Machine Learning Model," Social Science Research Network, 2024.
- C. Y. Tang and C. Li, ‘Examining the Factors of Corporate Frauds in Chinese A-share Listed Enterprises’, OAJRC Social Science, vol. 4, no. 3, pp. 63–77, 2023.
- Q. Zhu, ‘Autonomous Cloud Resource Management through DBSCAN-based unsupervised learning’, Optimizations in Applied Machine Learning, vol. 5, no. 1, Art. no. 1, Jun. 2025, doi: 10.71070/oaml.v5i1.112.
- S. Dan and Q. Zhu, ‘Enhancement of data centric security through predictive ridge regression’, Optimizations in Applied Machine Learning, vol. 5, no. 1, Art. no. 1, May 2025, doi: 10.71070/oaml.v5i1.113.
- S. Dan and Q. Zhu, ‘Highly efficient cloud computing via Adaptive Hierarchical Federated Learning’, Optimizations in Applied Machine Learning, vol. 5, no. 1, Art. no. 1, Apr. 2025, doi: 10.71070/oaml.v5i1.114.
- Q. Zhu and S. Dan, ‘Data Security Identification Based on Full-Dimensional Dynamic Convolution and Multi-Modal CLIP’, Journal of Information, Technology and Policy, 2023.
- Q. Zhu, ‘An innovative approach for distributed cloud computing through dynamic Bayesian networks’, Journal of Computational Methods in Engineering Applications, 2024.
- Z. Luo, H. Yan, and X. Pan, ‘Optimizing Transformer Models for Resource-Constrained Environments: A Study on Model Compression Techniques’, Journal of Computational Methods in Engineering Applications, pp. 1–12, Nov. 2023, doi: 10.62836/jcmea.v3i1.030107.
- H. Yan and D. Shao, ‘Enhancing Transformer Training Efficiency with Dynamic Dropout’, Nov. 05, 2024, arXiv: arXiv:2411.03236. doi: 10.48550/arXiv.2411.03236.
- H. Yan, ‘Real-Time 3D Model Reconstruction through Energy-Efficient Edge Computing’, Optimizations in Applied Machine Learning, vol. 2, no. 1, 2022.
- Y. Shu, Z. Zhu, S. Kanchanakungwankul, and D. G. Truhlar, ‘Small Representative Databases for Testing and Validating Density Functionals and Other Electronic Structure Methods’, J. Phys. Chem. A, vol. 128, no. 31, pp. 6412–6422, Aug. 2024, doi: 10.1021/acs.jpca.4c03137.
- C. Kim, Z. Zhu, W. B. Barbazuk, R. L. Bacher, and C. D. Vulpe, ‘Time-course characterization of whole-transcriptome dynamics of HepG2/C3A spheroids and its toxicological implications’, Toxicology Letters, vol. 401, pp. 125–138, 2024.
- J. Shen et al., ‘Joint modeling of human cortical structure: Genetic correlation network and composite-trait genetic correlation’, NeuroImage, vol. 297, p. 120739, 2024.
- K. F. Faridi et al., ‘Factors associated with reporting left ventricular ejection fraction with 3D echocardiography in real‐world practice’, Echocardiography, vol. 41, no. 2, p. e15774, Feb. 2024, doi: 10.1111/echo.15774.
- Z. Zhu, ‘Tumor purity predicted by statistical methods’, in AIP Conference Proceedings, AIP Publishing, 2022.
- Z. Zhao, P. Ren, and Q. Yang, ‘Student self-management, academic achievement: Exploring the mediating role of self-efficacy and the moderating influence of gender insights from a survey conducted in 3 universities in America’, Apr. 17, 2024, arXiv: arXiv:2404.11029. doi: 10.48550/arXiv.2404.11029.
- Z. Zhao, P. Ren, and M. Tang, ‘Analyzing the Impact of Anti-Globalization on the Evolution of Higher Education Internationalization in China’, Journal of Linguistics and Education Research, vol. 5, no. 2, pp. 15–31, 2022.
- M. Tang, P. Ren, and Z. Zhao, ‘Bridging the gap: The role of educational technology in promoting educational equity’, The Educational Review, USA, vol. 8, no. 8, pp. 1077–1086, 2024.
- P. Ren, Z. Zhao, and Q. Yang, ‘Exploring the Path of Transformation and Development for Study Abroad Consultancy Firms in China’, Apr. 17, 2024, arXiv: arXiv:2404.11034. doi: 10.48550/arXiv.2404.11034.
- P. Ren and Z. Zhao, ‘Parental Recognition of Double Reduction Policy, Family Economic Status And Educational Anxiety: Exploring the Mediating Influence of Educational Technology Substitutive Resource’, Economics & Management Information, pp. 1–12, 2024.
- Z. Zhao, P. Ren, and M. Tang, ‘How Social Media as a Digital Marketing Strategy Influences Chinese Students’ Decision to Study Abroad in the United States: A Model Analysis Approach’, Journal of Linguistics and Education Research, vol. 6, no. 1, pp. 12–23, 2024.
- Z. Zhao and P. Ren, ‘Identifications of Active Explorers and Passive Learners Among Students: Gaussian Mixture Model-Based Approach’, Bulletin of Education and Psychology, vol. 5, no. 1, Art. no. 1, May 2025.
- Z. Zhao and P. Ren, ‘Prediction of Student Answer Accuracy based on Logistic Regression’, Bulletin of Education and Psychology, vol. 5, no. 1, Art. no. 1, Feb. 2025.
- Z. Zhao and P. Ren, ‘Prediction of Student Disciplinary Behavior through Efficient Ridge Regression’, Bulletin of Education and Psychology, vol. 5, no. 1, Art. no. 1, Mar. 2025.
- Z. Zhao and P. Ren, ‘Random Forest-Based Early Warning System for Student Dropout Using Behavioral Data’, Bulletin of Education and Psychology, vol. 5, no. 1, Art. no. 1, Apr. 2025.
- P. Ren and Z. Zhao, ‘Recognition and Detection of Student Emotional States through Bayesian Inference’, Bulletin of Education and Psychology, vol. 5, no. 1, Art. no. 1, May 2025.
- P. Ren and Z. Zhao, ‘Support Vector Regression-based Estimate of Student Absenteeism Rate’, Bulletin of Education and Psychology, vol. 5, no. 1, Art. no. 1, Jun. 2025.
- G. Zhang and T. Zhou, ‘Finite Element Model Calibration with Surrogate Model-Based Bayesian Updating: A Case Study of Motor FEM Model’, IAET, pp. 1–13, Sep. 2024, doi: 10.62836/iaet.v3i1.232.
- G. Zhang, W. Huang, and T. Zhou, ‘Performance Optimization Algorithm for Motor Design with Adaptive Weights Based on GNN Representation’, Electrical Science & Engineering, vol. 6, no. 1, Art. no. 1, Oct. 2024, doi: 10.30564/ese.v6i1.7532.
- T. Zhou, G. Zhang, and Y. Cai, ‘Unsupervised Autoencoders Combined with Multi-Model Machine Learning Fusion for Improving the Applicability of Aircraft Sensor and Engine Performance Prediction’, Optimizations in Applied Machine Learning, vol. 5, no. 1, Art. no. 1, Feb. 2025, doi: 10.71070/oaml.v5i1.83.
- Y. Tang and C. Li, ‘Exploring the Factors of Supply Chain Concentration in Chinese A-Share Listed Enterprises’, Journal of Computational Methods in Engineering Applications, pp. 1–17, 2023.
- C. Li and Y. Tang, ‘Emotional Value in Experiential Marketing: Driving Factors for Sales Growth–A Quantitative Study from the Eastern Coastal Region’, Economics & Management Information, pp. 1–13, 2024.
- C. Li and Y. Tang, ‘The Factors of Brand Reputation in Chinese Luxury Fashion Brands’, Journal of Integrated Social Sciences and Humanities, pp. 1–14, 2023.
- W. Huang, T. Zhou, J. Ma, and X. Chen, ‘An ensemble model based on fusion of multiple machine learning algorithms for remaining useful life prediction of lithium battery in electric vehicles’, Innovations in Applied Engineering and Technology, pp. 1–12, 2025.
- W. Huang and J. Ma, ‘Predictive Energy Management Strategy for Hybrid Electric Vehicles Based on Soft Actor-Critic’, Energy & System, vol. 5, no. 1, 2025.
- J. Ma, K. Xu, Y. Qiao, and Z. Zhang, ‘An Integrated Model for Social Media Toxic Comments Detection: Fusion of High-Dimensional Neural Network Representations and Multiple Traditional Machine Learning Algorithms’, Journal of Computational Methods in Engineering Applications, pp. 1–12, 2022.
- W. Huang, Y. Cai, and G. Zhang, ‘Battery Degradation Analysis through Sparse Ridge Regression’, Energy & System, vol. 4, no. 1, Art. no. 1, Dec. 2024, doi: 10.71070/es.v4i1.65.
- Z. Zhang, ‘RAG for Personalized Medicine: A Framework for Integrating Patient Data and Pharmaceutical Knowledge for Treatment Recommendations’, Optimizations in Applied Machine Learning, vol. 4, no. 1, 2024.
- Z. Zhang, K. Xu, Y. Qiao, and A. Wilson, ‘Sparse Attention Combined with RAG Technology for Financial Data Analysis’, Journal of Computer Science Research, vol. 7, no. 2, Art. no. 2, Mar. 2025, doi: 10.30564/jcsr.v7i2.8933.
- P.-M. Lu and Z. Zhang, ‘The Model of Food Nutrition Feature Modeling and Personalized Diet Recommendation Based on the Integration of Neural Networks and K-Means Clustering’, Journal of Computational Biology and Medicine, vol. 5, no. 1, 2025.
- Y. Qiao, K. Xu, Z. Zhang, and A. Wilson, ‘TrAdaBoostR2-based Domain Adaptation for Generalizable Revenue Prediction in Online Advertising Across Various Data Distributions’, Advances in Computer and Communication, vol. 6, no. 2, 2025.
- K. Xu, Y. Gan, and A. Wilson, ‘Stacked Generalization for Robust Prediction of Trust and Private Equity on Financial Performances’, Innovations in Applied Engineering and Technology, pp. 1–12, 2024.
- A. Wilson and J. Ma, ‘MDD-based Domain Adaptation Algorithm for Improving the Applicability of the Artificial Neural Network in Vehicle Insurance Claim Fraud Detection’, Optimizations in Applied Machine Learning, vol. 5, no. 1, 2025.