Earth Science Frontiers ›› 2025, Vol. 32 ›› Issue (4): 122-139.DOI: 10.13745/j.esf.sf.2025.4.66

Previous Articles     Next Articles

Metallogenic prediction based on ensemble learning models and Bayesian Optimization Algorithm

KONG Chunfang1,2,3,4(), TIAN Qian1, LIU Jian5, CAI Guorong1,5, ZHAO Jie1, XU Kai1,2,3,4,*()   

  1. 1. School of Computer, China University of Geosciences (Wuhan), Wuhan 430074, China
    2. Engineering Technology Innovation Center of Mineral Resource Explorations in Bedrock Zones, Ministry of Natural Resources, Guiyang 550081, China
    3. Guizhou Key Laboratory for Strategic Mineral Intelligent Exploration, Guiyang 550081, China
    4. Hubei Key Laboratory of Intelligent Geo-Information Processing, Wuhan 430074, China
    5. Geology Team 103, Bureau of Geology and Mineral Exploration and Development, Tongren 554300, China
  • Received:2025-05-12 Revised:2025-05-20 Online:2025-07-25 Published:2025-08-04

Abstract:

Exploration for hidden ore bodies is increasingly important and demands innovative prospecting methods. Data-driven metallogenic prediction models using ensemble learning are becoming powerful tools for deep mineral exploration. However, such models face challenges, particularly in time-consuming hyperparameter tuning requiring extensive computation and expertise. To address this, we propose a framework integrating multi-source geological knowledge with Bayesian Optimization (BO) for ensemble learning. Specifically, a manganese (Mn) metallogenic prediction database integrating multi-source geological knowledge is first constructed. Metallogenic prediction models for Mn ore in northeastern Guizhou are then established using Adaptive Boosting (AdaBoost) and Random Forest (RF). The hyperparameters of these base models are optimized using Bayesian Optimization (BO) via 5-fold cross-validation, resulting in the optimized BO-AdaBoost and BO-RF models. Model performance is evaluated using metrics including accuracy, precision, recall, F1-score, kappa, and AUC values. Results show significant improvements in AUC for both BO-optimized models compared to their non-optimized counterparts, demonstrating BO’s effectiveness for ensemble learning hyperparameter tuning. Furthermore, the BO-AdaBoost model achieves higher prediction accuracy (92.8%) and generalization performance than the BO-RF model (89.9%), highlighting its strong potential for metallogenic prediction. The prospectivity map generated by the BO-AdaBoost model provides critical guidance for exploring deep-hidden Mn deposits in northeastern Guizhou and can direct future mineral exploration and development.

Key words: ensemble learning, Adaptive Boosting (AdaBoost), Random Forest (RF), Bayesian Optimization (BO), hidden manganese ore, metallogenic prediction

CLC Number: