Earth Science Frontiers ›› 2024, Vol. 31 ›› Issue (4): 47-57.DOI: 10.13745/j.esf.sf.2024.5.5

Previous Articles     Next Articles

Machine-readable expression of unstructured geological information and intelligent prediction of mineralization associated anomaly areas in Pangxidong District, Guangdong, China

WANG Kunyi1,2,3(), ZHOU Yongzhang1,2,3,*()   

  1. 1. School of Earth Sciences and Engineering, Sun Yat-sen University, Zhuhai 519082, China
    2. Center for Earth Environment & Resources, Sun Yat-sen University, Guangzhou 510275, China
    3. Guangdong Provincial Key Lab of Geological Processes and Mineral Resources, Guangzhou 510275, China
  • Received:2024-02-21 Revised:2024-03-04 Online:2024-07-25 Published:2024-07-10

Abstract:

The application of big data mining and machine learning algorithms in mineralization prediction has become an important research trend, but unstructured geological data cannot be directly mined—first they need to be converted to machine-readable expressions. In this study of the Pangxidong ore district in western Guangdong Province, the unstructured geological information such as stratigraphy, lithology, faults are converted into machine-readable format, and two machine learning algorithms, namely, One-Class Support Vector Machine and Auto-Encoder Network, are applied to mine the geochemical test data of stream sediments as well as the comprehensive geological information on faults, stratigraphy, etc. to extract the features of mineralization anomalies and ultimately achieve intelligent delineation of the anomaly areas. Through combined application of One-Hot Encoder and the weighted variable method for spatially weighted principal component analysis, the structural transformation of the unstructured geological information is realized, and geological information is maximally preserved for data mining. It is demonstrated that the application of One-Class Support Vector Machine and Auto-Encoder Network can effectively solve the problem of data imbalance, as the numbers of ore and non-ore spots in the study area are seriously unbalanced. The prediction results generated using the integrated, synthesized multi-source geological data are relatively consistent with the observed spatial distribution of Pb-Zn deposits and the actual geological structure in the study area, indicating the two algorithms can effectively identify potential prospecting targets and ore deposits. Compared with traditional geochemical prospecting methods, the intelligent prediction method can process and integrate multi-source geological information about the ore-forming processes and identify mineralization anomaly areas. This method is applicable in prospecting areas without prior ore discovery, thereby improving the efficiency of ore prospecting and increasing the possibility of finding ore deposits.

Key words: big data mining, machine-readable expression, One-Hot Encoder, One-Class Support Vector Machine, Auto-Encoder Network, Pangxidong ore district, Qinzhou-Hangzhou metallogenic belt

CLC Number: