Earth Science Frontiers ›› 2025, Vol. 32 ›› Issue (4): 155-164.DOI: 10.13745/j.esf.sf.2025.4.77

Previous Articles     Next Articles

Intelligent search technology for Jiaodong gold deposits based on large models and GraphRAG

LI Bowen1,2(), WANG Yongzhi1,3,*(), DING Zhengjiang4,5, WANG Bin4,5, WEN Shibo3, DONG Yuhao1, JI Zheng3   

  1. 1. Integrated Information Mineral Prediction Research Institute, Jilin University, Changchun 130061, China
    2. Changchun Gold Research Institute Co.,Ltd., Changchun 130012, China
    3. College of Geoexploration Science and Technology, Jilin University, Changchun 130061, China
    4. Ministry of Natural Resources Technology Innovation Center for Deep Gold Resources Exploration and Mining, Weihai 264209, China
    5. No.6 Geological Team of Shandong Provincial Bureau of Geology and Mineral Resources, Weihai 264209, China
  • Received:2025-01-24 Revised:2025-04-20 Online:2025-07-25 Published:2025-08-04

Abstract:

The Jiaodong gold deposit is a major concentration area of gold resources in eastern China, characterized by complex geological information and an extensive knowledge system. Traditional information retrieval methods struggle to meet the advanced demands of semantic understanding and knowledge reasoning in mineral exploration. To enhance geological knowledge service efficiency, this study develops an intelligent question-answering system for the Jiaodong gold deposit domain based on GraphRAG (Graph-enhanced Retrieval-Augmented Generation) technology. The research utilizes academic papers from CNKI as the corpus, employs OCR and large language models (LLMs) for text parsing and semantic standardization to establish an ontological knowledge system covering core concepts such as mineralization types, ore-controlling structures, and mineral assemblages. The system uses prompt engineering-driven LLMs to automatically extract entities and relationships, constructing a structured knowledge graph integrated into Neo4j. Furthermore, by combining semantic embedding with community clustering algorithms, a knowledge indexing network enables natural language question answering, semantic query expansion, and knowledge provenance. Evaluation results demonstrate the system’s superiority over traditional RAG and general models (e.g., ChatGPT-4o) in answer accuracy, contextual precision, and knowledge interpretability, exhibiting enhanced professional adaptability and reasoning capabilities. The findings provide a novel technical pathway for intelligent information services in gold deposits and theoretical support for knowledge of graph-enhanced language models in geoscience knowledge management.

Key words: GraphRAG, knowledge graph, large language model, Jiaodong gold deposit, knowledge question answering

CLC Number: