Research on data mining methods and applications for historical literature on mining land reclamation

    • Abstract: Land reclamation is one of the most serious problems facing coal mining. Years of related research have produced a large body of historical literature, so it is necessary to introduce data mining techniques to process these valuable materials. Taking the Xuzhou mining area as an example, this paper segments and encodes keywords from the historical literature on land reclamation, builds a TF-IDF algorithm and a vector space model, applies cluster analysis, performs the data mining in Python, and displays the results through secondary development on ArcGIS. The work yields important historical information on the Xuzhou mining area, including subsidence conditions, reclamation techniques, and demonstration projects, and overcomes the difficulties of data redundancy, data conflict, and authenticity identification in the historical literature. The results show that data mining not only makes up for the shortcomings of manual tabulation but also restores the historical "data chain". It enables the integration of, and knowledge discovery from, information on mining land reclamation and ecological reconstruction, and provides basic data support for the systematic restoration and comprehensive management of mining areas.
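The abstract names TF-IDF weighting and a vector space model as the core of the method but does not give the implementation. The following is only a minimal sketch of that general technique, not the authors' code. It assumes each document has already been segmented into a keyword list. The function names `tfidf_vectors` and `cosine` and the sample keywords are hypothetical and chosen for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Turn keyword lists into sparse TF-IDF vectors ({term: weight})."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n / df[term])
            for term, count in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Hypothetical keyword lists standing in for segmented documents.
docs = [
    ["subsidence", "reclamation", "Xuzhou"],
    ["subsidence", "filling", "Xuzhou"],
    ["wetland", "ecology", "restoration"],
]
vecs = tfidf_vectors(docs)
sim_related = cosine(vecs[0], vecs[1])    # documents sharing keywords
sim_unrelated = cosine(vecs[0], vecs[2])  # documents sharing none
```

In a pipeline of this kind, a clustering step would then group documents whose pairwise similarity is high. Which clustering algorithm and parameters the paper uses is not stated in the abstract.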

       
