SEISMOLOGY AND GEOLOGY ›› 2022, Vol. 44 ›› Issue (6): 1615-1633.DOI: 10.3969/j.issn.0253-4967.2022.06.015

• Research paper • Previous Articles     Next Articles

AUTOMATIC FAULT IDENTIFICATION METHOD BASED ON IMPROVED DBSCAN ALGORITHM AND ITS APPLICATION TO TANGSHAN AREA

ZHANG Su-xiang1)(), SHENG Shu-zhong1),*(), XI Biao2), FANG Li-hua3), LÜ Jian4), WANG Gan-jiao4), ZHANG Xiao1)   

  1. 1)School of Geophysics and Measurement-control Technology, East China University of Technology, Nanchang 330013, China
    2)Key Laboratory of Exploration Technologies for Oil and Gas Resources, Ministry of Education, Yangtze University, Wuhan 430100, China
    3)Institute of Geophysics, China Earthquake Administration, Beijing 100081, China
    4)Jiangxi Earthquake Agency, Nanchang 330039, China
  • Received:2022-02-28 Revised:2022-05-09 Online:2022-12-20 Published:2023-01-21
  • Contact: SHENG Shu-zhong

基于改进的DBSCAN算法自动识别断层方法研究及其在唐山地区的应用

张苏祥1)(), 盛书中1),*(), 席彪2), 房立华3), 吕坚4), 王甘娇4), 张潇1)   

  1. 1)东华理工大学, 地球物理与测控技术学院, 南昌 330013
    2)长江大学, 油气资源勘探技术教育部重点实验室, 武汉 430100
    3)中国地震局地球物理研究所, 北京 100081
    4)江西省地震局, 南昌 330039
  • 通讯作者: 盛书中
  • 作者简介:张苏祥, 男, 1997年生, 2020年于防灾科技学院获地球物理学专业学士学位, 现为东华理工大学地球物理学专业在读硕士研究生, 主要从事发震构造和构造应力场等方面研究, E-mail: zhangsuxiang@ecut.edu.cn
  • 基金资助:
    国家自然科学基金(41704053);国家自然科学基金(42174074);国家自然科学基金(41904044);江西省研究生创新基金(YC2021-S623);东华理工大学博士科研启动资金(DHBK2019084)

Abstract:

With the continuous increasing density of the seismic network and the improvement of the seismograph observation capability, the number of observed seismic events has increased dramatically and the location accuracy has been continuously improved. Therefore, obtaining fault geometry and its parameters from massive seismic data has become an essential method for seismogenic structure research. At present, in the research of obtaining faults and their parameters based on seismic data, there are two main methods of selecting data: One is to select seismic data empirically based on the understanding of fault structures and the spatial distribution of seismic data, and then fit the fault plane from these data. However, it depends on prior information, i.e. the knowledge of existing fault structures and the linear distribution of earthquakes, and it is difficult to process relatively poor linear trends. The other is based on the spatial clustering of seismic data, which adopts unsupervised clustering technology in machine learning to select data. This method avoids the dependence on experience and is more suitable for fault segment data obtained from massive seismic data. Fault parameters can be inversed by fault segment data to determine the fault structure and give its quantitative parameters. However, the current clustering technique for obtaining fault parameters has some limitations, such as the selection of the optimal parameters being difficult, data with different densities being dealt with by the same parameters, and poor method generality. In order to automatically identify faults and obtain fault parameters based on the spatial distribution of earthquakes, and avoid the aforementioned limitations, a new method based on the improved DBSCAN algorithm is presented in this study.
The method proposed in this study uses the k-average nearest neighbor method(K-ANN)and the mathematical expectation method to generate the candidate sets of eps and minPts threshold parameters, which are selected as optimal parameters based on the density hierarchy stability. Considering the spatial density differences of seismic events on different faults and the same fault, this study performs layer-by-layer density clustering from high density to low density. First, the above steps achieve the automatic selection of optimal parameters for clustering and identifying fault segments. Secondly, the fault parameters of the identified fault segments are calculated by the combination of the simulated annealing(SA)global search method and the local search method of Gaussian Newton(GN). Then, the adjacent similar fault segments are merged. Finally, the faults and their parameters are obtained.
The reliability of the automatic fault identification method was verified by synthetic data and the double-difference location catalog of Tangshan area, China. The following results were obtained: Ⅰ. The improved DBSCAN algorithm can automatically identify the fault segments, which is verified by the application of synthetic data and the double-difference location data of the Tangshan area. Ⅱ. Based on the double-difference location data of the Tangshan area, eight fault segments were identified using the improved DBSCAN algorithm. The specific names of the 8 faults are as follows: Douhe fault segment, Weishan-Fengnan fault segment, Luanxian-Laoting fault segment, Lulong fault segment, Xujialou-Wangxizhuang fault segment, Luanxian fault north segment, Leizhuang fault segment, and Chenguantun fault segment, and their strike and dip angle are 229.1°, 230.4°, 132.2°, 31.7°, 191.3°, 31°, 229.5°, 84.9°, and 51.6°, 88.4°, 89.3°, 88.6°, 88.4°, 88.2°, 73.8° and 85.4°, respectively. The parameters of the first five faults are mostly consistent with those of previous research results. The last three faults are the newly identified faults in this study based on the seismic catalog, and the parameters of two of them have been confirmed by previous research results or focal mechanism parameters on the faults.
In a word, the improved DBSCAN algorithm in this study can realize fault segment automatic identification, but there are still some problems that need to be improved urgently. In the follow-up research, we will continue to improve the automatic fault identification method and increase its ability of automatic fault identification so as to provide more accurate fault data for related research.

Key words: automatic fault identification, fault plane parameters, Tangshan area, density-based spatial clustering of applications with noise(DBSCAN), unsupervised clustering techniques

摘要:

为实现根据地震空间分布自动识别断层和获取断层参数, 文中提出基于改进的DBSCAN算法自动识别断层的方法。该方法首先根据数据集的特性对无监督聚类技术(DBSCAN)进行改进, 实现自动选择聚类最优参数和断层段识别; 其次, 对识别出的断层段采用模拟退火全局搜索-高斯牛顿局部搜索结合法计算其断层参数, 再对邻近的相似断层段进行合并, 最终给出基于地震空间分布识别出的断层及其参数。文中采用人工合成数据验证了上述方法的可靠性, 并将其应用于唐山地区, 获得以下结果: 1)人工合成数据和唐山地区双差定位数据验证了本研究改进的DBSCAN算法可以自动识别断层。2)基于唐山地区双差精定位地震数据, 本研究自动识别出8个断裂段: 陡河断裂段、 巍山-丰南断裂段、 滦县-乐亭断裂段、 卢龙断裂段、 徐家楼-王喜庄断裂段、 滦县断裂北段、 雷庄断裂段和陈官屯断裂段。其中, 前5个断裂段的识别结果与前人研究结果基本一致; 后3条断裂段为文中基于地震目录新识别出的断层。可见, 文中给出的方法可自动化识别断层并获取断层参数, 这为断裂构造的识别提供了新的思路和方法。

关键词: 断层自动识别, 断层面参数, 唐山地区, DBSCAN, 无监督聚类技术

CLC Number: