A Time-Scaled ETAS Model for Earthquake Forecasting

Agniva Das
Department of Statistics, Faculty of Science
The Maharaja Sayajirao University of Baroda
Vadodara, Gujarat, India
agniva.d-statphd@msubaroda.ac.in
&Muralidharan K.
Department of Statistics, Faculty of Science
The Maharaja Sayajirao University of Baroda
Vadodara, Gujarat, India
muralikustat@gmail.com
Abstract

The Himalayan region, particularly Nepal, is highly susceptible to frequent and severe seismic activity, underscoring the urgent need for robust earthquake forecasting models. This study introduces a suite of time-scaled Epidemic-Type Aftershock Sequence (ETAS) models tailored for earthquake forecasting in Nepal, leveraging seismic data from 2000 to 2020. By incorporating alternative time-scaling approaches—such as calibration, proportional hazards, log-linear, and power time scales—the models capture nuanced temporal patterns of aftershocks, improving event classification between background and triggered occurrences. We evaluate model performance under various assumptions of earthquake magnitude distributions (exponential, gamma, and radially symmetric) and employ optimization techniques including the Davidon-Fletcher-Powell algorithm and Iterative Stochastic De-clustering. The results reveal that time-scaling significantly enhances model interpretability and predictive accuracy, with the ISDM-based ETAS model achieving the best fit. This work not only deepens the statistical understanding of earthquake dynamics in Nepal but also lays a foundation for implementing more effective early warning systems in seismically active regions.

Keywords Earthquake Prediction  \cdot Spatio-Temporal Point Process  \cdot Time-Scaled ETAS  \cdot Seismic Risk Modeling  \cdot Triggered vs Background Seismicity  \cdot Probabilistic Forecasting  \cdot Time Series Hazard Modeling  \cdot Maximum Likelihood Estimation  \cdot De-clustering Algorithm  \cdot Nepal Earthquake Data

1 Introduction

The Himalayan region, including Nepal, is prone to frequent and large earthquakes. Accurate forecasting of these earthquakes is crucial for minimizing loss of life and damage to infrastructure. Earthquake forecasting is a complex and interdisciplinary field, involving the integration of data from various sources, including seismology, geology, and statistics. One important aspect of earthquake forecasting is the study of aftershocks, which are smaller earthquakes that occur after a main shock. Aftershocks can provide valuable information on the state of the fault and the likelihood of future earthquakes. In this study, we propose a time-scaled Epidemic Type Aftershock Sequence (ETAS) model to forecast earthquakes in Nepal. The ETAS model is a statistical model that describes the temporal and spatial patterns of aftershocks following a main shock. The study of aftershocks following a main shock is crucial for understanding the state of the fault and the likelihood of future earthquakes [1, 2]. One important aspect of earthquake forecasting is the study of aftershocks, which are smaller earthquakes that occur after a main shock. Aftershocks can provide valuable information on the state of the fault and the likelihood of future earthquakes.

In recent years, various statistical models have been proposed for forecasting earthquakes, including the Epidemic Type Aftershock Sequence (ETAS) model [3]. The ETAS model is a statistical model that describes the temporal and spatial patterns of aftershocks following a main shock. The model is based on the assumption that the occurrence of aftershocks is influenced by both the main shock and the previous aftershocks.

The ETAS model [3] has been widely used in various regions around the world, including Japan [4], California [1], and Italy [2] wherein, the ETAS model was found to be able to accurately forecast earthquake occurrences.

However, the ETAS model has not been widely applied in the Himalayan region, including Nepal. The Himalayan region is characterized by a high level of seismic activity and a complex tectonic setting. Therefore, it is important to develop a model that is specifically tailored to the characteristics of this region.

In this study, we propose a time-scaled ETAS model to forecast earthquakes in Nepal. The time-scaled ETAS model takes into account the time-dependent behaviour of aftershocks, which is important for accurately forecasting earthquakes in regions with high seismic activity such as Nepal.

The dataset of earthquake occurrences in Nepal from 2000 to 2020 was collected and used to fit the time-scaled ETAS model. The model was then validated using the earthquake data from Nepal. The results of this study will provide valuable insights into the characteristics of earthquakes in Nepal and will be useful for earthquake early warning systems in the region.

The dataset used in this study was collected from ANSS global comprehensive catalogue (available at http://earthquake.usgs.gov/earthquakes/search/). It includes information on all earthquakes that occurred in Nepal between Jan 1, 1990 and May 31, 2022. The data includes the date, time, magnitude, and location (Nepal and surrounding areas including the rectangle formed by coordinates 2730273027\textendash 3027 – 30 Nsuperscript𝑁{}^{\circ}Nstart_FLOATSUPERSCRIPT ∘ end_FLOATSUPERSCRIPT italic_N and 7988798879\textendash 8879 – 88 Esuperscript𝐸{}^{\circ}Estart_FLOATSUPERSCRIPT ∘ end_FLOATSUPERSCRIPT italic_E) of each earthquake, with magnitude restriction m0=5.0subscript𝑚05.0m_{0}=5.0italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 5.0. We have used the depth variable for the proportional hazards time-scaling.

The dataset was cleaned and pre-processed to ensure that it met the requirements of the time-scaled ETAS model. This included removing any duplicate or missing data, and ensuring that the magnitude and location of each earthquake were accurate.

The dataset used in this study is the most comprehensive and up-to-date dataset of earthquakes in Nepal, and provides a valuable resource for studying the characteristics of earthquakes in the region.

The ETAS model is a statistical model that is used to describe the temporal and spatial patterns of earthquakes, specifically the aftershocks that follow a main shock event. The model is based on the assumption that the rate of aftershocks is influenced by both the magnitude and frequency of the main shock, as well as the spatial distribution of previous aftershocks [5]. It has been widely used in seismology to analyze seismic data and make predictions about future earthquake activity. Consider a temporal marked process X𝑋Xitalic_X consisting of N𝑁Nitalic_N events. Let (ti,xi,yi,mi)subscript𝑡𝑖subscript𝑥𝑖subscript𝑦𝑖subscript𝑚𝑖\left(t_{i},x_{i},y_{i},m_{i}\right)( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) denote the vector containing the time, longitude, latitude and magnitude of the ithsuperscript𝑖𝑡i^{th}italic_i start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT earthquake i=1,2,,N𝑖12𝑁i=1,2,\ldots,Nitalic_i = 1 , 2 , … , italic_N, whose distribution parametrized by constants β𝛽\betaitalic_β and θ𝜃\thetaitalic_θ, is marked by the conditional intensity function λ(t,x,y,mt)𝜆𝑡𝑥𝑦conditional𝑚subscript𝑡\lambda\left(t,\ x,\ y,\ m\ \mid\mathcal{H}_{t}\right)italic_λ ( italic_t , italic_x , italic_y , italic_m ∣ caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ), where, t={(ti,xi,yi,mi)Xti<t}subscript𝑡conditional-setsubscript𝑡𝑖subscript𝑥𝑖subscript𝑦𝑖subscript𝑚𝑖𝑋subscript𝑡𝑖𝑡\mathcal{H}_{t}=\left\{\left(t_{i},x_{i},y_{i},m_{i}\right)\in{X}\mid t_{i}<t\right\}caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = { ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ italic_X ∣ italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < italic_t } (occurrence history of the earthquakes up to time t𝑡titalic_t), satisfying

λ(t,x,y,m|t)=vβ(m).λθ(t,x,yt)formulae-sequence𝜆𝑡𝑥𝑦conditional𝑚subscript𝑡subscript𝑣𝛽𝑚subscript𝜆𝜃𝑡𝑥conditional𝑦subscript𝑡\lambda\left(t,\ x,\ y,\ m\ \right|\ \mathcal{H}_{t})=v_{\beta}\left(m\right).% \lambda_{\theta}\left(t,x,y\mid\mathcal{H}_{t}\right)italic_λ ( italic_t , italic_x , italic_y , italic_m | caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT ( italic_m ) . italic_λ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t , italic_x , italic_y ∣ caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) (1)

where,

vβ(m)=βexp[β(mm0)];β>0,m>m0formulae-sequencesubscript𝑣𝛽𝑚𝛽𝛽𝑚subscript𝑚0formulae-sequence𝛽0𝑚subscript𝑚0v_{\beta}\left(m\right)=\beta\exp{\left[-\beta\left(m-m_{0}\right)\right]};\ % \ \beta>0,m>m_{0}italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT ( italic_m ) = italic_β roman_exp [ - italic_β ( italic_m - italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ] ; italic_β > 0 , italic_m > italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (2)

is the probability density function of the magnitude of an event and

λθ(t,x,y|t)=u~(x,y)+i:ti<tκA,α(mi)gc,p(tti)fD,γ,q(xxi;yyi;mi)subscript𝜆𝜃𝑡𝑥conditional𝑦subscript𝑡~𝑢𝑥𝑦subscript:𝑖subscript𝑡𝑖𝑡subscript𝜅𝐴𝛼subscript𝑚𝑖subscript𝑔𝑐𝑝𝑡subscript𝑡𝑖subscript𝑓𝐷𝛾𝑞𝑥subscript𝑥𝑖𝑦subscript𝑦𝑖subscript𝑚𝑖\lambda_{\theta}\left(t,x,y\ \right|\ \mathcal{H}_{t})=\widetilde{u}\left(x,y% \right)+\sum_{i\colon t_{i}<t}{\kappa_{A,\alpha}\left(m_{i}\right)\ g_{c,p}% \left(t-t_{i}\right)\ f_{D,\gamma,q}\left(x-x_{i};y-y_{i};m_{i}\right)}italic_λ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t , italic_x , italic_y | caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = over~ start_ARG italic_u end_ARG ( italic_x , italic_y ) + ∑ start_POSTSUBSCRIPT italic_i : italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < italic_t end_POSTSUBSCRIPT italic_κ start_POSTSUBSCRIPT italic_A , italic_α end_POSTSUBSCRIPT ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_g start_POSTSUBSCRIPT italic_c , italic_p end_POSTSUBSCRIPT ( italic_t - italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT italic_D , italic_γ , italic_q end_POSTSUBSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (3)

where, u~(x,y)~𝑢𝑥𝑦\widetilde{u}(x,y)over~ start_ARG italic_u end_ARG ( italic_x , italic_y ) is the stationary background seismicity rate, κA,α(mi)subscript𝜅𝐴𝛼subscript𝑚𝑖\kappa_{A,\alpha}\left(m_{i}\right)italic_κ start_POSTSUBSCRIPT italic_A , italic_α end_POSTSUBSCRIPT ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is the expected number of triggered events (aftershocks) generated from an event of magnitude misubscript𝑚𝑖m_{i}italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and it can be expressed as

κA,α(mi)=Aexp[α(mm0)];m>m0formulae-sequencesubscript𝜅𝐴𝛼subscript𝑚𝑖𝐴𝛼𝑚subscript𝑚0𝑚subscript𝑚0\kappa_{A,\alpha}\left(m_{i}\right)=A\exp{\left[\alpha\left(m-m_{0}\right)% \right]};\ \ \ m>m_{0}italic_κ start_POSTSUBSCRIPT italic_A , italic_α end_POSTSUBSCRIPT ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_A roman_exp [ italic_α ( italic_m - italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ] ; italic_m > italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (4)

where, A>0𝐴0A>0italic_A > 0 and α>0𝛼0\alpha>0italic_α > 0 are unknown (fixed) parameters, gc,p(tti)subscript𝑔𝑐𝑝𝑡subscript𝑡𝑖g_{c,p}\left(t-t_{i}\right)italic_g start_POSTSUBSCRIPT italic_c , italic_p end_POSTSUBSCRIPT ( italic_t - italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is the probability density function of the occurrence time of a triggered event generated from an event of magnitude misubscript𝑚𝑖m_{i}italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT occurring at time tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, assumed to be a function of the time lag (tti)𝑡subscript𝑡𝑖(t-t_{i})( italic_t - italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and independent of misubscript𝑚𝑖m_{i}italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, expressed as

gc,p(tti)=(p1c)(1+ttic)p;t>tiformulae-sequencesubscript𝑔𝑐𝑝𝑡subscript𝑡𝑖𝑝1𝑐superscript1𝑡subscript𝑡𝑖𝑐𝑝𝑡subscript𝑡𝑖g_{c,p}\left(t-t_{i}\right)=\left(\frac{p-1}{c}\right)\left(1+\frac{t-t_{i}}{c% }\right)^{-p};\ \ t>t_{i}italic_g start_POSTSUBSCRIPT italic_c , italic_p end_POSTSUBSCRIPT ( italic_t - italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ( divide start_ARG italic_p - 1 end_ARG start_ARG italic_c end_ARG ) ( 1 + divide start_ARG italic_t - italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_c end_ARG ) start_POSTSUPERSCRIPT - italic_p end_POSTSUPERSCRIPT ; italic_t > italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (5)

where, c>0𝑐0c>0italic_c > 0 and p>1𝑝1p>1italic_p > 1 are unknown (fixed) parameters andfD,γ,q(xxi;yyi;mi)subscript𝑓𝐷𝛾𝑞𝑥subscript𝑥𝑖𝑦subscript𝑦𝑖subscript𝑚𝑖f_{D,\gamma,q}\left(x-x_{i};y-y_{i};m_{i}\right)italic_f start_POSTSUBSCRIPT italic_D , italic_γ , italic_q end_POSTSUBSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is the probability density function of the occurrence location of a triggered event generated from an event of magnitude misubscript𝑚𝑖m_{i}italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT occurring at the location (xi,yi)subscript𝑥𝑖subscript𝑦𝑖\left(x_{i},\ y_{i}\right)( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) assumed to be dependent on each other, such that the radially symmetric joint density function is given by

fD,γ,q(xxi;yyi;mi)subscript𝑓𝐷𝛾𝑞𝑥subscript𝑥𝑖𝑦subscript𝑦𝑖subscript𝑚𝑖\displaystyle f_{D,\gamma,q}\left(x-x_{i};y-y_{i};m_{i}\right)italic_f start_POSTSUBSCRIPT italic_D , italic_γ , italic_q end_POSTSUBSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) =q1πDexp[γ(mim0)]absent𝑞1𝜋𝐷𝛾subscript𝑚𝑖subscript𝑚0\displaystyle=\frac{q-1}{\pi\ D\exp{\left[\gamma\left(m_{i}-m_{0}\right)\right% ]}}= divide start_ARG italic_q - 1 end_ARG start_ARG italic_π italic_D roman_exp [ italic_γ ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ] end_ARG
×(1+(xxi)2+(yyi)2Dexp[γ(mim0)])qabsentsuperscript1superscript𝑥subscript𝑥𝑖2superscript𝑦subscript𝑦𝑖2𝐷𝛾subscript𝑚𝑖subscript𝑚0𝑞\displaystyle\times\left(1\ +\frac{\left(x-x_{i}\right)^{2}+\left(y-y_{i}% \right)^{2}}{D\exp{\left[\gamma\left(m_{i}-m_{0}\right)\right]}}\right)^{-q}× ( 1 + divide start_ARG ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_D roman_exp [ italic_γ ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ] end_ARG ) start_POSTSUPERSCRIPT - italic_q end_POSTSUPERSCRIPT (6)

where, D>0𝐷0D>0italic_D > 0, γ>0𝛾0\gamma>0italic_γ > 0 and q>1𝑞1q>1italic_q > 1 are unknown (fixed) parameters.
Note: This assumption is overlooked in the case where we have used the time-scaled ETAS later in this section. Instead, there we have assumed exponential and gamma distributions for event magnitudes.

Each event is categorised as either a background (spontaneous) event or an event that is triggered by a preceding event. The background events are produced by a Poisson process characterised by an intensity u~(x,y)~𝑢𝑥𝑦\widetilde{u}(x,y)over~ start_ARG italic_u end_ARG ( italic_x , italic_y ), which remains stationary over time. The relaxing coefficient μ𝜇\muitalic_μ in u~(x,y)=μu(x,y)~𝑢𝑥𝑦𝜇𝑢𝑥𝑦\widetilde{u}\left(x,y\right)=\mu\ u(x,y)over~ start_ARG italic_u end_ARG ( italic_x , italic_y ) = italic_μ italic_u ( italic_x , italic_y ) is introduced to accelerate the convergence of the model fitting algorithm. Prior events, whether classified as background or triggered, produce subsequent events in accordance with a non-stationary Poisson process characterised by the following intensity function:

i:ti<tκA,α(mi)gc,p(tti)fD,γ,q(xxi;yyi;mi)subscript:𝑖subscript𝑡𝑖𝑡subscript𝜅𝐴𝛼subscript𝑚𝑖subscript𝑔𝑐𝑝𝑡subscript𝑡𝑖subscript𝑓𝐷𝛾𝑞𝑥subscript𝑥𝑖𝑦subscript𝑦𝑖subscript𝑚𝑖\sum_{i\colon t_{i}<t}{\kappa_{A,\alpha}\left(m_{i}\right)\ g_{c,p}\left(t-t_{% i}\right)\ f_{D,\gamma,q}\left(x-x_{i};y-y_{i};m_{i}\right)}∑ start_POSTSUBSCRIPT italic_i : italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < italic_t end_POSTSUBSCRIPT italic_κ start_POSTSUBSCRIPT italic_A , italic_α end_POSTSUBSCRIPT ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_g start_POSTSUBSCRIPT italic_c , italic_p end_POSTSUBSCRIPT ( italic_t - italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT italic_D , italic_γ , italic_q end_POSTSUBSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (7)

The expected number of triggered events generated from a typical event, regardless of magnitude is

m0κA,α(m)vβ(m)𝑑m=Aββαsuperscriptsubscriptsubscript𝑚0subscript𝜅𝐴𝛼𝑚subscript𝑣𝛽𝑚differential-d𝑚𝐴𝛽𝛽𝛼\int_{m_{0}\ }^{\infty}\kappa_{A,\alpha}\left(m\right)\ v_{\beta}\left(m\right% )\ dm=\frac{A\beta}{\beta-\alpha}∫ start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_κ start_POSTSUBSCRIPT italic_A , italic_α end_POSTSUBSCRIPT ( italic_m ) italic_v start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT ( italic_m ) italic_d italic_m = divide start_ARG italic_A italic_β end_ARG start_ARG italic_β - italic_α end_ARG (8)

Now, Aββα<1𝐴𝛽𝛽𝛼1\frac{A\beta}{\beta-\alpha}<1divide start_ARG italic_A italic_β end_ARG start_ARG italic_β - italic_α end_ARG < 1 implies the existence of a stationary model, which in turn results in the total spatial intensity function being of the form:

Λ(x,y)Λ𝑥𝑦\displaystyle\Lambda\left(x,y\right)roman_Λ ( italic_x , italic_y ) =limT1T0Tλθ(t,x,y|t)𝑑tabsentsubscript𝑇1𝑇superscriptsubscript0𝑇subscript𝜆𝜃𝑡𝑥conditional𝑦subscript𝑡differential-d𝑡\displaystyle=\lim_{T\rightarrow\infty}{\frac{1}{T}\ \int_{0}^{T}{\lambda_{% \theta}\left(t,x,y\ \right|\ \mathcal{H}_{t})}\ dt}= roman_lim start_POSTSUBSCRIPT italic_T → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t , italic_x , italic_y | caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_d italic_t
=μu(x,y)+limT(1Ti:ti<TκA,α(mi)fD,γ,q(xxi;yyi;mi)tiTgc,p(tti)𝑑t)absent𝜇𝑢𝑥𝑦subscript𝑇1𝑇subscript:𝑖subscript𝑡𝑖𝑇subscript𝜅𝐴𝛼subscript𝑚𝑖subscript𝑓𝐷𝛾𝑞𝑥subscript𝑥𝑖𝑦subscript𝑦𝑖subscript𝑚𝑖superscriptsubscriptsubscript𝑡𝑖𝑇subscript𝑔𝑐𝑝𝑡subscript𝑡𝑖differential-d𝑡\displaystyle=\mu u\left(x,y\right)+\lim_{T\rightarrow\infty}{\left(\frac{1}{T% }\sum_{i\colon{t}_{i}<T}{\kappa_{A,\alpha}\left(m_{i}\right)\ f_{D,\gamma,q}% \left(x-x_{i};y-y_{i};m_{i}\right)\int_{t_{i}}^{T}{g_{c,p}\left(t-t_{i}\right)% \ dt}}\right)}= italic_μ italic_u ( italic_x , italic_y ) + roman_lim start_POSTSUBSCRIPT italic_T → ∞ end_POSTSUBSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∑ start_POSTSUBSCRIPT italic_i : italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < italic_T end_POSTSUBSCRIPT italic_κ start_POSTSUBSCRIPT italic_A , italic_α end_POSTSUBSCRIPT ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT italic_D , italic_γ , italic_q end_POSTSUBSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_c , italic_p end_POSTSUBSCRIPT ( italic_t - italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_d italic_t )
μu(x,y)+1Ti:ti<TκA,α(mi)fD,γ,q(xxi;yyi;mi)absent𝜇𝑢𝑥𝑦1𝑇subscript:𝑖subscript𝑡𝑖𝑇subscript𝜅𝐴𝛼subscript𝑚𝑖subscript𝑓𝐷𝛾𝑞𝑥subscript𝑥𝑖𝑦subscript𝑦𝑖subscript𝑚𝑖\displaystyle\approx\mu u\left(x,y\right)+\frac{1}{T}\sum_{i\colon{\ t}_{i}<T}% {\kappa_{A,\alpha}\left(m_{i}\right)f_{D,\gamma,q}\left(x-x_{i};y-y_{i};m_{i}% \right)}≈ italic_μ italic_u ( italic_x , italic_y ) + divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∑ start_POSTSUBSCRIPT italic_i : italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < italic_T end_POSTSUBSCRIPT italic_κ start_POSTSUBSCRIPT italic_A , italic_α end_POSTSUBSCRIPT ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT italic_D , italic_γ , italic_q end_POSTSUBSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (9)

and the clustering (triggering) coefficient which quantifies the clustering effect relative to the total spatial intensity at any location (x,y)𝑥𝑦\left(x,y\right)( italic_x , italic_y ) is given by

ω(x,y)=1u(x,y)Λ(x,y)𝜔𝑥𝑦1𝑢𝑥𝑦Λ𝑥𝑦\omega\left(x,y\right)=1-\frac{u\left(x,y\right)}{\Lambda\left(x,y\right)}italic_ω ( italic_x , italic_y ) = 1 - divide start_ARG italic_u ( italic_x , italic_y ) end_ARG start_ARG roman_Λ ( italic_x , italic_y ) end_ARG (10)

[6] suggested the probability of event j𝑗jitalic_j being a background event as

1pj=μu(xj,yj)λθ(tj,xj,yj|tj)1-p_{j}=\frac{\mu u\left(x_{j},\ y_{j}\right)}{\lambda_{\theta}\left(t_{j},\ x% _{j},\ y_{j}\middle|\mathcal{H}_{t_{j}}\right)}1 - italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = divide start_ARG italic_μ italic_u ( italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG italic_λ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | caligraphic_H start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG (11)

where, pj=i:ti<tjpijsubscript𝑝𝑗subscript:𝑖subscript𝑡𝑖subscript𝑡𝑗subscript𝑝𝑖𝑗p_{j}=\sum_{i\colon t_{i}<t_{j}}p_{ij}italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i : italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is interpreted as the probability that event j𝑗jitalic_j is triggered by a previous event, where

pij={κA,α(mi)gc,p(tti)fD,γ,q(xxi;yyi;mi)λθ(tj,xj,yj|tj)tj>ti0tjtip_{ij}=\begin{cases}\frac{\kappa_{A,\alpha}\left(m_{i}\right)\ g_{c,p}\left(t-% t_{i}\right)\ f_{D,\gamma,q}\left(x-x_{i};y-y_{i};m_{i}\right)}{\lambda_{% \theta}\left(t_{j},\ x_{j},\ y_{j}\middle|\mathcal{H}_{t_{j}}\right)}&t_{j}>t_% {i}\\ 0&t_{j}\leq t_{i}\end{cases}italic_p start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = { start_ROW start_CELL divide start_ARG italic_κ start_POSTSUBSCRIPT italic_A , italic_α end_POSTSUBSCRIPT ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_g start_POSTSUBSCRIPT italic_c , italic_p end_POSTSUBSCRIPT ( italic_t - italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT italic_D , italic_γ , italic_q end_POSTSUBSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_λ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | caligraphic_H start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG end_CELL start_CELL italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL end_ROW (12)

where pijsubscript𝑝𝑖𝑗p_{ij}italic_p start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is interpreted as the probability that event j𝑗jitalic_j is triggered by event i𝑖iitalic_i. Since, the log-likelihood function can be expressed as

l(β,θ|T)𝑙𝛽conditional𝜃subscript𝑇\displaystyle l\left(\beta,\theta\right|\mathcal{H}_{T})italic_l ( italic_β , italic_θ | caligraphic_H start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) =i=1Nδilog(λβ,θ(ti,xi,yi,mi|ti))absentsuperscriptsubscript𝑖1𝑁subscript𝛿𝑖subscript𝜆𝛽𝜃subscript𝑡𝑖subscript𝑥𝑖subscript𝑦𝑖conditionalsubscript𝑚𝑖subscriptsubscript𝑡𝑖\displaystyle=\sum_{i=1}^{N}{\delta_{i}\log{(\lambda_{\beta,\ \theta}(t_{i},\ % x_{i},y_{i},m_{i}|\mathcal{H}_{t_{i}}))}}= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ( italic_λ start_POSTSUBSCRIPT italic_β , italic_θ end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | caligraphic_H start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) )
m0tstarttstart+TSλβ,θ(t,x,y,m|t)𝑑x𝑑y𝑑t𝑑msuperscriptsubscriptsubscript𝑚0superscriptsubscriptsubscript𝑡𝑠𝑡𝑎𝑟𝑡subscript𝑡𝑠𝑡𝑎𝑟𝑡𝑇subscript𝑆subscript𝜆𝛽𝜃𝑡𝑥𝑦conditional𝑚subscript𝑡differential-d𝑥differential-d𝑦differential-d𝑡differential-d𝑚\displaystyle-\int_{m_{0}}^{\infty}\int_{t_{start}}^{t_{start}+T}\int\int_{S}{% \lambda_{\beta,\theta}\left(t,\ x,\ y,\ m\ \right|\ \mathcal{H}_{t})\ dx\ dy\ % dt\ dm}- ∫ start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_s italic_t italic_a italic_r italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_s italic_t italic_a italic_r italic_t end_POSTSUBSCRIPT + italic_T end_POSTSUPERSCRIPT ∫ ∫ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_β , italic_θ end_POSTSUBSCRIPT ( italic_t , italic_x , italic_y , italic_m | caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_d italic_x italic_d italic_y italic_d italic_t italic_d italic_m (13)

where,

δi={1iis a target event.0otherwise.subscript𝛿𝑖cases1𝑖is a target event.0otherwise.\delta_{i}=\begin{cases}1&i\ \text{is a target event.}\\ 0&\text{otherwise.}\end{cases}italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { start_ROW start_CELL 1 end_CELL start_CELL italic_i is a target event. end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL otherwise. end_CELL end_ROW

rendering it separable into two parts as

l(β,θT)=l1(βT)+l2(θT)𝑙𝛽conditional𝜃subscript𝑇subscript𝑙1conditional𝛽subscript𝑇subscript𝑙2conditional𝜃subscript𝑇l\left(\beta,\theta\mid\mathcal{H}_{T}\right)=l_{1}\left(\beta\ \mid\mathcal{H% }_{T}\right)+l_{2}\left(\theta\mid\mathcal{H}_{T}\right)italic_l ( italic_β , italic_θ ∣ caligraphic_H start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) = italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_β ∣ caligraphic_H start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) + italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_θ ∣ caligraphic_H start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) (14)

where,

l1(βT)subscript𝑙1conditional𝛽subscript𝑇\displaystyle l_{1}\left(\beta\ \mid\mathcal{H}_{T}\right)italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_β ∣ caligraphic_H start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) =i=1Nδilog(λβ,θ(ti,xi,yi,mi|ti))absentsuperscriptsubscript𝑖1𝑁subscript𝛿𝑖subscript𝜆𝛽𝜃subscript𝑡𝑖subscript𝑥𝑖subscript𝑦𝑖conditionalsubscript𝑚𝑖subscriptsubscript𝑡𝑖\displaystyle=\sum_{i=1}^{N}{\delta_{i}\log{(\lambda_{\beta,\ \theta}(t_{i},\ % x_{i},y_{i},m_{i}|\mathcal{H}_{t_{i}}))}}= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ( italic_λ start_POSTSUBSCRIPT italic_β , italic_θ end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | caligraphic_H start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) (15)
l2(θT)subscript𝑙2conditional𝜃subscript𝑇\displaystyle l_{2}\left(\theta\mid\mathcal{H}_{T}\right)italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_θ ∣ caligraphic_H start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) =m0tstarttstart+TSλβ,θ(t,x,y,m|t)𝑑x𝑑y𝑑t𝑑mabsentsuperscriptsubscriptsubscript𝑚0superscriptsubscriptsubscript𝑡𝑠𝑡𝑎𝑟𝑡subscript𝑡𝑠𝑡𝑎𝑟𝑡𝑇subscript𝑆subscript𝜆𝛽𝜃𝑡𝑥𝑦conditional𝑚subscript𝑡differential-d𝑥differential-d𝑦differential-d𝑡differential-d𝑚\displaystyle=\int_{m_{0}}^{\infty}\int_{t_{start}}^{t_{start}+T}\int\int_{S}{% \lambda_{\beta,\theta}\left(t,\ x,\ y,\ m\ \right|\ \mathcal{H}_{t})\ dx\ dy\ % dt\ dm}= ∫ start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_s italic_t italic_a italic_r italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_s italic_t italic_a italic_r italic_t end_POSTSUBSCRIPT + italic_T end_POSTSUPERSCRIPT ∫ ∫ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_β , italic_θ end_POSTSUBSCRIPT ( italic_t , italic_x , italic_y , italic_m | caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_d italic_x italic_d italic_y italic_d italic_t italic_d italic_m (16)

and the MLE of β𝛽\betaitalic_β is given by

β^=Ni=1Nδi(mim0)^𝛽superscript𝑁superscriptsubscript𝑖1𝑁subscript𝛿𝑖subscript𝑚𝑖subscript𝑚0\hat{\beta}=\frac{N^{\prime}}{\sum_{i=1}^{N}{\delta_{i}\left(m_{i}-m_{0}\right% )}}over^ start_ARG italic_β end_ARG = divide start_ARG italic_N start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG

and θ^^𝜃\hat{\theta}over^ start_ARG italic_θ end_ARG (MLE of θ𝜃\thetaitalic_θ) is obtained by minimizing

ξ(θ)=l2(θT)𝜉𝜃subscript𝑙2conditional𝜃subscript𝑇\xi\left(\theta\right)=-l_{2}\left(\theta\mid\mathcal{H}_{T}\right)italic_ξ ( italic_θ ) = - italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_θ ∣ caligraphic_H start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT )

iteratively using a suitable optimization algorithm. We have followed the example of [5, 4, 6] by using the Davidon-Fletcher-Powell algorithm. An alternative approach [7] that has been used to fit some of the time-scaled models is described below. [8, 9] and [10] have both proposed methods for time scaling that are alternatives to the ideal time scale. One alternative time scaling method proposed by [11] is called the "calibration time scale". Calibration time scale is a time scale that is used to make a non-identifiable model identifiable by adjusting the scaling of the time axis. The method uses a transformation of the time axis, which is chosen to make the model identifiable, without changing the underlying hazard function. Another alternative is the "Proportional Hazard Time scale" [12], which is based on the proportional hazards model. In this method, the hazard function is assumed to be proportional to a baseline hazard function and an explanatory variable. The time scale is chosen such that the coefficient of the explanatory variable is equal to one, which simplifies the interpretation of the model.

Here are a few examples of different time scale options [13, 14, 15, 16] and their corresponding assumptions:

  • Ideal Time Scale: The hazard function is constant over time. This assumption is often made when there is no clear understanding of the underlying process generating the data, or when the data is too sparse to accurately estimate a more complex hazard function. This can be represented as tZ=ϕ1(t)=tsuperscript𝑡𝑍subscriptitalic-ϕ1𝑡𝑡t^{Z}=\phi_{1}\left(t\right)=titalic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) = italic_t, say.

  • Calibration Time Scale: The hazard function is non-constant over time, but a transformation of the time axis is used to make the model identifiable without changing the underlying hazard function. The method assumes that the model is non-identifiable without the calibration time scale, but identifiable with it. Here, tZ=ϕ2(t,ω)=tωsuperscript𝑡𝑍subscriptitalic-ϕ2𝑡𝜔𝑡𝜔t^{Z}=\phi_{2}\left(t,\ \omega\right)=\frac{t}{\omega}italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t , italic_ω ) = divide start_ARG italic_t end_ARG start_ARG italic_ω end_ARG, say.

  • Proportional Hazards Time Scale: The hazard function is proportional to a baseline hazard function and an explanatory variable. The time scale is chosen such that the coefficient of the explanatory variable is equal to one, which simplifies the interpretation of the model. This method assumes that the hazard function is proportional to a baseline hazard function and an explanatory variable. Mathematically, tZ=ϕ3(t,Z(t))=tZ(t)superscript𝑡𝑍subscriptitalic-ϕ3𝑡𝑍𝑡𝑡𝑍𝑡t^{Z}=\phi_{3}\left(t,Z\left(t\right)\right)=\frac{t}{Z\left(t\right)}italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_t , italic_Z ( italic_t ) ) = divide start_ARG italic_t end_ARG start_ARG italic_Z ( italic_t ) end_ARG, say.

  • Log-Linear Time Scale: The hazard function is a log-linear function of time. This method assumes that the hazard function is a log-linear function of time, which can be useful when the data exhibit a clear trend over time. Here, tZ=ϕ4(t)=logtsuperscript𝑡𝑍subscriptitalic-ϕ4𝑡𝑡t^{Z}=\phi_{4}\left(t\right)=\log{t}italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ( italic_t ) = roman_log italic_t, say.

  • Power Time Scale: The hazard function is a power function of time. This method assumes that the hazard function is a power function of time, which can be useful when the data exhibit a clear trend over time. Hence, we have tZ=ϕ5(t,ω)=tωsuperscript𝑡𝑍subscriptitalic-ϕ5𝑡𝜔superscript𝑡𝜔t^{Z}=\phi_{5}\left(t,\omega\right)=t^{\omega}italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ( italic_t , italic_ω ) = italic_t start_POSTSUPERSCRIPT italic_ω end_POSTSUPERSCRIPT, say.

It’s important to note that the choice of time scale can have a significant impact on the results and interpretation of the model and it’s essential to validate the assumptions and check the results with other models or different time scales.

The time-scaled ETAS model is a statistical framework that characterises the temporal and spatial distributions of aftershocks following a main shock. It modifies the time scale to facilitate the differentiation between background events and triggered events based on the intervals between occurrences. The model utilises the Epidemic Type Aftershock Sequence (ETAS) framework and incorporates the time-dependent characteristics of aftershocks. A set of factors, such as the main shock’s intensity, the background rate of earthquakes, and the pace at which aftershocks decrease over time, constitute the model. The parameters are estimated from the data utilising a maximum likelihood estimation technique. A collection of spatial and temporal kernels, estimated via maximum likelihood estimation, define the spatio-temporal component of the time-scaled ETAS model, which also takes into consideration the temporal clustering of aftershocks and the spatial distribution of earthquakes.

We have obtained results for model performance of the ETAS model via two approaches. The first approach leads to assuming the ETAS model as a Marked Point Process and maximizing the resulting log-likelihood (read: minimizing negative log-likelihood) iteratively using the optimization procedure proposed by [17]. By this approach, we have, for each type of time-scaled data, considered the ETAS model as a Ground Intensity Function for two models where the underlying probability distribution of event magnitudes is either exponential or gamma, and estimates of the parameters have been thus compared. The second approach entails a similar path, except that for likelihood maximization (or negative log likelihood minimization), we opt for the Davidon – Fletcher – Powell (DFP) algorithm for parameter optimization, similar to the model proposed by [3, 6, 4], coined Iterative Stochastic De-clustering Method (ISDM) in the paper [5].

2 Results

Table 1 compares the outputs of the ETAS model (un-scaled time) as fitted using the Iterative Stochastic De-clustering Method (assuming that event magnitudes follow a radially symmetric joint density function) and the alternative models where the ETAS model is used as a Ground Intensity Function, assuming exponential and gamma distributions of event magnitudes.

Table 1: Parameter Estimates using ETAS model as Ground Intensity Function {tZ=ϕ1(t)=t}superscript𝑡𝑍subscriptitalic-ϕ1𝑡𝑡\left\{{t}^{Z}={\phi}_{1}\left({t}\right)={t}\right\}{ italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) = italic_t }
Parameter Exponential Model Gamma Model
μ𝜇\muitalic_μ 0.001451 0.001398
A𝐴Aitalic_A 5.3×1055.3superscript1055.3\times 10^{-5}5.3 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT 6.91×1056.91superscript1056.91\times 10^{-5}6.91 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT
α𝛼\alphaitalic_α 1.713889 1.647376
c𝑐citalic_c 0.540771 0.59556
p𝑝pitalic_p 1.133662 1.126354
D𝐷Ditalic_D 0.228004 0.265632
β𝛽\betaitalic_β 0.367643
Log-likelihood -7837.83 -7795.65

Clearly, we can see that both the models perform equally in this regard, although the Gamma model fares marginally better than the Exponential model in this case. For both the models that we see above,μ𝜇\muitalic_μ or the background seismicity rate of the process represents the average number of events occurring per unit time in the absence of any triggering effect, which is very low, indicating that more often, the earthquakes occurring in this region have been due to some triggering event. α𝛼\alphaitalic_α, the triggering rate parameter is seen to be almost identical, and is slightly higher in the exponential model than the gamma model indicating that using the gamma model may result in some relatively negligible underestimation of the triggering rate. The gamma model does have a more pronounced clustering parameter c, which represents the probability of an event being able to trigger one or more consequent events. The parameter p, or the probability distribution parameter of the model which represents the distribution of the time between an event and its triggered events, is also slightly higher in the exponential model than the gamma model. This may be attributed to the parameter β𝛽\betaitalic_β, which is responsible for the variation in the magnitude of earthquakes in the region. D𝐷Ditalic_D denotes the spatial decay parameter of the model and it represents the rate at which the triggering effect decreases with distance. As can be inferred from the table itself, we can see that they are almost identical in case of either exponential or gamma distribution of magnitudes. A may be interpreted as the minimum number of aftershocks expected in case of an earthquake at minimum threshold m=m0𝑚subscript𝑚0m=m_{0}italic_m = italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Again, as the estimate of A is very close to zero, we can expect that aftershocks are rare, and for number of aftershocks to be high, the magnitude of the triggering earthquake needs to be very high.

However, comparing the same model with the ETAS model fitted using ISDM and likelihood optimized using the DFP algorithm, we have the results in Table 2.

Table 2: Parameter Estimates using ETAS model using ISDM optimized using DFP {tZ=ϕ1(t)=t}superscript𝑡𝑍subscriptitalic-ϕ1𝑡𝑡\left\{{t}^{Z}={\phi}_{1}\left({t}\right)={t}\right\}{ italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) = italic_t }
Parameter Exponential Model Gamma Model
μ𝜇\muitalic_μ 1.1153 0.0383
A𝐴Aitalic_A 0.2102 0.0906
α𝛼\alphaitalic_α 1.5979 0.0263
c𝑐citalic_c 0.0071 0.1552
p𝑝pitalic_p 1.1395 0.0139
β𝛽\betaitalic_β 3.5912 0.124
D𝐷Ditalic_D 0.0033 0.191
q𝑞qitalic_q 1.8398 0.0457
γ𝛾\gammaitalic_γ 1.2236 0.073
Log-likelihood -497.874 (AIC = 1011.749)

From Table 2, we can safely say that the ISDM approach yielded much better results from the point of view of fit. From the interpretation viewpoint as well, we can see that this set of parameter estimates indicate significantly higher background intensity, higher value of number of aftershocks at minimum magnitude threshold (approximately 1 in 5, in this case), almost identical α𝛼\alphaitalic_α indicating that the triggering rate estimate may be accurate. We see a significantly lower value of c𝑐citalic_c in this case, which may indicate low clustering rate, but we have applied an iteratively de-clustered algorithm here, so the lower value of c𝑐citalic_c may be justified. The p𝑝pitalic_p parameter may be identical for both the models, indicating that the distribution of time between earthquakes may be similar, although the same cannot be said for D𝐷Ditalic_D, which is lower than that of the previous model, indicating that the rate of decrease in triggering effect per unit distance is lower. Additionally, a significantly higher value of β𝛽\betaitalic_β indicates higher variation in the magnitude over the entire area.γ𝛾\gammaitalic_γ and q𝑞qitalic_q are the shape and the scale parameters of the radially symmetric PDF fD,γ,q(xxi;yyi;mi)subscript𝑓𝐷𝛾𝑞𝑥subscript𝑥𝑖𝑦subscript𝑦𝑖subscript𝑚𝑖f_{D,\gamma,q}\left(x-x_{i};y-y_{i};m_{i}\right)italic_f start_POSTSUBSCRIPT italic_D , italic_γ , italic_q end_POSTSUBSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_y - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) respectively, and may be treated as hyperparameters, to be used for tuning the model for better accuracy. Tables 3, 4, 5, 6 and 7 are models fitted using the ETAS Ground Intensity Function with Exponential and Gamma event magnitude approach after using various time-scales discussed earlier.

Table 3: Parameter Estimates using ETAS model as Ground Intensity Function {tZ=ϕ2(t,ω)=tω,ω=5}formulae-sequencesuperscript𝑡𝑍subscriptitalic-ϕ2𝑡𝜔𝑡𝜔𝜔5\left\{{t}^{Z}={\phi}_{2}\left({t},\ {\omega}\right)=\frac{{t}}{{\omega}},\ {% \omega}={5}\right\}{ italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t , italic_ω ) = divide start_ARG italic_t end_ARG start_ARG italic_ω end_ARG , italic_ω = 5 }
Parameter Exponential Model Gamma Model
μ𝜇\muitalic_μ 0.01451 0.01397782
A𝐴Aitalic_A 0.00053 0.000691442
α𝛼\alphaitalic_α 1.71387 1.647391
c𝑐citalic_c 0.05408 0.05955264
p𝑝pitalic_p 1.13366 1.126352
D𝐷Ditalic_D 0.228 0.2656316
β𝛽\betaitalic_β 0.116257
Log-likelihood -5433.9 -5391.755
Table 4: Parameter Estimates using ETAS model as Ground Intensity Function {tZ=ϕ2(t,ω)=tω,ω=1000}formulae-sequencesuperscript𝑡𝑍subscriptitalic-ϕ2𝑡𝜔𝑡𝜔𝜔1000\left\{{t}^{Z}={\phi}_{2}\left({t},\ {\omega}\right)=\frac{{t}}{{\omega}},\ {% \omega}={1000}\right\}{ italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t , italic_ω ) = divide start_ARG italic_t end_ARG start_ARG italic_ω end_ARG , italic_ω = 1000 }
Parameter Exponential Model Gamma Model
μ𝜇\muitalic_μ 1.450546 1.397766
A𝐴Aitalic_A 0.05301076 0.06915157
α𝛼\alphaitalic_α 1.713874 1.65
c𝑐citalic_c 0.000540705 0.000595452
p𝑝pitalic_p 1.133651 1.126342
D𝐷Ditalic_D 0.2280044 0.2656312
β𝛽\betaitalic_β 0.01162555
Log-likelihood -626.1306 -583.9576

Here, we can see that increasing the time scale to near 1000 provides better log-likelihood. However, higher values of ω𝜔\omegaitalic_ω have not yielded better results. One may use a search algorithm to find a scale parameter to maximize the log-likelihood, but such algorithms may require costlier processing hardware. The parameter estimates in Table 4 is closer in interpretation to the ISDM model than the unscaled ETAS model with exponentially or gamma distributed magnitudes and also induces better fit in that regard. For the results in Table 5, we have used the average depth of all minor earthquakes, between two major quakes (magnitude>5)magnitude5(\text{magnitude}>5)( magnitude > 5 ), as a usage measure.

Table 5: Parameter Estimates using ETAS model as Ground Intensity Function {tZ=ϕ3(t,Z(t))=tZ(t)}superscript𝑡𝑍subscriptitalic-ϕ3𝑡𝑍𝑡𝑡𝑍𝑡\left\{{t}^{Z}={\phi}_{3}\left({t},\ {Z}\left({t}\right)\right)=\frac{{t}}{{Z}% \left({t}\right)}\right\}{ italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_t , italic_Z ( italic_t ) ) = divide start_ARG italic_t end_ARG start_ARG italic_Z ( italic_t ) end_ARG }
Parameter Exponential Model Gamma Model
μ𝜇\muitalic_μ 0.0002956871 0.0002937223
A𝐴Aitalic_A 0.2932234 0.3480650
α𝛼\alphaitalic_α 0.5707210 0.4834184
c𝑐citalic_c 0.03981759 0.05134718
p𝑝pitalic_p 1.113115 1.117977
D𝐷Ditalic_D 0.2280044 0.3000280
β𝛽\betaitalic_β 0.3114590
Log-likelihood -5148.079 -5075.610

Just like the rest of the cases, the Gamma model fares only slightly better than the Exponential model. However, the choice of the Depth variable as a usage measure is not found to be suitable for scaling purposes. The parameter estimates are closer to the unscaled ETAS model than the iteratively de-clustered ETAS model. Hence using depth as a covariate for time scaling seems unsuitable. One may opt for other usage measures like increase in height of a particular Himalayan range per unit time, which may be an interesting variable to use in this regard. Another variable that may be potentially used as a usage measure can be the rate of depletion of ice caps in the Himalayas with time. Some interesting outtakes may potentially arise from such analyses. However, collection of such data may be arduous at the ground level, as government records may not even be available in certain cases.

Table 6: Parameter Estimates using ETAS model as Ground Intensity Function {tZ=ϕ1(t)=t}superscript𝑡𝑍subscriptitalic-ϕ1𝑡𝑡\left\{{t}^{Z}={\phi}_{1}\left({t}\right)={t}\right\}{ italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) = italic_t }
Parameter Exponential Model Gamma Model
μ𝜇\muitalic_μ 0.66566 0.21001
A𝐴Aitalic_A 0.12828 2.19703
α𝛼\alphaitalic_α 1.36808 0.9307
c𝑐citalic_c 5.47117 0.58414
p𝑝pitalic_p 0.91649 1.32135
D𝐷Ditalic_D 0.228 1.57623
β𝛽\betaitalic_β 1.57623
Log-likelihood -4461.1 -3637.2

Although the log-scale does not induce a good enough log-likelihood to compete with the calibration or proportional hazards time scale, it does fare significantly better than the unscaled model. This goes to show that a combination of other scales with the log scale may potentially provide a better model.

Table 7: Parameter Estimates using ETAS model as Ground Intensity Function {tZ=ϕ1(t)=t}superscript𝑡𝑍subscriptitalic-ϕ1𝑡𝑡\left\{{t}^{Z}={\phi}_{1}\left({t}\right)={t}\right\}{ italic_t start_POSTSUPERSCRIPT italic_Z end_POSTSUPERSCRIPT = italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) = italic_t }
Parameter Exponential Model Gamma Model
μ𝜇\muitalic_μ 0.2996577 0.2603446
A𝐴Aitalic_A 0.2401597 0.3450314
α𝛼\alphaitalic_α 1.48376 1.39805
c𝑐citalic_c 0.0002384452 0.0002525749
p𝑝pitalic_p 0.9817204 0.9741814
D𝐷Ditalic_D 0.2280044 0.2703550
β𝛽\betaitalic_β 0.01578156
Log-likelihood -974.8525 -928.1033

The Power Scale taken at ω=12𝜔12\omega=\frac{1}{2}italic_ω = divide start_ARG 1 end_ARG start_ARG 2 end_ARG give the best result at a log-likelihood in the neighbourhood of negative one thousand, which is still better than the log-linear scale or even the proportional hazards scale. Here again, the estimates of model parameters indicate a situation closer that estimated by the iteratively de-clustered model. One may note that usage of calibration time scale in this regard, translates to easier likelihood optimization as opposed to other scales. One may use, as in our case, iterative algorithms in search of calibration parameter values that maximize log-likelihood. Using the power-scale implies that a stronger algorithm may be required to find that optimum.

However, the calibration time scale performs best when compared with the other time scales in terms of model fit. Having said that, the best performance has been that of the model generated using ISDM optimized using FDP algorithm. The figures provided below may provide some insight onto the earthquake patterns observed in Nepal since January 11, 1990.

Fig. 1 indicates an outlier at April 25, 2015, where multiple great earthquakes occurred in quick succession in the regions as indicated by the circles in the above map.

Refer to caption
Figure 1: Location of epicentres (top-left panel), logarithm of frequency by magnitude (bottom-left panel), cumulative frequency v/s time (bottom-middle panel) and latitude, longitude and magnitude v/s time (right panels) of earthquakes with magnitude 5absent5\geq 5≥ 5 occurring between 1990-01-11 and 2022-05-20 in Nepal and its proximity (27– 30 Nsuperscript𝑁{}^{\circ}Nstart_FLOATSUPERSCRIPT ∘ end_FLOATSUPERSCRIPT italic_N and 79–89 EsuperscriptE{}^{\circ}\text{E}start_FLOATSUPERSCRIPT ∘ end_FLOATSUPERSCRIPT E), extracted from the ANSS global comprehensive catalogue

Fig. 2 marks the spatial neighbourhoods where background intensity is higher and neighbourhoods that are clustering prone.

Refer to caption
Figure 2: Plots of estimates of the background seismicity rate, total spatial intensity, clustering (triggering) coefficient and conditional intensity at t=tstart+T𝑡subscript𝑡𝑠𝑡𝑎𝑟𝑡𝑇t=t_{start}+Titalic_t = italic_t start_POSTSUBSCRIPT italic_s italic_t italic_a italic_r italic_t end_POSTSUBSCRIPT + italic_T for the fitted ETAS model to the Nepal catalogue

From Fig. 2, it can be said that background seismicity rate is higher in the north-western part of Nepal (Karnali region), whereas the total spatial seismicity rate and the clustering coefficient is significantly higher in central Nepal (Kathmandu and Narayani region). This indicates a higher potential of powerful aftershocks in the central Nepal region. However, earthquakes are more frequent in the North-Western region.

Refer to caption
Figure 3: Diagnostic plots for the fitted ETAS model to the Nepal catalogue: the temporal residuals (top-left), smoothed spatial residuals (top-right), transformed times τisubscript𝜏𝑖\tau_{i}italic_τ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT against i𝑖iitalic_i (bottom-left) and the Q-Q plot of Uisubscript𝑈𝑖U_{i}italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (bottom-right)

Fig. 3(top-left and top-right) shows the results of model diagnostics performed on the ETAS model. It has been observed that earthquakes closer to the Kathmandu region have low residuals, meaning that the background intensity and the clustering coefficient of that region is well represented by the model. However, as we move eastwards or westwards, we see a sharp increase in the residuals. This may be the result of varying levels of terrain elevation that have not been accounted for in our models. There may also be other underlying factors that need to be investigated in greater detail. In Fig. 3(bottom-left, and bottom-right), it can be seen that points on the plots of transformed times τjsubscript𝜏𝑗\tau_{j}italic_τ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and the Q-Q plot of Ujsubscript𝑈𝑗U_{j}italic_U start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT lie approximately on the y=x𝑦𝑥y=xitalic_y = italic_x line. To confirm, we performed a Kolmogorov-Smirnov test for goodness-of-fit. The p𝑝pitalic_p-value of 0.4386(>0.05)annotated0.4386absent0.050.4386(>0.05)0.4386 ( > 0.05 ) indicates that Ujsubscript𝑈𝑗U_{j}italic_U start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT follows a U(0,1)𝑈01U\left(0,1\right)italic_U ( 0 , 1 ) distribution, at a 5%percent55\%5 % level of significance.

Refer to caption
Figure 4: Most likely (Probability >0.95absent0.95>0.95> 0.95) background and triggered events detected by estimated de-clustering probabilities pjsubscript𝑝𝑗p_{j}italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT

Fig. 4 indicates that most of the quakes near the Kathmandu – Janakpur region are triggered whereas quakes occurring in the North-West region (Karnali region) mostly consist of background seismic activity.

3 Conclusions

From the results obtained above, the following outtakes can be derived:

  • The Davidon-Powell-Fletcher algorithm, fares better than the Nelder-Mead Optimization algorithm for maximum likelihood estimation in this case. Additionally, the probability distribution of the magnitude, if taken to be either exponential or gamma, fits very poorly on the data. The radially symmetric density assumption on the other hand yields a superior fit.

  • Judging by interpretability, we should take into account, the fact that, in case of the exponential or gamma assumption of magnitude distribution, the inherent interpretability of the model is increased, which loosely translates to better foundations for decision-making. The parameters of the radially symmetric density are complex to interpret and to base decisions on. However, they may be used as hyper-parameters in “deep”-er models, for obtaining better forecasts.

  • Although by a very small margin, among the theoretical distributions that have been postulated for magnitude probability distribution, the gamma model fares better in all types of time scales than the exponential model. That may be due to an extra parameter which makes the difference. However, having said that, the cost of including an extra parameter may prove costlier in terms of likelihood per additional covariate, if too many instances are to be trained to the model.

  • The residual map indicates lower values of residuals in the central part of the country whereas as we go from the centre outwards, residuals become higher. This may be a direct connection with the extent of urbanization in those areas to the magnitude and frequency. The areas having low variance in raw spatial residuals, are the more urbanized areas of the country, whereas relatively less inhabited areas have very high variation of magnitude. This may be solved by scaling the magnitudes along with time.

  • The conditional density and the clustering coefficient for central locations of the country have a higher probability of aftershocks than the locations, and as we go eastwards or westwards from the centre. One may take the usage measure as some variable highly correlated with the magnitude for this purpose.

Although a lot of research in the sphere of seismological modelling is centered around exploiting the flexibility offered by the ETAS model to accommodate additional constraints or to bypass assumptions so as to introduce robustness, as evident by the recent works [18, 19, 20, 21, 22, 23], and the recent advances in the domain of time-scaling for lifetime data [24, 25, 26], collaboration between researchers in these two spheres seldom occurs, resulting in dearth of opportunities and unexplored possibilities. Ours seems to be one of the earlier attempts at bridging these two domains in comprehensive manner. However, the future seems hopeful, as there lies tremendous prospect in cross-collaborations with the computational domain as well, given that research in deep modelling is at the peak, with deep models for both, seismicity as well as for time-between-event responses, is now commonplace.

Acknowledgments

This was was supported in part by the Department of Statistics, Faculty of Sciences, The Maharaja Sayajirao University of Baroda, Vadodara, Gujarat, India, PIN–390002.

References

  • [1] Laura Gulia and Stefan Wiemer. The influence of tectonic regimes on the earthquake size distribution: A case study for italy. Geophysical Research Letters, 37(10), 2010.
  • [2] Stefan Wiemer and Kei Katsumata. Spatial variability of seismicity parameters in aftershock zones. Journal of Geophysical Research: Solid Earth, 104(B6):13135–13151, 1999.
  • [3] Yosihiko Ogata. Statistical models for earthquake occurrences and residual analysis for point processes. Journal of the American Statistical association, 83(401):9–27, 1988.
  • [4] Yosihiko Ogata and Jiancang Zhuang. Space–time etas models and an improved extension. Tectonophysics, 413(1-2):13–23, 2006.
  • [5] Abdollah Jalilian. Etas: an r package for fitting the space-time etas model to earthquake data. Journal of Statistical Software, 88:1–39, 2019.
  • [6] Jiancang Zhuang, Yosihiko Ogata, and David Vere-Jones. Analyzing earthquake clustering features by using stochastic reconstruction. Journal of Geophysical Research: Solid Earth, 109(B5), 2004.
  • [7] Rupal Shah, K Muralidharan, and Ayush Parajuli. Temporal point process models for nepal earthquake aftershocks. International Journal of Statistics and Reliability Engineering, 7(2):275–285, 2020.
  • [8] Thierry Duchesne and Jerry Lawless. Alternative time scales and failure time models. Lifetime data analysis, 6:157–179, 2000.
  • [9] Thierry Duchesne and Jerry Lawless. Semiparametric inference methods for general time scale models. Lifetime Data Analysis, 8:263–276, 2002.
  • [10] Jerald F Lawless. Statistical models and methods for lifetime data. John Wiley & Sons, 2011.
  • [11] Thierry Duchesne. Multiple Time Scales in Survival Analysis. PhD thesis, University of Waterloo, 1999.
  • [12] Kh B Kordonsky and I Gertsbakh. Multiple time scales and the lifetime coefficient of variation: engineering applications. Lifetime data analysis, 3:139–156, 1997.
  • [13] Kh B Kordonsky and IB Gertsbakh. Choice of the best time scale for system reliability analysis. European Journal of Operational Research, 65(2):235–246, 1993.
  • [14] Kh B Kordonsky and IB Gertsbakh. System state monitoring and lifetime scales—i. Reliability Engineering & System Safety, 47(1):1–14, 1995.
  • [15] David Oakes. Multiple time scales in survival analysis. Lifetime data analysis, 1:7–18, 1995.
  • [16] Christof Sumereder and Hans Michael Muhr. Estimation of residual lifetime-theory and practical problems. In 8th Höfler’s Days. Elektroinstitut Milan Vidmar, 2005.
  • [17] John A Nelder and Roger Mead. A simplex method for function minimization. The computer journal, 7(4):308–313, 1965.
  • [18] Emrah Altun, Deepesh Bhati, and Naushad Mamode Khan. A new approach to model the counts of earthquakes: Inarpqx (1) process. SN Applied Sciences, 3:1–17, 2021.
  • [19] Nader Davoudi, Hamid Reza Tavakoli, Mehdi Zare, and Abdollah Jalilian. Aftershock probabilistic seismic hazard analysis for bushehr province in iran using etas model. Natural Hazards, 100(3):1159–1170, 2020.
  • [20] F Hirose, K Tamaribuchi, and K Maeda. Characteristics of foreshocks revealed by an earthquake forecasting method based on precursory swarm activity. Journal of Geophysical Research: Solid Earth, 126(9):e2021JB021673, 2021.
  • [21] Sujun Hong and Hirotaka Hachiya. Multi-stream based marked point process. In Asian Conference on Machine Learning, pages 1269–1284. PMLR, 2021.
  • [22] Samarth Motagi, Sirish Namilae, Audrey Gbaguidi, Scott Parr, and Dahai Liu. Point-process modeling of secondary crashes. Plos one, 18(12):e0295343, 2023.
  • [23] H Tavakoli, Nader Davoodi, Abdollah Jalilian, and Mehdi Zare. Aftershock forcasting experiment for bushehr province of iran using the epidemic-type aftershock sequence (etas) model. In 16th European Conf. Earthquake Engineering, pages 18–21, 2018.
  • [24] Jerald F Lawless and Martin J Crowder. Models and estimation for systems with recurrent events and usage processes. Lifetime data analysis, 16:547–570, 2010.
  • [25] Anastasiya Nekrasova and Vladimir Kossobokov. Seismic risk assessment for the infrastructure in the regions adjacent to the russian federation baikal–amur mainline based on the unified scaling law for earthquakes. Natural Hazards, 116(2):1995–2010, 2023.
  • [26] Annamraju Syamsundar, Vallayil N Achutha Naikan, and Shaomin Wu. Alternative scales in reliability models for a repairable system. Reliability Engineering & System Safety, 193:106599, 2020.