Line 13: Line 13:
 
==1 Introduction==
 
==1 Introduction==
  
Wildfires are complex phenomena with serious socio-environmental consequences, including economic and biodiversity losses, among others. Anthropogenic factors are responsible for nearly all wildfires in Mexico State, according to data from the National Forestry Commission (Conafor, its Spanish acronym) [serieconafor] (see Figure [[#img-1|1]]).
+
Wildfires are complex phenomena with serious socio-environmental consequences, including economic and biodiversity losses, among others. Anthropogenic factors are responsible for nearly all wildfires in Mexico State, according to data from the National Forestry Commission (Conafor, its Spanish acronym) <span id='citeF-1'></span>[[#cite-1|[1]]] (see Figure [[#img-1|1]]).
  
 
<div id='img-1'></div>
 
<div id='img-1'></div>
Line 23: Line 23:
 
|}
 
|}
  
There is plenty of specialized literature available on wildfires (see [rodriguez2014incendios] and [rodriguez2015incendios]). The authors of [garcia2012analisis] use a logistic regression model to assess the risk of wildfire in Puebla, Mexico, taking into account land cover, meteorological, topographic, and social variables. Using two different data sources: Conafor's open data and Modis' (Moderate Resolution Imaging Spectroradiometer) data, the authors of [de2017spatial] show that wildfire spatial patterns in Mexico tend to cluster. The spatial and temporal relationships between Conafor's wildfire records from 2005 to 2015 and the Standardized Precipitation-Evapotranspiration Index (SPEI) were investigated [marin2018drought]. Machine learning techniques were used to determine the wildfire propensity in Mexico using Conafor's open data [hernandezprediccion].
+
There is plenty of specialized literature available on wildfires (see <span id='citeF-2'></span>[[#cite-2|[2]]] and <span id='citeF-3'></span>[[#cite-3|[3]]]). The authors of <span id='citeF-4'></span>[[#cite-4|[4]]] use a logistic regression model to assess the risk of wildfire in Puebla, Mexico, taking into account land cover, meteorological, topographic, and social variables. Using two different data sources: Conafor's open data and Modis' (Moderate Resolution Imaging Spectroradiometer) data, the authors of <span id='citeF-5'></span>[[#cite-5|[5]]] show that wildfire spatial patterns in Mexico tend to cluster. The spatial and temporal relationships between Conafor's wildfire records from 2005 to 2015 and the Standardized Precipitation-Evapotranspiration Index (SPEI) were investigated <span id='citeF-6'></span>[[#cite-6|[6]]]. Machine learning techniques were used to determine the wildfire propensity in Mexico using Conafor's open data <span id='citeF-7'></span>[[#cite-7|[7]]].
  
 
The spatio-temporal behavior of wildfires could be critical for improving fire management strategies. The point processes approach can be used to model random events in time, space, or space-time, such as wildfires. In this study, we used point processes theory to describe the spatio-temporal behavior of wildfires in Mexico State from 2010 to 2018.
 
The spatio-temporal behavior of wildfires could be critical for improving fire management strategies. The point processes approach can be used to model random events in time, space, or space-time, such as wildfires. In this study, we used point processes theory to describe the spatio-temporal behavior of wildfires in Mexico State from 2010 to 2018.
Line 29: Line 29:
 
==2 Point processes basic theory==
 
==2 Point processes basic theory==
  
A point process is a random set in which the number of points and their locations are both random [baddeley2008analysing]. A point process could occur in any completely separable metric space <math display="inline">\mathcal{S}</math>, such as <math display="inline">d</math>-dimensional Euclidean space <math display="inline">\mathbb{R}^d</math>.
+
A point process is a random set in which the number of points and their locations are both random <span id='citeF-8'></span>[[#cite-8|[8]]]. A point process could occur in any completely separable metric space <math display="inline">\mathcal{S}</math>, such as <math display="inline">d</math>-dimensional Euclidean space <math display="inline">\mathbb{R}^d</math>.
  
 
Definition 1: The point process <math display="inline">Y</math>, with state space <math display="inline">\mathcal{S}</math>, is a measurable mapping from a probability space <math display="inline">(\Omega , \mathcal{F}, \mathbb{P})</math> to the measure space of the point process' realizations equipped with the counting measure, <math display="inline">\left(\mathcal{Y}_{\mathcal{S}}^{\# }, \mathcal{B} \left(\mathcal{Y}_{\mathcal{S}}^{\# } \right), \mu _{\# } \right)</math>.  Where <math display="inline">\mathcal{Y}_{\mathcal{S}}^{\# } = \left\lbrace \mu _{\# } : \mathcal{B}\left(\mathcal{S} \right)\rightarrow \mathbb{N} \; \middle | \; \mu _{\# }(A) < \infty , A \in \mathcal{B} \left(\mathcal{S} \right)\right\rbrace </math> is the space of all finite counting measures on a <math display="inline">\sigma </math>-algebra <math display="inline">\mathcal{B} \left(\mathcal{S} \right)</math> of subsets of <math display="inline">\mathcal{S}</math>, <math display="inline">\mathcal{B} \left(\mathcal{Y}_{\mathcal{S}}^{\# } \right)</math> is a <math display="inline">\sigma </math>-algebra of subsets of the space <math display="inline">\mathcal{Y}_{\mathcal{S}}^{\# }</math> and <math display="inline">\mu _{\# }</math> is the counting measure.
 
Definition 1: The point process <math display="inline">Y</math>, with state space <math display="inline">\mathcal{S}</math>, is a measurable mapping from a probability space <math display="inline">(\Omega , \mathcal{F}, \mathbb{P})</math> to the measure space of the point process' realizations equipped with the counting measure, <math display="inline">\left(\mathcal{Y}_{\mathcal{S}}^{\# }, \mathcal{B} \left(\mathcal{Y}_{\mathcal{S}}^{\# } \right), \mu _{\# } \right)</math>.  Where <math display="inline">\mathcal{Y}_{\mathcal{S}}^{\# } = \left\lbrace \mu _{\# } : \mathcal{B}\left(\mathcal{S} \right)\rightarrow \mathbb{N} \; \middle | \; \mu _{\# }(A) < \infty , A \in \mathcal{B} \left(\mathcal{S} \right)\right\rbrace </math> is the space of all finite counting measures on a <math display="inline">\sigma </math>-algebra <math display="inline">\mathcal{B} \left(\mathcal{S} \right)</math> of subsets of <math display="inline">\mathcal{S}</math>, <math display="inline">\mathcal{B} \left(\mathcal{Y}_{\mathcal{S}}^{\# } \right)</math> is a <math display="inline">\sigma </math>-algebra of subsets of the space <math display="inline">\mathcal{Y}_{\mathcal{S}}^{\# }</math> and <math display="inline">\mu _{\# }</math> is the counting measure.
Line 45: Line 45:
 
The mapping <math display="inline">\varphi _A</math> takes measures <math display="inline">\mu _{\# } \in \mathcal{Y}_{\mathcal{S}}^{\# }</math> and maps them into <math display="inline">\mu _{\# } (A)</math>. As a result, the mapping <math display="inline">\varphi _A</math> in terms of the point process <math display="inline">Y</math> is <math display="inline">\varphi _A : Y(\omega , \cdot ) \mapsto Y(\omega , A)</math>.
 
The mapping <math display="inline">\varphi _A</math> takes measures <math display="inline">\mu _{\# } \in \mathcal{Y}_{\mathcal{S}}^{\# }</math> and maps them into <math display="inline">\mu _{\# } (A)</math>. As a result, the mapping <math display="inline">\varphi _A</math> in terms of the point process <math display="inline">Y</math> is <math display="inline">\varphi _A : Y(\omega , \cdot ) \mapsto Y(\omega , A)</math>.
  
Furthermore, the commutative diagram reveals the equivalences: <math display="inline">Y(\omega , A) = \varphi _A \left(Y(\omega , \cdot ) \right)= Y_A(\omega )</math> and <math display="inline">Y_A^{-1} (B) = Y^{-1} \left(\varphi _A^{-1} (B) \right)</math>, for any <math display="inline">B \in \mathcal{P}\left(\mathbb{N} \right)</math>, where <math display="inline">\mathcal{P}\left(\mathbb{N} \right)</math> denotes the power set of <math display="inline">\mathbb{N}</math>, so <math display="inline">\left(\mathbb{N}, \mathcal{P} \left(\mathbb{N} \right)\right)</math> is a measurable space, [daley2007introduction], [baddeley2007spatial].
+
Furthermore, the commutative diagram reveals the equivalences: <math display="inline">Y(\omega , A) = \varphi _A \left(Y(\omega , \cdot ) \right)= Y_A(\omega )</math> and <math display="inline">Y_A^{-1} (B) = Y^{-1} \left(\varphi _A^{-1} (B) \right)</math>, for any <math display="inline">B \in \mathcal{P}\left(\mathbb{N} \right)</math>, where <math display="inline">\mathcal{P}\left(\mathbb{N} \right)</math> denotes the power set of <math display="inline">\mathbb{N}</math>, so <math display="inline">\left(\mathbb{N}, \mathcal{P} \left(\mathbb{N} \right)\right)</math> is a measurable space, <span id='citeF-9'></span>[[#cite-9|[9]]], <span id='citeF-10'></span>[[#cite-10|[10]]].
  
The following are some fundamental properties of a point process [baddeley2007spatial]:
+
The following are some fundamental properties of a point process <span id='citeF-10'></span>[[#cite-10|[10]]]:
  
 
<ol>
 
<ol>
Line 102: Line 102:
 
For simplification, we will write <math display="inline">Y(\omega , A) = Y(A)</math> in the foregoing. When the point process <math display="inline">Y</math> is observed, we have a point pattern denoted by <math display="inline">\boldsymbol{Y}</math>.
 
For simplification, we will write <math display="inline">Y(\omega , A) = Y(A)</math> in the foregoing. When the point process <math display="inline">Y</math> is observed, we have a point pattern denoted by <math display="inline">\boldsymbol{Y}</math>.
  
In order to generate models, some assumptions about a point process must be made. Stationarity and isotropy are the most important assumptions. The former refers to statistical invariance under translations, whereas the latter refers to statistical invariance under rotations [baddeley2007spatial], [baddeley2015spatial]. Nonetheless, some research on non-stationary and anisotropic processes has been conducted (see [gabriel2022mapping] and [villanueva2008modified]).
+
In order to generate models, some assumptions about a point process must be made. Stationarity and isotropy are the most important assumptions. The former refers to statistical invariance under translations, whereas the latter refers to statistical invariance under rotations <span id='citeF-10'></span>[[#cite-10|[10]]], <span id='citeF-11'></span>[[#cite-11|[11]]]. Nonetheless, some research on non-stationary and anisotropic processes has been conducted (see <span id='citeF-12'></span>[[#cite-12|[12]]] and <span id='citeF-13'></span>[[#cite-13|[13]]]).
  
 
Definition 2: A point process <math display="inline">Y</math> on <math display="inline">\mathcal{S}</math> is stationary if, for any fixed <math display="inline">\boldsymbol{s} \in \mathcal{S}</math>, the distribution of the process <math display="inline">Y + \boldsymbol{s}</math> is identical to the distribution of <math display="inline">Y</math>.
 
Definition 2: A point process <math display="inline">Y</math> on <math display="inline">\mathcal{S}</math> is stationary if, for any fixed <math display="inline">\boldsymbol{s} \in \mathcal{S}</math>, the distribution of the process <math display="inline">Y + \boldsymbol{s}</math> is identical to the distribution of <math display="inline">Y</math>.
Line 108: Line 108:
 
===2.1 Poisson process===
 
===2.1 Poisson process===
  
The general Poisson point process in some space <math display="inline">\mathcal{S}</math> can be defined as follows [baddeley2007spatial], [baddeley2015spatial].
+
The general Poisson point process in some space <math display="inline">\mathcal{S}</math> can be defined as follows <span id='citeF-10'></span>[[#cite-10|[10]]], <span id='citeF-11'></span>[[#cite-11|[11]]].
  
 
<span id='theorem-Poisson process def'></span>Definition 3: The Poisson process <math display="inline">Y</math> on <math display="inline">\mathcal{S}</math> with intensity measure <math display="inline">\Lambda </math> is a point process such that:
 
<span id='theorem-Poisson process def'></span>Definition 3: The Poisson process <math display="inline">Y</math> on <math display="inline">\mathcal{S}</math> with intensity measure <math display="inline">\Lambda </math> is a point process such that:
Line 132: Line 132:
 
|}
 
|}
  
where <math display="inline">\lambda > 0</math> and <math display="inline">\mu _L</math> is the Lebesgue measure, then we have the spatio-temporal homogeneous Poisson point process [gabriel2013stpp].
+
where <math display="inline">\lambda > 0</math> and <math display="inline">\mu _L</math> is the Lebesgue measure, then we have the spatio-temporal homogeneous Poisson point process <span id='citeF-14'></span>[[#cite-14|[14]]].
  
The simplest stochastic mechanism for generating point patterns is the homogeneous Poisson point process. As a data model, it is almost never plausible. Regardless, it is the fundamental reference or benchmark model of a point process [baddeley2008analysing].
+
The simplest stochastic mechanism for generating point patterns is the homogeneous Poisson point process. As a data model, it is almost never plausible. Regardless, it is the fundamental reference or benchmark model of a point process <span id='citeF-8'></span>[[#cite-8|[8]]].
  
The homogeneous Poisson point process is also known as complete spatial (or spatio-temporal) randomness. Additionally, the Poisson point process is stationary and isotropic [baddeley2007spatial].
+
The homogeneous Poisson point process is also known as complete spatial (or spatio-temporal) randomness. Additionally, the Poisson point process is stationary and isotropic <span id='citeF-10'></span>[[#cite-10|[10]]].
  
 
Figure [[#img-3|3]] depicts a spatial point pattern generated by a homogeneous Poisson point process.
 
Figure [[#img-3|3]] depicts a spatial point pattern generated by a homogeneous Poisson point process.
Line 154: Line 154:
 
===3.1 <span id='lb-3.1'></span>Empty-space function F===
 
===3.1 <span id='lb-3.1'></span>Empty-space function F===
  
Let <math display="inline">Y</math> be a stationary point process on <math display="inline">\mathcal{S}</math>. The shortest distance between a given point <math display="inline">\boldsymbol{s} \in \mathcal{S}</math> and the nearest observed point <math display="inline">\boldsymbol{y}_i \in \boldsymbol{Y}</math> is denoted as <math display="inline">\mathrm{d}(\boldsymbol{s}, \boldsymbol{Y}) = \min _{i} \left\lbrace \lVert \boldsymbol{s} - \boldsymbol{y}_i \rVert \right\rbrace </math>. It is called the empty-space distance, spherical contact distance, or simply contact distance [baddeley2008analysing], [baddeley2007spatial], [baddeley2015spatial].
+
Let <math display="inline">Y</math> be a stationary point process on <math display="inline">\mathcal{S}</math>. The shortest distance between a given point <math display="inline">\boldsymbol{s} \in \mathcal{S}</math> and the nearest observed point <math display="inline">\boldsymbol{y}_i \in \boldsymbol{Y}</math> is denoted as <math display="inline">\mathrm{d}(\boldsymbol{s}, \boldsymbol{Y}) = \min _{i} \left\lbrace \lVert \boldsymbol{s} - \boldsymbol{y}_i \rVert \right\rbrace </math>. It is called the empty-space distance, spherical contact distance, or simply contact distance <span id='citeF-8'></span>[[#cite-8|[8]]], <span id='citeF-10'></span>[[#cite-10|[10]]], <span id='citeF-11'></span>[[#cite-11|[11]]].
  
 
<div id='img-4'></div>
 
<div id='img-4'></div>
Line 211: Line 211:
 
The nearest-neighbour distance, denoted by <math display="inline">\mathrm{d}_i = \min _{i \neq j} \left\lbrace \lVert \boldsymbol{y}_i - \boldsymbol{y}_j \rVert \right\rbrace </math>, is the distance between each point <math display="inline">\boldsymbol{y}_i \in \boldsymbol{Y}</math> and its nearest neighbour in the set <math display="inline">\boldsymbol{Y}  
 
The nearest-neighbour distance, denoted by <math display="inline">\mathrm{d}_i = \min _{i \neq j} \left\lbrace \lVert \boldsymbol{y}_i - \boldsymbol{y}_j \rVert \right\rbrace </math>, is the distance between each point <math display="inline">\boldsymbol{y}_i \in \boldsymbol{Y}</math> and its nearest neighbour in the set <math display="inline">\boldsymbol{Y}  
  
\{ \boldsymbol{y}_i \} </math>, [baddeley2008analysing], [baddeley2007spatial]. It is worth noting that <math display="inline">\mathrm{d}_i</math> can also be written as <math display="inline">\mathrm{d}_i = \mathrm{d}\left(\boldsymbol{y}_i, \boldsymbol{Y}  
+
\{ \boldsymbol{y}_i \} </math>, <span id='citeF-8'></span>[[#cite-8|[8]]], <span id='citeF-10'></span>[[#cite-10|[10]]]. It is worth noting that <math display="inline">\mathrm{d}_i</math> can also be written as <math display="inline">\mathrm{d}_i = \mathrm{d}\left(\boldsymbol{y}_i, \boldsymbol{Y}  
  
\{ \boldsymbol{y}_i \} \right)</math>, [baddeley2015spatial]. This distance is depicted in Figure [[#img-5|5]].
+
\{ \boldsymbol{y}_i \} \right)</math>, <span id='citeF-11'></span>[[#cite-11|[11]]]. This distance is depicted in Figure [[#img-5|5]].
  
 
<div id='img-5'></div>
 
<div id='img-5'></div>
Line 249: Line 249:
 
|}
 
|}
  
In this case, we have that <math display="inline">F(r) = G(r)</math>, i.e., under complete spatial randomness, the points of the Poisson process are independent of each other, so conditioning does not affect them. Therefore, <math display="inline">F</math> is equivalent to <math display="inline">G</math>, [baddeley2008analysing].
+
In this case, we have that <math display="inline">F(r) = G(r)</math>, i.e., under complete spatial randomness, the points of the Poisson process are independent of each other, so conditioning does not affect them. Therefore, <math display="inline">F</math> is equivalent to <math display="inline">G</math>, <span id='citeF-8'></span>[[#cite-8|[8]]].
  
 
===3.3 Intensity===
 
===3.3 Intensity===
  
The intensity function describes the first-order properties of a point process [cressie1991statistics], [diggle2013statistical].
+
The intensity function describes the first-order properties of a point process <span id='citeF-15'></span>[[#cite-15|[15]]], <span id='citeF-16'></span>[[#cite-16|[16]]].
  
The average number of points per spatial (or spatio-temporal) unit defines the intensity of a point process. In this regard, intensity is analogous to the expected value of a random variable [baddeley2007spatial].
+
The average number of points per spatial (or spatio-temporal) unit defines the intensity of a point process. In this regard, intensity is analogous to the expected value of a random variable <span id='citeF-10'></span>[[#cite-10|[10]]].
  
 
Similarly, we can investigate the analogue of a point process' variance or covariance throughout the second-order properties.
 
Similarly, we can investigate the analogue of a point process' variance or covariance throughout the second-order properties.
Line 285: Line 285:
 
|}
 
|}
  
for some function <math display="inline">\lambda </math> and any <math display="inline">A \subset \mathbb{R}^d</math>. Then <math display="inline">\lambda </math> is called the intensity function of <math display="inline">Y</math> [baddeley2007spatial]. If <math display="inline">\lambda </math> is constant, then <math display="inline">Y</math> is said to be homogeneous, otherwise is said to be inhomogeneous [moller2003statistical]. Likewise, if the intensity function exists, we can interpret it as follows:
+
for some function <math display="inline">\lambda </math> and any <math display="inline">A \subset \mathbb{R}^d</math>. Then <math display="inline">\lambda </math> is called the intensity function of <math display="inline">Y</math> <span id='citeF-10'></span>[[#cite-10|[10]]]. If <math display="inline">\lambda </math> is constant, then <math display="inline">Y</math> is said to be homogeneous, otherwise is said to be inhomogeneous <span id='citeF-17'></span>[[#cite-17|[17]]]. Likewise, if the intensity function exists, we can interpret it as follows:
  
 
{| class="formulaSCP" style="width: 100%; text-align: left;"  
 
{| class="formulaSCP" style="width: 100%; text-align: left;"  
Line 296: Line 296:
 
|}
 
|}
  
The <math display="inline">K</math> function and pair correlation are both second-moment properties, so the second-order intensity must be defined [diggle2013statistical].
+
The <math display="inline">K</math> function and pair correlation are both second-moment properties, so the second-order intensity must be defined <span id='citeF-16'></span>[[#cite-16|[16]]].
  
 
Definition 7: Let <math display="inline">Y</math> be a point process on <math display="inline">\mathcal{S}</math>. The second-order intensity is defined as
 
Definition 7: Let <math display="inline">Y</math> be a point process on <math display="inline">\mathcal{S}</math>. The second-order intensity is defined as
Line 313: Line 313:
 
===3.4 <span id='lb-3.4'></span>K function===
 
===3.4 <span id='lb-3.4'></span>K function===
  
The <math display="inline">K</math> function counts the number of locations within a certain radius of a given point (see Figure [[#img-6|6]]), [baddeley2015spatial], [bivand2008applied]. Ripley defined it in [ripley1977modelling]. We present the following definition [baddeley2008analysing], [diggle2013statistical].
+
The <math display="inline">K</math> function counts the number of locations within a certain radius of a given point (see Figure [[#img-6|6]]), <span id='citeF-11'></span>[[#cite-11|[11]]], <span id='citeF-18'></span>[[#cite-18|[18]]]. Ripley defined it in <span id='citeF-19'></span>[[#cite-19|[19]]]. We present the following definition <span id='citeF-8'></span>[[#cite-8|[8]]], <span id='citeF-16'></span>[[#cite-16|[16]]].
  
 
Definition 8: Let <math display="inline">Y</math> be a stationary and isotropic point process on <math display="inline">\mathcal{S}</math> with intensity <math display="inline">\lambda </math>. The <math display="inline">K</math> function is defined as
 
Definition 8: Let <math display="inline">Y</math> be a stationary and isotropic point process on <math display="inline">\mathcal{S}</math> with intensity <math display="inline">\lambda </math>. The <math display="inline">K</math> function is defined as
Line 338: Line 338:
 
|}
 
|}
  
If <math display="inline">\mathcal{S} = \mathbb{R}^d</math> and the point process <math display="inline">Y</math> is assumed to be stationary, then hold <math display="inline">\lambda _2 \left(\boldsymbol{s}, \boldsymbol{u} \right)= \lambda _2 \left(\boldsymbol{s} - \boldsymbol{u} \right)</math>. Also, if <math display="inline">Y</math> is isotropic, hence <math display="inline">\lambda _2(\boldsymbol{s} - \boldsymbol{u}) = \lambda _2(r)</math>, where <math display="inline">r = \lVert \boldsymbol{s} - \boldsymbol{u} \rVert </math>. These conditions implies that [cressie1991statistics], [diggle2013statistical],
+
If <math display="inline">\mathcal{S} = \mathbb{R}^d</math> and the point process <math display="inline">Y</math> is assumed to be stationary, then hold <math display="inline">\lambda _2 \left(\boldsymbol{s}, \boldsymbol{u} \right)= \lambda _2 \left(\boldsymbol{s} - \boldsymbol{u} \right)</math>. Also, if <math display="inline">Y</math> is isotropic, hence <math display="inline">\lambda _2(\boldsymbol{s} - \boldsymbol{u}) = \lambda _2(r)</math>, where <math display="inline">r = \lVert \boldsymbol{s} - \boldsymbol{u} \rVert </math>. These conditions implies that <span id='citeF-15'></span>[[#cite-15|[15]]], <span id='citeF-16'></span>[[#cite-16|[16]]],
  
 
<span id="eq-3"></span>
 
<span id="eq-3"></span>
Line 353: Line 353:
 
The above expression provides a relationship between the <math display="inline">K</math> function and the second-order intensity under the assumptions of stationarity and isotropy.
 
The above expression provides a relationship between the <math display="inline">K</math> function and the second-order intensity under the assumptions of stationarity and isotropy.
  
If <math display="inline">Y</math> is a homogeneous Poisson process on <math display="inline">\mathbb{R}^d</math>, then the <math display="inline">K</math> function is [baddeley2007spatial],
+
If <math display="inline">Y</math> is a homogeneous Poisson process on <math display="inline">\mathbb{R}^d</math>, then the <math display="inline">K</math> function is <span id='citeF-10'></span>[[#cite-10|[10]]],
  
 
{| class="formulaSCP" style="width: 100%; text-align: left;"  
 
{| class="formulaSCP" style="width: 100%; text-align: left;"  
Line 366: Line 366:
 
===3.5 <span id='lb-3.5'></span>Pair correlation function g===
 
===3.5 <span id='lb-3.5'></span>Pair correlation function g===
  
In general, the pair correlation function is a quotient of probabilities; that is, the probability of observing a pair of points separated by a given distance is divided by the same probability, assuming a Poisson point process [baddeley2008analysing]. In the strictest sense, it is neither a distribution nor a correlation function [diggle2013statistical].
+
In general, the pair correlation function is a quotient of probabilities; that is, the probability of observing a pair of points separated by a given distance is divided by the same probability, assuming a Poisson point process <span id='citeF-8'></span>[[#cite-8|[8]]]. In the strictest sense, it is neither a distribution nor a correlation function <span id='citeF-16'></span>[[#cite-16|[16]]].
  
Some authors consider the pair correlation function to be the most informative second-order property because it provides information more simply than, say, the <math display="inline">K</math> function [illian2008statistical]. We present the following definition [baddeley2007spatial], [moller2003statistical].
+
Some authors consider the pair correlation function to be the most informative second-order property because it provides information more simply than, say, the <math display="inline">K</math> function <span id='citeF-20'></span>[[#cite-20|[20]]]. We present the following definition <span id='citeF-10'></span>[[#cite-10|[10]]], <span id='citeF-17'></span>[[#cite-17|[17]]].
  
 
Definition 9: Let <math display="inline">Y</math> be a point process on <math display="inline">\mathcal{S}</math> with intensity function <math display="inline">\lambda </math> and second-moment density <math display="inline">g_2</math>. The pair correlation function <math display="inline">g</math> is defined as
 
Definition 9: Let <math display="inline">Y</math> be a point process on <math display="inline">\mathcal{S}</math> with intensity function <math display="inline">\lambda </math> and second-moment density <math display="inline">g_2</math>. The pair correlation function <math display="inline">g</math> is defined as
Line 394: Line 394:
 
for any compact set <math display="inline">C \subset \mathcal{S} \times \mathcal{S}</math>, where <math display="inline">\nu </math> is a suitable measure on <math display="inline">\left(\mathcal{S}, \mathcal{B}\left(\mathcal{S} \right)\right)</math> (e.g., if <math display="inline">\mathcal{S} = \mathbb{R}^d</math>, so <math display="inline">\nu = \mu _L</math>), and <math display="inline">\nu _{[2]}(A_1 \times A_2) = \mathbb{E}\left(Y(A_1) \; Y(A_2) \right)- \mathbb{E}\left(Y\left(A_1 \cap A_2 \right)\right)</math>, with <math display="inline">A_1, A_2 \subset \mathcal{S}</math>, is the second factorial moment measure of <math display="inline">Y</math>.
 
for any compact set <math display="inline">C \subset \mathcal{S} \times \mathcal{S}</math>, where <math display="inline">\nu </math> is a suitable measure on <math display="inline">\left(\mathcal{S}, \mathcal{B}\left(\mathcal{S} \right)\right)</math> (e.g., if <math display="inline">\mathcal{S} = \mathbb{R}^d</math>, so <math display="inline">\nu = \mu _L</math>), and <math display="inline">\nu _{[2]}(A_1 \times A_2) = \mathbb{E}\left(Y(A_1) \; Y(A_2) \right)- \mathbb{E}\left(Y\left(A_1 \cap A_2 \right)\right)</math>, with <math display="inline">A_1, A_2 \subset \mathcal{S}</math>, is the second factorial moment measure of <math display="inline">Y</math>.
  
If <math display="inline">Y</math> is stationary and isotropic, it follows from ([[#eq-3|3]]) that [diggle2013statistical], [illian2008statistical],
+
If <math display="inline">Y</math> is stationary and isotropic, it follows from ([[#eq-3|3]]) that <span id='citeF-16'></span>[[#cite-16|[16]]], <span id='citeF-20'></span>[[#cite-20|[20]]],
  
 
{| class="formulaSCP" style="width: 100%; text-align: left;"  
 
{| class="formulaSCP" style="width: 100%; text-align: left;"  
Line 405: Line 405:
 
|}
 
|}
  
We can define <math display="inline">g</math> graphically by taking two concentric circles with radius <math display="inline">r</math> and <math display="inline">r + \Delta r </math>, where <math display="inline">\Delta r</math> is a small increment, and counting the points that fall within the ring (see Figure [[#img-7|7]]), [baddeley2015spatial].
+
We can define <math display="inline">g</math> graphically by taking two concentric circles with radius <math display="inline">r</math> and <math display="inline">r + \Delta r </math>, where <math display="inline">\Delta r</math> is a small increment, and counting the points that fall within the ring (see Figure [[#img-7|7]]), <span id='citeF-11'></span>[[#cite-11|[11]]].
  
 
<div id='img-7'></div>
 
<div id='img-7'></div>
Line 461: Line 461:
 
Figure [[#img-8|8]] shows the spatial point pattern. The wildfires do not appear to be the result of a Poisson process.
 
Figure [[#img-8|8]] shows the spatial point pattern. The wildfires do not appear to be the result of a Poisson process.
  
There are multiple ways to prove if a point pattern comes from a Poisson point process (see [baddeley2015spatial]).
+
There are multiple ways to prove if a point pattern comes from a Poisson point process (see <span id='citeF-11'></span>[[#cite-11|[11]]]).
  
 
<div id='img-8'></div>
 
<div id='img-8'></div>
Line 471: Line 471:
 
|}
 
|}
  
The simulation envelopes provide a formal way to decide if the spatial pattern comes from the Poisson process. It is equivalent to performing a hypothesis test. The simulation envelopes are obtained under the assumption of a Poisson process [baddeley2008analysing], [baddeley2015spatial], [bivand2008applied].
+
The simulation envelopes provide a formal way to decide if the spatial pattern comes from the Poisson process. It is equivalent to performing a hypothesis test. The simulation envelopes are obtained under the assumption of a Poisson process <span id='citeF-8'></span>[[#cite-8|[8]]], <span id='citeF-11'></span>[[#cite-11|[11]]], <span id='citeF-18'></span>[[#cite-18|[18]]].
  
 
If the empirical curve falls within the envelope, we can conclude that the point pattern comes from a Poisson process.
 
If the empirical curve falls within the envelope, we can conclude that the point pattern comes from a Poisson process.
  
Figures [[#img-9|9]] and [[#img-10|10]] show the estimated <math display="inline">F</math> and <math display="inline">G</math> functions, as well as the theoretical functions for the Poisson process and simulation envelopes. For this, we use the <code>R</code> package <code>spatstat</code> [spatstat2005].
+
Figures [[#img-9|9]] and [[#img-10|10]] show the estimated <math display="inline">F</math> and <math display="inline">G</math> functions, as well as the theoretical functions for the Poisson process and simulation envelopes. For this, we use the <code>R</code> package <code>spatstat</code> <span id='citeF-21'></span>[[#cite-21|[21]]].
  
 
Clearly, the spatial point pattern does not follow the Poisson model.
 
Clearly, the spatial point pattern does not follow the Poisson model.
Line 495: Line 495:
 
|}
 
|}
  
In Figure [[#img-9|9]] note that <math display="inline">\widehat{F}_{\hbox{obs}}(r) < F_{\hbox{theo}}(r)</math>, i.e., the point pattern has longer empty-space distances than a Poisson process. This suggests a clustered point pattern [baddeley2008analysing]. While in Figure [[#img-10|10]] we observe that <math display="inline">\widehat{G}_{\hbox{obs}}(r) > G_{\hbox{theo}}(r)</math>, i.e., the point pattern has shorter nearest-neighbour distances than a Poisson model, indicating a clustered pattern [baddeley2008analysing].
+
In Figure [[#img-9|9]] note that <math display="inline">\widehat{F}_{\hbox{obs}}(r) < F_{\hbox{theo}}(r)</math>, i.e., the point pattern has longer empty-space distances than a Poisson process. This suggests a clustered point pattern <span id='citeF-8'></span>[[#cite-8|[8]]]. While in Figure [[#img-10|10]] we observe that <math display="inline">\widehat{G}_{\hbox{obs}}(r) > G_{\hbox{theo}}(r)</math>, i.e., the point pattern has shorter nearest-neighbour distances than a Poisson model, indicating a clustered pattern <span id='citeF-8'></span>[[#cite-8|[8]]].
  
 
Figure [[#img-11|11]] depicts the estimated intensity using a Gaussian kernel with bandwidth of 17 km. It can be used to locate wildfire hotspots.
 
Figure [[#img-11|11]] depicts the estimated intensity using a Gaussian kernel with bandwidth of 17 km. It can be used to locate wildfire hotspots.
Line 519: Line 519:
 
|}
 
|}
  
The augmented Dickey-Fuller test is used to prove that the time series is seasonal (see details in [timeseries2016davis]). This test is included in the <code>R</code> package <code>tseries</code> [tseries2022], where the null hypothesis is that the time series is non-stationary, against the alternative hypothesis that the time series is stationary.
+
The augmented Dickey-Fuller test is used to prove that the time series is seasonal (see details in <span id='citeF-22'></span>[[#cite-22|[22]]]). This test is included in the <code>R</code> package <code>tseries</code> <span id='citeF-23'></span>[[#cite-23|[23]]], where the null hypothesis is that the time series is non-stationary, against the alternative hypothesis that the time series is stationary.
  
 
Table [[#table-1|1]] displays the results of the augmented Dickey-Fuller test for the wildfire time series, with a significance level of <math display="inline">\alpha = 0.05</math>.
 
Table [[#table-1|1]] displays the results of the augmented Dickey-Fuller test for the wildfire time series, with a significance level of <math display="inline">\alpha = 0.05</math>.
Line 537: Line 537:
 
===4.3 Spatio-temporal analysis===
 
===4.3 Spatio-temporal analysis===
  
To demonstrate clustering or regularity in a spatio-temporal point pattern, the space-time inhomogeneous <math display="inline">K</math> function (STIK) and space-time pair correlation function (STPC) can be used [gabriel2013stpp].
+
To demonstrate clustering or regularity in a spatio-temporal point pattern, the space-time inhomogeneous <math display="inline">K</math> function (STIK) and space-time pair correlation function (STPC) can be used <span id='citeF-14'></span>[[#cite-14|[14]]].
  
On the assumption that the point process <math display="inline">Y</math> on <math display="inline">\mathbb{R}^d</math> is second-order stationary, that is, their first-order and second-order properties are invariant under translations, the <math display="inline">K</math> function is [gabriel2009second],
+
On the assumption that the point process <math display="inline">Y</math> on <math display="inline">\mathbb{R}^d</math> is second-order stationary, that is, their first-order and second-order properties are invariant under translations, the <math display="inline">K</math> function is <span id='citeF-24'></span>[[#cite-24|[24]]],
  
 
<span id="eq-4"></span>
 
<span id="eq-4"></span>
Line 552: Line 552:
 
|}
 
|}
  
In addition, a spatio-temporal point process is second-order intensity reweighted stationary and isotropic if its intensity function is bounded away from zero, and its <math display="inline">g</math> function is solely determined by <math display="inline">(u, v)</math>, where <math display="inline">u = \lVert \boldsymbol{s}_i - \boldsymbol{s}_j \rVert </math> and <math display="inline">v = \left |t_i - t_j \right |</math>, with <math display="inline">\boldsymbol{s}_i, \boldsymbol{s}_j \in \mathbb{R}^2</math>, <math display="inline">t_i, t_j \in \mathbb{R}_+</math>, [gabriel2013stpp].
+
In addition, a spatio-temporal point process is second-order intensity reweighted stationary and isotropic if its intensity function is bounded away from zero, and its <math display="inline">g</math> function is solely determined by <math display="inline">(u, v)</math>, where <math display="inline">u = \lVert \boldsymbol{s}_i - \boldsymbol{s}_j \rVert </math> and <math display="inline">v = \left |t_i - t_j \right |</math>, with <math display="inline">\boldsymbol{s}_i, \boldsymbol{s}_j \in \mathbb{R}^2</math>, <math display="inline">t_i, t_j \in \mathbb{R}_+</math>, <span id='citeF-14'></span>[[#cite-14|[14]]].
  
Let <math display="inline">Y</math> be a second-order intensity reweighted stationary and isotropic spatio-temporal point process with intensity <math display="inline">\lambda </math>; then, from ([[#eq-4|4]]), its STIK function is, [gabriel2013stpp], [gabriel2009second],
+
Let <math display="inline">Y</math> be a second-order intensity reweighted stationary and isotropic spatio-temporal point process with intensity <math display="inline">\lambda </math>; then, from ([[#eq-4|4]]), its STIK function is, <span id='citeF-14'></span>[[#cite-14|[14]]], <span id='citeF-24'></span>[[#cite-24|[24]]],
  
 
{| class="formulaSCP" style="width: 100%; text-align: left;"  
 
{| class="formulaSCP" style="width: 100%; text-align: left;"  
Line 580: Line 580:
 
Figures [[#img-13|13]] and [[#img-14|14]] show the estimated STIK function in contour and perspective plots, respectively.
 
Figures [[#img-13|13]] and [[#img-14|14]] show the estimated STIK function in contour and perspective plots, respectively.
  
The values <math display="inline">\widehat{K}_{ST}(u, v) - \pi u^2 v</math> were plotted in order to use them as a measure of spatiotemporal aggregation or regularity. According to [gabriel2009second], <math display="inline">\widehat{K}_{ST}(u, v) - \pi u^2 v < 0</math> indicates regularity.
+
The values <math display="inline">\widehat{K}_{ST}(u, v) - \pi u^2 v</math> were plotted in order to use them as a measure of spatiotemporal aggregation or regularity. According to <span id='citeF-24'></span>[[#cite-24|[24]]], <math display="inline">\widehat{K}_{ST}(u, v) - \pi u^2 v < 0</math> indicates regularity.
  
 
<div id='img-13'></div>
 
<div id='img-13'></div>
Line 600: Line 600:
 
Figures [[#img-15|15]] and [[#img-16|16]] illustrate estimated STPC function in contour and perspective plots, respectively.
 
Figures [[#img-15|15]] and [[#img-16|16]] illustrate estimated STPC function in contour and perspective plots, respectively.
  
For a spatio-temporal Poisson point process, <math display="inline">g(u, v) = 1</math>. This reference can be used to determine how much more or less likely it is that a pair of events will occur at specific locations than in a Poisson process of equal intensity [gabriel2013stpp].
+
For a spatio-temporal Poisson point process, <math display="inline">g(u, v) = 1</math>. This reference can be used to determine how much more or less likely it is that a pair of events will occur at specific locations than in a Poisson process of equal intensity <span id='citeF-14'></span>[[#cite-14|[14]]].
  
 
<div id='img-15'></div>
 
<div id='img-15'></div>
Line 628: Line 628:
 
Finally, as shown in Figures [[#img-13|13]], [[#img-14|14]], [[#img-15|15]], and [[#img-16|16]], we demonstrate that the spatio-temporal behavior is regular. This means that wildfires tend to occur in the same season and in the same areas each year. This regular spatio-temporal behavior suggests that the underlying point process is predictable in some ways.
 
Finally, as shown in Figures [[#img-13|13]], [[#img-14|14]], [[#img-15|15]], and [[#img-16|16]], we demonstrate that the spatio-temporal behavior is regular. This means that wildfires tend to occur in the same season and in the same areas each year. This regular spatio-temporal behavior suggests that the underlying point process is predictable in some ways.
  
This research could be expanded by looking into models such as spatio-temporal log-Gaussian Cox processes [taylor2013lgcp], which can be used to make spatio-temporal predictions.
+
This research could be expanded by looking into models such as spatio-temporal log-Gaussian Cox processes <span id='citeF-25'></span>[[#cite-25|[25]]], which can be used to make spatio-temporal predictions.
  
 
==Acknowledgments==
 
==Acknowledgments==
Line 636: Line 636:
 
==Appendix==
 
==Appendix==
  
This analysis was performed using the statistical programming language <code>R</code> [r2022]. The developed code is available in the repository:
+
This analysis was performed using the statistical programming language <code>R</code> <span id='citeF-26'></span>[[#cite-26|[26]]]. The developed code is available in the repository:
  
 
https://github.com/LuisMunive/Spatio-temporal-point-process-analysis-of-Mexico-State-wildfires.
 
https://github.com/LuisMunive/Spatio-temporal-point-process-analysis-of-Mexico-State-wildfires.
  
 
===BIBLIOGRAPHY===
 
===BIBLIOGRAPHY===
 +
 +
<div id="cite-1"></div>
 +
'''[[#citeF-1|[1]]]''' Conafor. (2018) "Historical yearly series of wildfires 2010-2018 period" Extracted from: https://datos.gob.mx/busca/dataset/incendios-forestales/resource/5720e224-3d0c-4eed-ac65-ea7aac7d72e8
 +
 +
<div id="cite-2"></div>
 +
'''[[#citeF-2|[2]]]''' Rodríguez-Trejo, Dante Arturo. (2014) "Incendios de Vegetación: su ecología, manejo e historia vol. 1", Volume 1
 +
 +
<div id="cite-3"></div>
 +
'''[[#citeF-3|[3]]]''' Rodríguez-Trejo, Dante Arturo. (2015) "Incendios de Vegetación: su ecología, manejo e historia vol. 2", Volume 2
 +
 +
<div id="cite-4"></div>
 +
'''[[#citeF-4|[4]]]''' Carrillo-García, Rosa Laura and Rodríguez-Trejo, Dante Arturo and Tchikoué, Hubert and Monterroso-Rivas, Alejandro Ismael and Santillan-Pérez, Javier. (2012) "Análisis espacial de peligro de incendios forestales en Puebla, México", Volume 37. Asociación Interciencia. Interciencia 9 678&#8211;683
 +
 +
<div id="cite-5"></div>
 +
'''[[#citeF-5|[5]]]''' Cisneros-González, Darío and Pérez-Verdín, Gustavo and Pompa-García, Marín and Rodríguez-Trejo, Dante Arturo and Zúñiga-Vásquez, José Manuel. (2017) "Spatial modeling of forest fires in Mexico: an integration of two data sources", Volume 38. Bosque 3 563&#8211;574
 +
 +
<div id="cite-6"></div>
 +
'''[[#citeF-6|[6]]]''' Pompa-García, Marín and Camarero J., Julio and Rodríguez-Trejo, Dante Arturo and Vega-Nieva, Daniel José. (2018) "Drought and spatiotemporal variability of forest fires across Mexico", Volume 28. Springer Science & Business Media. Chinese Geographical Science 1 25&#8211;37
 +
 +
<div id="cite-7"></div>
 +
'''[[#citeF-7|[7]]]''' Munive-Hernández, Luis Ramón. (2021) "Predicción espacial de incendios forestales usando aprendizaje máquina"
 +
 +
<div id="cite-8"></div>
 +
'''[[#citeF-8|[8]]]''' Baddeley, Adrian and others. (2008) "Analysing spatial point patterns in R", Volume 3. Workshop notes version
 +
 +
<div id="cite-9"></div>
 +
'''[[#citeF-9|[9]]]''' Daley, Daryl J and Vere-Jones, David. (2007) "An introduction to the theory of point processes: volume II: general theory and structure". Springer Science & Business Media
 +
 +
<div id="cite-10"></div>
 +
'''[[#citeF-10|[10]]]''' Baddeley, Adrian and Bárány, Imre and Schneider, Rolf. (2007) "Spatial point processes and their applications". Springer Science & Business Media. Stochastic Geometry: Lectures Given at the CIME Summer School Held in Martina Franca, Italy, September 13&#8211;18, 2004 1&#8211;75
 +
 +
<div id="cite-11"></div>
 +
'''[[#citeF-11|[11]]]''' Baddeley, Adrian and Rubak, Ege and Turner, Rolf. (2015) "Spatial point patterns: methodology and applications with R". CRC Press
 +
 +
<div id="cite-12"></div>
 +
'''[[#citeF-12|[12]]]''' Gabriel, Edith and Rodriguez-Cortes, Francisco and Coville, Jérome and Mateu, Jorge and Chadoeuf, Joël. (2022) "Mapping the intensity function of a non-stationary point process in unobserved areas". Springer Science & Business Media. Stochastic Environmental Research and Risk Assessment 1&#8211;17
 +
 +
<div id="cite-13"></div>
 +
'''[[#citeF-13|[13]]]''' Villanueva-Morales, Antonio. (2008) "Modified pseudo-likelihood estimation for Markov random fields with Winsorized Poisson conditional distributions". Digital Repository@ Iowa State University, http://lib. dr. iastate. edu/
 +
 +
<div id="cite-14"></div>
 +
'''[[#citeF-14|[14]]]''' Gabriel, Edith and Rowlingson, Barry S and Diggle, Peter J. (2013) "stpp: an R package for plotting, simulating and analyzing Spatio-Temporal Point Patterns", Volume 53. Journal of Statistical Software 1&#8211;29
 +
 +
<div id="cite-15"></div>
 +
'''[[#citeF-15|[15]]]''' Cressie, Noel. (1991) "Statistics for spatial data". John Wiley & Sons
 +
 +
<div id="cite-16"></div>
 +
'''[[#citeF-16|[16]]]''' Diggle, Peter J. (2013) "Statistical analysis of spatial and spatio-temporal point patterns". CRC Press
 +
 +
<div id="cite-17"></div>
 +
'''[[#citeF-17|[17]]]''' Mller, Jesper and Waagepetersen, Rasmus Plenge. (2003) "Statistical inference and simulation for spatial point processes". Chapman and Hall/CRC
 +
 +
<div id="cite-18"></div>
 +
'''[[#citeF-18|[18]]]''' Bivand, Roger S and Pebesma, Edzer J and Gómez-Rubio, Virgilio and Pebesma, Edzer Jan. (2008) "Applied spatial data analysis with R", Volume 747248717. Springer Science & Business Media
 +
 +
<div id="cite-19"></div>
 +
'''[[#citeF-19|[19]]]''' Ripley, Brian D. (1977) "Modelling spatial patterns", Volume 39. Wiley Online Library. Journal of the Royal Statistical Society: Series B (Methodological) 2 172&#8211;192
 +
 +
<div id="cite-20"></div>
 +
'''[[#citeF-20|[20]]]''' Illian, Janine and Penttinen, Antti and Stoyan, Helga and Stoyan, Dietrich. (2008) "Statistical analysis and modelling of spatial point patterns". John Wiley & Sons
 +
 +
<div id="cite-21"></div>
 +
'''[[#citeF-21|[21]]]''' Adrian Baddeley and Rolf Turner. (2005) "spatstat: An R Package for Analyzing Spatial Point Patterns", Volume 12. Journal of Statistical Software 6 1&#8211;42
 +
 +
<div id="cite-22"></div>
 +
'''[[#citeF-22|[22]]]''' Peter J. Brockwell and Richard A. Davis. (2016) "Introduction to Time Series and Forecasting". Springer Science & Business Media. Springer Texts in Statistics
 +
 +
<div id="cite-23"></div>
 +
'''[[#citeF-23|[23]]]''' Adrian Trapletti and Kurt Hornik. (2022) "tseries: Time Series Analysis and Computational Finance"
 +
 +
<div id="cite-24"></div>
 +
'''[[#citeF-24|[24]]]''' Gabriel, Edith and Diggle, Peter J. (2009) "Second-order analysis of inhomogeneous spatio-temporal point process data", Volume 63. Wiley Online Library. Statistica Neerlandica 1 43&#8211;51
 +
 +
<div id="cite-25"></div>
 +
'''[[#citeF-25|[25]]]''' Taylor, Benjamin M and Davies, Tilman M and Rowlingson, Barry S and Diggle, Peter J. (2013) "lgcp: an R package for inference with spatial and spatio-temporal log-Gaussian Cox processes", Volume 52. Journal of Statistical Software 1&#8211;40
 +
 +
<div id="cite-26"></div>
 +
'''[[#citeF-26|[26]]]''' R Core Team. (2022) "R: A Language and Environment for Statistical Computing". R Foundation for Statistical Computing

Revision as of 04:15, 27 November 2022


Abstract

Wildfires are an example of a phenomenon that can be investigated using point process theory. We analyze public data from the National Forestry Commission. It consists of wildfire records, specifically their coordinates and dates of occurrence in Mexico State from 2010 to 2018. The spatial component was examined, and we found that wildfires tend to cluster. Afterwards, a time series analysis was conducted. This shows that the data comes from a stationary stochastic process. Finally, some spatio-temporal features that demonstrate the point process' regular behavior in space and time were investigated. This research could be a reference to describe wildfire behavior in a specific space and time.

keywords Environmental statistics, point processes, spatio-temporal statistics, wildfires.

1 Introduction

Wildfires are complex phenomena with serious socio-environmental consequences, including economic and biodiversity losses, among others. Anthropogenic factors are responsible for nearly all wildfires in Mexico State, according to data from the National Forestry Commission (Conafor, its Spanish acronym) [1] (see Figure 1).

Mexico State wildfire causes (2010-2018).
Figure 1: Mexico State wildfire causes (2010-2018).

There is plenty of specialized literature available on wildfires (see [2] and [3]). The authors of [4] use a logistic regression model to assess the risk of wildfire in Puebla, Mexico, taking into account land cover, meteorological, topographic, and social variables. Using two different data sources: Conafor's open data and Modis' (Moderate Resolution Imaging Spectroradiometer) data, the authors of [5] show that wildfire spatial patterns in Mexico tend to cluster. The spatial and temporal relationships between Conafor's wildfire records from 2005 to 2015 and the Standardized Precipitation-Evapotranspiration Index (SPEI) were investigated [6]. Machine learning techniques were used to determine the wildfire propensity in Mexico using Conafor's open data [7].

The spatio-temporal behavior of wildfires could be critical for improving fire management strategies. The point processes approach can be used to model random events in time, space, or space-time, such as wildfires. In this study, we used point processes theory to describe the spatio-temporal behavior of wildfires in Mexico State from 2010 to 2018.

2 Point processes basic theory

A point process is a random set in which the number of points and their locations are both random [8]. A point process could occur in any completely separable metric space , such as -dimensional Euclidean space .

Definition 1: The point process , with state space , is a measurable mapping from a probability space to the measure space of the point process' realizations equipped with the counting measure, . Where Failed to parse (unknown function "\middle"): {\textstyle \mathcal{Y}_{\mathcal{S}}^{\# } = \left\lbrace \mu _{\# } : \mathcal{B}\left(\mathcal{S} \right)\rightarrow \mathbb{N} \; \middle | \; \mu _{\# }(A) < \infty , A \in \mathcal{B} \left(\mathcal{S} \right)\right\rbrace }

is the space of all finite counting measures on a -algebra  of subsets of ,  is a -algebra of subsets of the space  and  is the counting measure.

The commutative diagram in Figure 2 illustrates the point process definition.

Figure 2: Commutative diagram of point process definition.

The mapping takes measures and maps them into . As a result, the mapping in terms of the point process is .

Furthermore, the commutative diagram reveals the equivalences: and , for any , where denotes the power set of , so is a measurable space, [9], [10].

The following are some fundamental properties of a point process [10]:

  1. Is additive, this is
  2. whenever , and of course

  3. Is locally finite
  4. for any .

  5. Is simple
  6. for any point .

For simplification, we will write in the foregoing. When the point process is observed, we have a point pattern denoted by .

In order to generate models, some assumptions about a point process must be made. Stationarity and isotropy are the most important assumptions. The former refers to statistical invariance under translations, whereas the latter refers to statistical invariance under rotations [10], [11]. Nonetheless, some research on non-stationary and anisotropic processes has been conducted (see [12] and [13]).

Definition 2: A point process on is stationary if, for any fixed , the distribution of the process is identical to the distribution of .

2.1 Poisson process

The general Poisson point process in some space can be defined as follows [10], [11].

Definition 3: The Poisson process on with intensity measure is a point process such that:

  1. For every compact set , the random variable .
  2. If are disjoint compact sets, then are independent random variables.

Where the intensity measure is defined, for any , as .

If the state space is and the expected value of the point process in , with and , can be written as follows:

where and is the Lebesgue measure, then we have the spatio-temporal homogeneous Poisson point process [14].

The simplest stochastic mechanism for generating point patterns is the homogeneous Poisson point process. As a data model, it is almost never plausible. Regardless, it is the fundamental reference or benchmark model of a point process [8].

The homogeneous Poisson point process is also known as complete spatial (or spatio-temporal) randomness. Additionally, the Poisson point process is stationary and isotropic [10].

Figure 3 depicts a spatial point pattern generated by a homogeneous Poisson point process.

Simulation of a spatial homogeneous Poisson process.
Figure 3: Simulation of a spatial homogeneous Poisson process.

3 Point pattern's data analysis

Distances between points are a straightforward way to examine a point pattern. The most common statistics used in exploratory analysis of a point pattern are as follows.

3.1 Empty-space function F

Let be a stationary point process on . The shortest distance between a given point and the nearest observed point is denoted as . It is called the empty-space distance, spherical contact distance, or simply contact distance [8], [10], [11].

Empty-space distance illustration.
Figure 4: Empty-space distance illustration.

Note that

(2)

where is the neighborhood of radius centered on .

In other words, as shown in Figure 4, the empty-space distance satisfies the logical equivalence of the biconditional (2), .

Moreover, because is measurable, the event is measurable, implying that the contact distance is a well-defined random element.

Definition 4: Let be a stationary point process on . The empty-space function is the cumulative distribution function of the empty-space distance

If is a homogeneous Poisson process on with intensity , then the empty-space function is

where , denotes the volume of the unitary -ball in and is the usual gamma function.

3.2 Nearest-neighbour function G

The nearest-neighbour distance, denoted by , is the distance between each point and its nearest neighbour in the set , [8], [10]. It is worth noting that can also be written as , [11]. This distance is depicted in Figure 5.

Nearest-neighbour distance illustration.
Figure 5: Nearest-neighbour distance illustration.

Definition 5: Let be a stationary point process on . The nearest-neighbour function is the cumulative distribution function of the nearest-neighbour distance

Failed to parse (syntax error): G(r) = \mathbb{P}\left(\mathrm{d}\left(\boldsymbol{s}, \boldsymbol{Y}
Failed to parse (syntax error): \{ \boldsymbol{s} \} \right)\leq r \; \middle | \; \boldsymbol{s} \in \boldsymbol{Y} \right),

where and is any location in the state space .

If is a homogeneous Poisson process on with intensity , then the nearest-neighbour function is

In this case, we have that , i.e., under complete spatial randomness, the points of the Poisson process are independent of each other, so conditioning does not affect them. Therefore, is equivalent to , [8].

3.3 Intensity

The intensity function describes the first-order properties of a point process [15], [16].

The average number of points per spatial (or spatio-temporal) unit defines the intensity of a point process. In this regard, intensity is analogous to the expected value of a random variable [10].

Similarly, we can investigate the analogue of a point process' variance or covariance throughout the second-order properties.

As we will see in the following, the intensity measure of a point process is clearly a set function, whereas the “instantaneous” intensity function is an atomic function.

Definition 6: Let be a point process on . The first-order intensity is defined as

where is a suitable measure on and defines a infinitesimally small region around .

If is a point process on with intensity measure , it satisfies

for some function and any . Then is called the intensity function of [10]. If is constant, then is said to be homogeneous, otherwise is said to be inhomogeneous [17]. Likewise, if the intensity function exists, we can interpret it as follows:

The function and pair correlation are both second-moment properties, so the second-order intensity must be defined [16].

Definition 7: Let be a point process on . The second-order intensity is defined as

We already have the fundamental elements for defining the following pair of second-order properties.

3.4 K function

The function counts the number of locations within a certain radius of a given point (see Figure 6), [11], [18]. Ripley defined it in [19]. We present the following definition [8], [16].

Definition 8: Let be a stationary and isotropic point process on with intensity . The function is defined as

Failed to parse (syntax error): K(r) = \frac{1}{\lambda } \mathbb{E} \left(Y\left(\boldsymbol{Y} \cap B_r(\boldsymbol{s})
Failed to parse (syntax error): \{ \boldsymbol{s} \} \right)\; \middle | \; \boldsymbol{s} \in \boldsymbol{Y} \right),

where and is any location in .

K function illustration.
Figure 6: function illustration.

If and the point process is assumed to be stationary, then hold . Also, if is isotropic, hence , where . These conditions implies that [15], [16],

(3)

The above expression provides a relationship between the function and the second-order intensity under the assumptions of stationarity and isotropy.

If is a homogeneous Poisson process on , then the function is [10],

3.5 Pair correlation function g

In general, the pair correlation function is a quotient of probabilities; that is, the probability of observing a pair of points separated by a given distance is divided by the same probability, assuming a Poisson point process [8]. In the strictest sense, it is neither a distribution nor a correlation function [16].

Some authors consider the pair correlation function to be the most informative second-order property because it provides information more simply than, say, the function [20]. We present the following definition [10], [17].

Definition 9: Let be a point process on with intensity function and second-moment density . The pair correlation function is defined as

for any , where the second-moment density is such that

for any compact set , where is a suitable measure on (e.g., if , so ), and , with , is the second factorial moment measure of .

If is stationary and isotropic, it follows from (3) that [16], [20],

We can define graphically by taking two concentric circles with radius and , where is a small increment, and counting the points that fall within the ring (see Figure 7), [11].

Pair correlation function g illustration.
Figure 7: Pair correlation function illustration.

If is stationary and isotropic, the expected number of locations in the ring is . Dividing it by the expected value of points assuming a Poisson process, we obtain

g_r(r) = \frac{\left(K(r + r) - K(r) \right)}{ _L\left(B_1(0) \right)\left(\left(r + r\right)^d - r^d \right)}

= \frac{K(r + r) - K(r)}{_L\left(B_1(0) \right)\left(_k=0^d dk r^d - k r^k - r^d \right)}.

All binomial expansion components in the denominator of the second line in (3.5) lose significance except for , so

Taking the following limit, we get

If is a homogeneous Poisson process on , then the pair correlation function is .

4 Wildfires' data analysis

Conafor data are licensed for free use (see details in https://datos.gob.mx/libreusomx). It includes wildfire geographical coordinates and dates, as well as variables like forest type affected and severity, among other things.

4.1 Spatial analysis

This spatial analysis focuses on the and functions to determine whether the wildfire spatial point pattern is aggregated, complete spatial random, or regular. In addition, the intensity was estimated to support the evidence about point pattern behavior.

Plotting the spatial point pattern is a good starting point for understanding its behavior.

Figure 8 shows the spatial point pattern. The wildfires do not appear to be the result of a Poisson process.

There are multiple ways to prove if a point pattern comes from a Poisson point process (see [11]).

Spatial point pattern of Mexico State wildfires.
Figure 8: Spatial point pattern of Mexico State wildfires.

The simulation envelopes provide a formal way to decide if the spatial pattern comes from the Poisson process. It is equivalent to performing a hypothesis test. The simulation envelopes are obtained under the assumption of a Poisson process [8], [11], [18].

If the empirical curve falls within the envelope, we can conclude that the point pattern comes from a Poisson process.

Figures 9 and 10 show the estimated and functions, as well as the theoretical functions for the Poisson process and simulation envelopes. For this, we use the R package spatstat [21].

Clearly, the spatial point pattern does not follow the Poisson model.

Estimated F function and simulation envelopes.
Figure 9: Estimated function and simulation envelopes.
Estimated G function and simulation envelopes.
Figure 10: Estimated function and simulation envelopes.

In Figure 9 note that , i.e., the point pattern has longer empty-space distances than a Poisson process. This suggests a clustered point pattern [8]. While in Figure 10 we observe that , i.e., the point pattern has shorter nearest-neighbour distances than a Poisson model, indicating a clustered pattern [8].

Figure 11 depicts the estimated intensity using a Gaussian kernel with bandwidth of 17 km. It can be used to locate wildfire hotspots.

Estimated intensity.
Figure 11: Estimated intensity.

4.2 Time series analysis

This time series analysis was carried out to describe the temporal behavior of wildfires. Figure 12 displays the daily number of wildfires. This immediately suggests that the wildfire time series is seasonal.

Time series of Mexico State wildfires.
Figure 12: Time series of Mexico State wildfires.

The augmented Dickey-Fuller test is used to prove that the time series is seasonal (see details in [22]). This test is included in the R package tseries [23], where the null hypothesis is that the time series is non-stationary, against the alternative hypothesis that the time series is stationary.

Table 1 displays the results of the augmented Dickey-Fuller test for the wildfire time series, with a significance level of .


Table. 1 Augmented Dickey-Fuller test results.
Test statistic -value
-5.1037

4.3 Spatio-temporal analysis

To demonstrate clustering or regularity in a spatio-temporal point pattern, the space-time inhomogeneous function (STIK) and space-time pair correlation function (STPC) can be used [14].

On the assumption that the point process on is second-order stationary, that is, their first-order and second-order properties are invariant under translations, the function is [24],

(4)

In addition, a spatio-temporal point process is second-order intensity reweighted stationary and isotropic if its intensity function is bounded away from zero, and its function is solely determined by , where and , with , , [14].

Let be a second-order intensity reweighted stationary and isotropic spatio-temporal point process with intensity ; then, from (4), its STIK function is, [14], [24],

where is the spatio-temporal pair correlation function of .

For any inhomogeneous spatio-temporal Poisson process with intensity bounded away from zero,

Figures 13 and 14 show the estimated STIK function in contour and perspective plots, respectively.

The values were plotted in order to use them as a measure of spatiotemporal aggregation or regularity. According to [24], indicates regularity.

Estimated STIK function contour plot.
Figure 13: Estimated STIK function contour plot.
Estimated STIK function perspective plot.
Figure 14: Estimated STIK function perspective plot.

Figures 15 and 16 illustrate estimated STPC function in contour and perspective plots, respectively.

For a spatio-temporal Poisson point process, . This reference can be used to determine how much more or less likely it is that a pair of events will occur at specific locations than in a Poisson process of equal intensity [14].

Estimated STPC function contour plot.
Figure 15: Estimated STPC function contour plot.
Estimated STPC function perspective plot.
Figure 16: Estimated STPC function perspective plot.

Surface behavior is regular; that is, there is yearly seasonality at distances less than 10 km, implying spatio-temporal regularity.

5 Conclusions and perspectives

The spatio-temporal point pattern of Mexico State wildfires from 2010 to 2018 tends to cluster spatially, as shown by Figures 8, 9, 10, and 11.

While the temporal behavior is stationary, as illustrated in Figure 12 and Table 1, there is a yearly wildfire season during the first semester of each year.

Finally, as shown in Figures 13, 14, 15, and 16, we demonstrate that the spatio-temporal behavior is regular. This means that wildfires tend to occur in the same season and in the same areas each year. This regular spatio-temporal behavior suggests that the underlying point process is predictable in some ways.

This research could be expanded by looking into models such as spatio-temporal log-Gaussian Cox processes [25], which can be used to make spatio-temporal predictions.

Acknowledgments

The authors would like to express their gratitude to the Universidad Autónoma Chapingo.

Appendix

This analysis was performed using the statistical programming language R [26]. The developed code is available in the repository:

https://github.com/LuisMunive/Spatio-temporal-point-process-analysis-of-Mexico-State-wildfires.

BIBLIOGRAPHY

[1] Conafor. (2018) "Historical yearly series of wildfires 2010-2018 period" Extracted from: https://datos.gob.mx/busca/dataset/incendios-forestales/resource/5720e224-3d0c-4eed-ac65-ea7aac7d72e8

[2] Rodríguez-Trejo, Dante Arturo. (2014) "Incendios de Vegetación: su ecología, manejo e historia vol. 1", Volume 1

[3] Rodríguez-Trejo, Dante Arturo. (2015) "Incendios de Vegetación: su ecología, manejo e historia vol. 2", Volume 2

[4] Carrillo-García, Rosa Laura and Rodríguez-Trejo, Dante Arturo and Tchikoué, Hubert and Monterroso-Rivas, Alejandro Ismael and Santillan-Pérez, Javier. (2012) "Análisis espacial de peligro de incendios forestales en Puebla, México", Volume 37. Asociación Interciencia. Interciencia 9 678–683

[5] Cisneros-González, Darío and Pérez-Verdín, Gustavo and Pompa-García, Marín and Rodríguez-Trejo, Dante Arturo and Zúñiga-Vásquez, José Manuel. (2017) "Spatial modeling of forest fires in Mexico: an integration of two data sources", Volume 38. Bosque 3 563–574

[6] Pompa-García, Marín and Camarero J., Julio and Rodríguez-Trejo, Dante Arturo and Vega-Nieva, Daniel José. (2018) "Drought and spatiotemporal variability of forest fires across Mexico", Volume 28. Springer Science & Business Media. Chinese Geographical Science 1 25–37

[7] Munive-Hernández, Luis Ramón. (2021) "Predicción espacial de incendios forestales usando aprendizaje máquina"

[8] Baddeley, Adrian and others. (2008) "Analysing spatial point patterns in R", Volume 3. Workshop notes version

[9] Daley, Daryl J and Vere-Jones, David. (2007) "An introduction to the theory of point processes: volume II: general theory and structure". Springer Science & Business Media

[10] Baddeley, Adrian and Bárány, Imre and Schneider, Rolf. (2007) "Spatial point processes and their applications". Springer Science & Business Media. Stochastic Geometry: Lectures Given at the CIME Summer School Held in Martina Franca, Italy, September 13–18, 2004 1–75

[11] Baddeley, Adrian and Rubak, Ege and Turner, Rolf. (2015) "Spatial point patterns: methodology and applications with R". CRC Press

[12] Gabriel, Edith and Rodriguez-Cortes, Francisco and Coville, Jérome and Mateu, Jorge and Chadoeuf, Joël. (2022) "Mapping the intensity function of a non-stationary point process in unobserved areas". Springer Science & Business Media. Stochastic Environmental Research and Risk Assessment 1–17

[13] Villanueva-Morales, Antonio. (2008) "Modified pseudo-likelihood estimation for Markov random fields with Winsorized Poisson conditional distributions". Digital Repository@ Iowa State University, http://lib. dr. iastate. edu/

[14] Gabriel, Edith and Rowlingson, Barry S and Diggle, Peter J. (2013) "stpp: an R package for plotting, simulating and analyzing Spatio-Temporal Point Patterns", Volume 53. Journal of Statistical Software 1–29

[15] Cressie, Noel. (1991) "Statistics for spatial data". John Wiley & Sons

[16] Diggle, Peter J. (2013) "Statistical analysis of spatial and spatio-temporal point patterns". CRC Press

[17] Mller, Jesper and Waagepetersen, Rasmus Plenge. (2003) "Statistical inference and simulation for spatial point processes". Chapman and Hall/CRC

[18] Bivand, Roger S and Pebesma, Edzer J and Gómez-Rubio, Virgilio and Pebesma, Edzer Jan. (2008) "Applied spatial data analysis with R", Volume 747248717. Springer Science & Business Media

[19] Ripley, Brian D. (1977) "Modelling spatial patterns", Volume 39. Wiley Online Library. Journal of the Royal Statistical Society: Series B (Methodological) 2 172–192

[20] Illian, Janine and Penttinen, Antti and Stoyan, Helga and Stoyan, Dietrich. (2008) "Statistical analysis and modelling of spatial point patterns". John Wiley & Sons

[21] Adrian Baddeley and Rolf Turner. (2005) "spatstat: An R Package for Analyzing Spatial Point Patterns", Volume 12. Journal of Statistical Software 6 1–42

[22] Peter J. Brockwell and Richard A. Davis. (2016) "Introduction to Time Series and Forecasting". Springer Science & Business Media. Springer Texts in Statistics

[23] Adrian Trapletti and Kurt Hornik. (2022) "tseries: Time Series Analysis and Computational Finance"

[24] Gabriel, Edith and Diggle, Peter J. (2009) "Second-order analysis of inhomogeneous spatio-temporal point process data", Volume 63. Wiley Online Library. Statistica Neerlandica 1 43–51

[25] Taylor, Benjamin M and Davies, Tilman M and Rowlingson, Barry S and Diggle, Peter J. (2013) "lgcp: an R package for inference with spatial and spatio-temporal log-Gaussian Cox processes", Volume 52. Journal of Statistical Software 1–40

[26] R Core Team. (2022) "R: A Language and Environment for Statistical Computing". R Foundation for Statistical Computing

Back to Top

Document information

Published on 13/12/22
Submitted on 26/10/22

Licence: CC BY-NC-SA license

Document Score

0

Views 32
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?