m (teste)
m (TEste)
Line 39: Line 39:
 
Keywords: Physical anisotropy; Multigrid components; Diffusion; Local Fourier analysis.
 
Keywords: Physical anisotropy; Multigrid components; Diffusion; Local Fourier analysis.
  
<br/>1. Introduction
+
== <br />1. Introduction ==
 
+
 
Computational Fluid Dynamics (CFD) is a branch of Computational Science that studies numerical methods for the simulation of fluid flows problems. It is known that these methods usually require a high computational cost. In general, this happens because the problems that have to be solved require the resolution of algebraic equation systems whose coefficients matrices are large and sparse [1].
 
Computational Fluid Dynamics (CFD) is a branch of Computational Science that studies numerical methods for the simulation of fluid flows problems. It is known that these methods usually require a high computational cost. In general, this happens because the problems that have to be solved require the resolution of algebraic equation systems whose coefficients matrices are large and sparse [1].
  
Line 87: Line 86:
 
The remainder of this work is organized as follows: section 2 presents the mathematical and numerical models. Section 3 discusses considerations regarding the LFA used. Section 4 presents the results of the convergence analysis and complexity analysis. Section 5 presents the conclusion.
 
The remainder of this work is organized as follows: section 2 presents the mathematical and numerical models. Section 3 discusses considerations regarding the LFA used. Section 4 presents the results of the convergence analysis and complexity analysis. Section 5 presents the conclusion.
  
2. Mathematical and Numerical Models
+
== 2. Mathematical and Numerical Models ==
 
+
 
For the problem presented below, the calculus domain used is given by  <math display="inline">{0}\leq{x}\leq{1}
 
For the problem presented below, the calculus domain used is given by  <math display="inline">{0}\leq{x}\leq{1}
 
</math>, <math display="inline">{0}\leq{y}\leq{1}
 
</math>, <math display="inline">{0}\leq{y}\leq{1}
Line 96: Line 94:
 
</math> are the number of points in the directions ''x'' and ''y'', respectively, including the boundaries.
 
</math> are the number of points in the directions ''x'' and ''y'', respectively, including the boundaries.
  
==2.1. Mathematical Model and Discretization==
+
===2.1. Mathematical Model and Discretization===
  
 
The diffusion anisotropy problem will be assessed through the two-dimension diffusion equation given by Eq. (2) [5,6]
 
The diffusion anisotropy problem will be assessed through the two-dimension diffusion equation given by Eq. (2) [5,6]
Line 124: Line 122:
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image15.png|300px]] and [[Image:draft_Vassoler Rutz_344011008-image16.png|186px]] .       
+
| <math display="inline">S=2\left[ \varepsilon \left( 1-6{{x}^{2}} \right){{y}^{2}}\left( 1-{{y}^{2}} \right)+\left( 1-6{{y}^{2}} \right){{x}^{2}}\left( 1-{{x}^{2}} \right) \right]</math> and <math display="inline">T(x,y)=({{x}^{2}}-{{x}^{4}})({{y}^{4}}-{{y}^{2}})</math>.       
 
|}
 
|}
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(3)
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(3)
Line 134: Line 132:
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
|-
 
|-
|  
+
|<math>{{a}_{P}}{{T}_{P}}+{{a}_{W}}{{T}_{W}}+{{a}_{N}}{{T}_{N}}+{{a}_{E}}{{T}_{E}}+{{a}_{S}}{{T}_{S}}={{b}_{P}}.</math>
{| style="text-align: center; margin:auto;width: 100%;"
+
|-
+
| [[Image:draft_Vassoler Rutz_344011008-image17.png|270px]] .     
+
|}
+
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(4)
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(4)
 
|}
 
|}
  
  
where ''T ''is the system unknown.
+
where <math>T</math> is the system unknown.
  
Fig. 1(b) depicts the notation of the grid points in Fig. 1(a). The points P (central), W (west), E (east), N (north) and S (south) in Fig. 1(b) correspond to the points [[Image:draft_Vassoler Rutz_344011008-image18.png|36px]] , [[Image:draft_Vassoler Rutz_344011008-image19.png|54px]] ,  
+
Fig. 1(b) depicts the notation of the grid points in Fig. 1(a). The points P (central), W (west), E (east), N (north) and S (south) in Fig. 1(b) correspond to the points <math display="inline">(i,j),(i-1,j),(i+1,j),(i,j+1),(i,j-1)</math> in Fig. 1(a), respectively.
{|
+
{| style="width: 58%;margin: 1em auto 0.1em auto;" 
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image20.png|54px]]
+
| style="text-align: center;width: 58%;" | [[Image:draft_Vassoler Rutz_344011008-image23.png|336px]]         (a)                                                    (b)                       
| [[Image:draft_Vassoler Rutz_344011008-image21.png|center|54px]]
+
|}<span id="_Ref428667097"></span><span id="_Ref428707178"></span><div id="_Toc429758828" class="center" style="width: auto; margin-left: auto; margin-right: auto;">
|}
+
Figure 1: Points of a uniform two-dimensional grid.</div>
,  [[Image:draft_Vassoler Rutz_344011008-image22.png|54px]] in Fig. 1(a), respectively.
+
  
{| style="width: 58%;margin: 1em auto 0.1em auto;"
+
The discretization of Eq. (2) results in Eq. (4), and for the inner points, considering <math display="inline">h_{x}^{{}}=\frac{1}{N_{x}^{{}}-1}</math>  and 
|-
+
|  style="text-align: center;width: 58%;"| [[Image:draft_Vassoler Rutz_344011008-image23.png|336px]] (a)                                 (b)
+
|}
+
  
 +
<math display="inline">h_{y}^{{}}=\frac{1}{N_{y}^{{}}-1}</math> ,
  
<span id='_Ref428667097'></span><span id='_Ref428707178'></span><div id="_Toc429758828" class="center" style="width: auto; margin-left: auto; margin-right: auto;">
+
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"
Figure 1: Points of a uniform two-dimensional grid.</div>
+
 
+
The discretization of Eq. (2) results in Eq. (4), and for the inner points, considering  [[Image:draft_Vassoler Rutz_344011008-image24.png|78px]] and  [[Image:draft_Vassoler Rutz_344011008-image25.png|78px]] ,
+
 
+
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
+
 
|-
 
|-
 
|  
 
|  
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image26.png|108px]] [[Image:draft_Vassoler Rutz_344011008-image27.png|102px]] [[Image:draft_Vassoler Rutz_344011008-image28.png|102px]] [[Image:draft_Vassoler Rutz_344011008-image29.png|48px]] .     
+
| <math>{{a}_{P}}=\left( \frac{2\varepsilon }{h_{x}^{2}}+\frac{2}{h_{y}^{2}} \right){{a}_{W}}={{a}_{E}}=-\frac{\varepsilon }{h_{x}^{2}}{{a}_{N}}={{a}_{S}}=-\frac{1}{h_{y}^{2}}{{b}_{P}}={{S}_{P}}.</math>    
 
|}
 
|}
| style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(5)
+
| style="text-align: center;width: 5px;text-align: right;white-space: nowrap;" |(5)
 
|}
 
|}
  
 +
For the boundaries north, south, east and west,<math display="inline">{{a}_{P}}=1,  {{a}_{W}}={{a}_{E}}={{a}_{N}}={{a}_{S}}=0.</math> .
  
For the boundaries north, south, east and west,  [[Image:draft_Vassoler Rutz_344011008-image30.png|42px]] ,  [[Image:draft_Vassoler Rutz_344011008-image31.png|156px]] .
+
===2.2. Multigrid method and computational details===
 
+
==2.2. Multigrid method and computational details==
+
  
 
The multigrid method accelerates the convergence rate of iterative methods. It consists in employing a group of grids with different refinement levels. At each grid refinement level, the more oscillatory errors are smoothed, and only low frequency errors remain. After passing to another grid, the remaining low frequency errors become more oscillatory. The efficiency of this process, called smoothing, depends on the choice of a suitable solver.
 
The multigrid method accelerates the convergence rate of iterative methods. It consists in employing a group of grids with different refinement levels. At each grid refinement level, the more oscillatory errors are smoothed, and only low frequency errors remain. After passing to another grid, the remaining low frequency errors become more oscillatory. The efficiency of this process, called smoothing, depends on the choice of a suitable solver.
Line 183: Line 168:
 
For employing multigrid, besides a solver with good smoothing properties, grid transfer operators are required (restriction and prolongation).
 
For employing multigrid, besides a solver with good smoothing properties, grid transfer operators are required (restriction and prolongation).
  
In [8] and [16], restriction through partial weighting in the directions ''x'' and ''y'', henceforth denoted by  <math display="inline">{\mbox{PW}}_\mbox{x}</math> and  <math display="inline">{\mbox{PW}}_\mbox{y}</math> , respectively, applied in problems involving geometric anisotropy was proposed. These operators, are given in stencil notation as
+
In [8] and [16], restriction through partial weighting in the directions <math>x</math> and <math>y</math>, henceforth denoted by  <math display="inline">{\mbox{PW}}_\mbox{x}</math> and  <math display="inline">{\mbox{PW}}_\mbox{y}</math> , respectively, applied in problems involving geometric anisotropy was proposed. These operators, are given in stencil notation as
  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
Line 210: Line 195:
 
|}
 
|}
  
 
+
The ILU decomposition of this same operator will be given by
==The ILU decomposition of this same operator will be given by ==
+
  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
Line 224: Line 208:
  
  
note that in this case,  [[Image:draft_Vassoler Rutz_344011008-image38.png|60px]] .
+
note that in this case,  <math display="inline">b=f=0</math>.
  
 
Thus, the ILU decomposition is represented by:
 
Thus, the ILU decomposition is represented by:
Line 233: Line 217:
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image39.png|102px]] ,
+
| <math>{{\overset{{}}{\mathop{L}}\,}_{h}}={{\overset{\hat{\ }}{\mathop{L}}\,}_{h}}{{\overset{\hat{\ }}{\mathop{U}}\,}_{h}}-{{R}_{h}},</math>
 
|}
 
|}
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(9)
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(9)
 
|}
 
|}
  
 
+
where <math>{{\overset{\hat{\ }}{\mathop{L}}\,}_{h}}</math>  is a lower triangular matrix,  <math>{{\overset{\hat{\ }}{\mathop{U}}\,}_{h}}</math> is an upper triangular matrix and  <math>{{R}_{h}}</math> is the residual matrix.
where [[Image:draft_Vassoler Rutz_344011008-image40.png|18px]] is a lower triangular matrix,  [[Image:draft_Vassoler Rutz_344011008-image41.png|18px]] is an upper triangular matrix and  [[Image:draft_Vassoler Rutz_344011008-image42.png|18px]] is the residual matrix.
+
  
 
The iterative process for solving Eq. (4) can be:
 
The iterative process for solving Eq. (4) can be:
Line 248: Line 231:
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image43.png|90px]] ,
+
|<math>{{r}^{m}}=b-A{{T}^{m}},</math><math>{{\overset{\hat{\ }}{\mathop{L}}\,}_{h}}{{y}^{m}}={{r}^{m}}\,,</math><math>{{\overset{\hat{\ }}{\mathop{U}}\,}_{h}}\,{{\sigma }^{m}}={{y}^{m}},</math>
  
[[Image:draft_Vassoler Rutz_344011008-image44.png|78px]] ,
+
<math>{{T}^{m+1}}={{T}^{m}}+{{\sigma }^{m}}.</math>
 
+
[[Image:draft_Vassoler Rutz_344011008-image45.png|78px]] ,
+
 
+
[[Image:draft_Vassoler Rutz_344011008-image46.png|102px]] .
+
 
|}
 
|}
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(10)
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(10)
Line 267: Line 246:
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image48.png|120px]] [[Image:draft_Vassoler Rutz_344011008-image49.png|120px]] [[Image:draft_Vassoler Rutz_344011008-image50.png|180px]] .
+
| <math>{{\overset{\hat{\ }}{\mathop{L}}\,}_{h}}={{\left( \begin{matrix}
 +
  0 & 0 & 0 \\
 +
  \gamma  & \delta  & 0  \\
 +
  0 & \alpha  & \beta  \\
 +
\end{matrix} \right)}_{h}}, {{\overset{\hat{\ }}{\mathop{U}}\,}_{h}}={{\left( \begin{matrix}
 +
  \zeta  & \eta  & 0  \\
 +
  0 & \delta  & \mu  \\
 +
  0 & 0 & 0  \\
 +
\end{matrix} \right)}_{h}}, {{R}_{h}}={{\left( \begin{matrix}
 +
  {{p}_{2}} & 0 & 0 & 0 & {}  \\
 +
  {} & 0 & {{p}_{3}} & 0 & {}  \\
 +
  {} & 0 & 0 & 0 & {{p}_{1}} \\
 +
\end{matrix} \right)}_{h}}.
  
 +
</math>
  
 
|}
 
|}
Line 282: Line 274:
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image52.png|114px]] [[Image:draft_Vassoler Rutz_344011008-image53.png|120px]] [[Image:draft_Vassoler Rutz_344011008-image54.png|132px]] .
+
| .
 
+
<math>{{\overset{\hat{\ }}{\mathop{L}}\,}_{h}}={{\left( \begin{matrix}
 +
  \zeta & 0 & 0  \\
 +
  \gamma  & \delta  & 0  \\
 +
  0 & \alpha  & 0  \\
 +
\end{matrix} \right)}_{h}}, {{\overset{\hat{\ }}{\mathop{U}}\,}_{h}}={{\left( \begin{matrix}
 +
  0 & \eta & 0  \\
 +
  0 & \delta  & \mu  \\
 +
  0 & 0 & \beta  \\
 +
\end{matrix} \right)}_{h}}, {{R}_{h}}={{\left( \begin{matrix}
 +
  {{p}_{1}} & {} & {}  \\
 +
  0 & 0 & 0  \\
 +
  0 & {{p}_{3}} & 0  \\
 +
  0 & 0 & 0  \\
 +
  {} & {} & {{p}_{2}}  \\
 +
\end{matrix} \right)}_{h}}.
 +
</math>
  
 
|}
 
|}
Line 292: Line 299:
 
The linear equation system given by Eq. (1) was solved using the geometric multigrid method [5,6] with correction scheme (CS), V-cycle and zero initial estimate.
 
The linear equation system given by Eq. (1) was solved using the geometric multigrid method [5,6] with correction scheme (CS), V-cycle and zero initial estimate.
  
The coarsening ratio is given by  [[Image:draft_Vassoler Rutz_344011008-image55.png|30px]] (standard coarsening) [17]. The grid transfer operators employed were: Injection Restriction (INJ), Half Weighting (HW), Full Weighting (FW) (see [6]), Partial Weighting in ''x'' ( <math display="inline">{\mbox{PW}}_\mbox{x}</math> ), Partial Weighting in ''y'' ( <math display="inline">{\mbox{PW}}_\mbox{y}</math> ) and prolongation by bilinear interpolation and 7-point interpolation. The equation systems obtained by means of discretization were resolved using 7-point ILU solvers in different directions ( <math display="inline">{\mbox{ILU}}_{\mbox{EN}}</math> ,  <math display="inline">{\mbox{ILU}}_{\mbox{NE}}</math> , among others).
+
The coarsening ratio is given by <math>r=2</math> (standard coarsening) [17]. The grid transfer operators employed were: Injection Restriction (INJ), Half Weighting (HW), Full Weighting (FW) (see [6]), Partial Weighting in <math display="inline">x</math> ( <math display="inline">{\mbox{PW}}_\mbox{x}</math> ), Partial Weighting in <math display="inline">y</math> ( <math display="inline">{\mbox{PW}}_\mbox{y}</math> ) and prolongation by bilinear interpolation and 7-point interpolation. The equation systems obtained by means of discretization were resolved using 7-point ILU solvers in different directions ( <math display="inline">{\mbox{ILU}}_{\mbox{EN}}</math> ,  <math display="inline">{\mbox{ILU}}_{\mbox{NE}}</math> , among others).
  
 
The stop criterion used to interrupt the iterative process is based on the dimensionless residual norm. The residual of the algebraic equation system is defined by
 
The stop criterion used to interrupt the iterative process is based on the dimensionless residual norm. The residual of the algebraic equation system is defined by
Line 301: Line 308:
 
{| style="text-align: center; margin:auto;"  
 
{| style="text-align: center; margin:auto;"  
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image59.png|90px]]
+
|<math>{{r}^{m}}=b-A{{T}^{m}}</math>
 
|}
 
|}
 
| style="width: 5px;text-align: right;white-space: nowrap;" | (13)
 
| style="width: 5px;text-align: right;white-space: nowrap;" | (13)
Line 307: Line 314:
  
  
where  [[Image:draft_Vassoler Rutz_344011008-image60.png|18px]] is the solution of the unknown in the iteration ''m''.
+
where  <math>T^m</math> is the solution of the unknown in the iteration ''m''.
  
Considering  [[Image:draft_Vassoler Rutz_344011008-image61.png|66px]] and  [[Image:draft_Vassoler Rutz_344011008-image62.png|60px]] , if  [[Image:draft_Vassoler Rutz_344011008-image63.png|48px]] the iterative process is interrupted if  [[Image:draft_Vassoler Rutz_344011008-image64.png|66px]] .
+
Considering  <math>{{L}^{m}}={{\left\| r_{{}}^{m} \right\|}_{1}}</math> and  <math>{{L}^{0}}={{\left\| r_{{}}^{0} \right\|}_{1}}</math> , if  <math>\frac{{{L}^{m}}}{{{L}^{0}}}\le tol</math> the iterative process is interrupted if  <math>tol={{10}^{-10}}</math> .
  
 
Double precision arithmetic was used for the simulations. The numerical codes were implemented in the Fortran 2003 language, using the Intel 9.1 Visual Fortran application. All numerical results were obtained in a computer with Intel Core i7 2.66 GHz processor, 16 GB RAM and Windows 8 operating system, 64-bit version.
 
Double precision arithmetic was used for the simulations. The numerical codes were implemented in the Fortran 2003 language, using the Intel 9.1 Visual Fortran application. All numerical results were obtained in a computer with Intel Core i7 2.66 GHz processor, 16 GB RAM and Windows 8 operating system, 64-bit version.
  
3. Local Fourier Analysis
+
== 3. Local Fourier Analysis ==
 
+
 
LFA allows to predict the performance of the multigrid method as it supplies estimates of the convergence rate of its components. Therefore, LFA becomes a powerful tool in quantitative analysis and for the project of efficient multigrid methods.
 
LFA allows to predict the performance of the multigrid method as it supplies estimates of the convergence rate of its components. Therefore, LFA becomes a powerful tool in quantitative analysis and for the project of efficient multigrid methods.
  
In order to perform the LFA, general discrete linear operators with constant coefficients are considered, which are defined in an infinite grid  [[Image:draft_Vassoler Rutz_344011008-image65.png|18px]] , where the influence of the boundaries can be dismissed [6].
+
In order to perform the LFA, general discrete linear operators with constant coefficients are considered, which are defined in an infinite grid  <math>G_h</math> , where the influence of the boundaries can be dismissed [6].
  
 
Consider the grid functions of the form of
 
Consider the grid functions of the form of
Line 326: Line 332:
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image66.png|96px]] , with  [[Image:draft_Vassoler Rutz_344011008-image67.png|102px]] ,
+
| <math display="inline">{{\phi }_{h}}\left( \theta ,x \right)={{e}^{{i\theta x}/{h}\;}}</math> , with  <math display="inline">\theta =({{\theta }_{1}},{{\theta }_{2}})\in {{R}^{2}}</math> ,
 
|}
 
|}
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(14)
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(14)
Line 332: Line 338:
  
  
where  [[Image:draft_Vassoler Rutz_344011008-image68.png|6px]] varies in the infinite grid  [[Image:draft_Vassoler Rutz_344011008-image69.png|18px]] and  [[Image:draft_Vassoler Rutz_344011008-image70.png|6px]] is a parameter that characterize the frequency of the function concerning to grid  [[Image:draft_Vassoler Rutz_344011008-image69.png|18px]] .
+
where  [[Image:draft_Vassoler Rutz_344011008-image68.png|6px]] varies in the infinite grid  <math>G_h</math> and  [[Image:draft_Vassoler Rutz_344011008-image70.png|6px]] is a parameter that characterize the frequency of the function concerning to grid  <math>G_h</math> .
  
For  [[Image:draft_Vassoler Rutz_344011008-image71.png|78px]] , every function of the grid  [[Image:draft_Vassoler Rutz_344011008-image72.png|54px]] are eigenfunctions of a discrete operator that can be written by a stencil.
+
For  <math>-\pi \le \theta <\pi </math> , every function of the grid  <math>{{\phi }_{h}}\left( \theta ,x \right)</math> are eigenfunctions of a discrete operator that can be written by a stencil.
  
 
Thus,
 
Thus,
Line 343: Line 349:
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image73.png|174px]] .
+
| <math>{{L}_{h}}\,{{\phi }_{h}}\left( \theta ,x \right)=\overset{\tilde{\ }}{\mathop{{{L}_{h}}(}}\,\,\theta )\,{{\phi }_{h}}\left( \theta ,x \right).</math>
 
|}
 
|}
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(15)
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(15)
 
|}
 
|}
 
  
 
where
 
where
  
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
+
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"
 
|-
 
|-
 
|  
 
|  
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image74.png|84px]] , [[Image:draft_Vassoler Rutz_344011008-image75.png|120px]] and [[Image:draft_Vassoler Rutz_344011008-image76.png|12px]] is the stencil notation of the operator.
+
| <math>{{L}_{h}}\,(\theta )={{\left[ {{s}_{k}} \right]}_{h}}, \overset{\tilde{\ }}{\mathop{{{L}_{h}}}}\,\,(\theta )=\sum\limits_{k}{{{s}_{k}}{{e}^{i\,\theta \,k}}}</math>and <math display="inline">{s}_{k}</math> is the stencil notation of the operator.
 
|}
 
|}
| style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(16)
+
| style="text-align: center;width: 5px;text-align: right;white-space: nowrap;" |(16)
 
|}
 
|}
  
 
+
For the smoothing and the analysis of two grids, it necessary to distinguish components of high and low frequency of  <math>G_h</math> in relation to  <math>G_{2h}</math> .
For the smoothing and the analysis of two grids, it necessary to distinguish components of high and low frequency of  [[Image:draft_Vassoler Rutz_344011008-image77.png|18px]] in relation to  [[Image:draft_Vassoler Rutz_344011008-image78.png|24px]] .
+
  
 
It is known that [6] only if
 
It is known that [6] only if
Line 371: Line 375:
 
{| style="text-align: center; margin:auto;width: 100%;"
 
{| style="text-align: center; margin:auto;width: 100%;"
 
|-
 
|-
| [[Image:draft_Vassoler Rutz_344011008-image79.png|54px]] with  [[Image:draft_Vassoler Rutz_344011008-image80.png|84px]] ,
+
| <math>{{\phi }_{h}}\left( \theta ,x \right)</math> with  [[Image:draft_Vassoler Rutz_344011008-image80.png|84px]] ,
 
|}
 
|}
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(17)
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(17)
Line 377: Line 381:
  
  
are visible in  [[Image:draft_Vassoler Rutz_344011008-image81.png|18px]] .
+
are visible in  <math>G_h</math> .
  
For each  [[Image:draft_Vassoler Rutz_344011008-image82.png|162px]] , three other frequency components  [[Image:draft_Vassoler Rutz_344011008-image83.png|48px]] with  [[Image:draft_Vassoler Rutz_344011008-image84.png|132px]] coincide in  [[Image:draft_Vassoler Rutz_344011008-image85.png|24px]] with  [[Image:draft_Vassoler Rutz_344011008-image86.png|54px]] and are not visible in  [[Image:draft_Vassoler Rutz_344011008-image87.png|24px]] . Therefore, the low- and high-frequency components are defined as follows:
+
For each  [[Image:draft_Vassoler Rutz_344011008-image82.png|162px]] , three other frequency components  <math>{{\phi }_{h}}\left( \theta ,x \right)</math> with  [[Image:draft_Vassoler Rutz_344011008-image84.png|132px]] coincide in  <math>G_{2h}</math> with  [[Image:draft_Vassoler Rutz_344011008-image86.png|54px]] and are not visible in  <math>G_{2h}</math> . Therefore, the low- and high-frequency components are defined as follows:
  
 
  [[Image:draft_Vassoler Rutz_344011008-image88.png|12px]] is a low-frequency component  [[Image:draft_Vassoler Rutz_344011008-image89.png|222px]] ;
 
  [[Image:draft_Vassoler Rutz_344011008-image88.png|12px]] is a low-frequency component  [[Image:draft_Vassoler Rutz_344011008-image89.png|222px]] ;
Line 396: Line 400:
 
<span id='_Ref428218644'></span><span id='_Toc428200463'></span><span id='_Toc428200508'></span><span id='_Toc429758843'></span><span id='_Ref466666326'></span>
 
<span id='_Ref428218644'></span><span id='_Toc428200463'></span><span id='_Toc428200508'></span><span id='_Toc429758843'></span><span id='_Ref466666326'></span>
  
==Figure 2: Low- (inner white area) and high-frequency (hatched area) areas.==
+
Figure 2: Low- (inner white area) and high-frequency (hatched area) areas.
 
+
== Considering the frequencies==
+
  
 +
Considering the frequencies
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
|-
 
|-
Line 413: Line 416:
 
|}
 
|}
  
 
+
The operator  [[Image:draft_Vassoler Rutz_344011008-image95.png|24px]] is represented by a 4x4 matrix  [[Image:draft_Vassoler Rutz_344011008-image96.png|24px]] , as follows
==The operator  [[Image:draft_Vassoler Rutz_344011008-image95.png|24px]] is represented by a 4x4 matrix  [[Image:draft_Vassoler Rutz_344011008-image96.png|24px]] , as follows==
+
 
+
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
|-
 
|-
Line 425: Line 426:
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(19)
 
|  style="text-align: center;width: 5px;text-align: right;white-space: nowrap;"|(19)
 
|}
 
|}
 
  
 
where  [[Image:draft_Vassoler Rutz_344011008-image99.png|18px]] is represented by a 4x4 identity matrix,  [[Image:draft_Vassoler Rutz_344011008-image100.png|36px]] is the 4x4 matrix,  [[Image:draft_Vassoler Rutz_344011008-image101.png|42px]] is a 1x4 matrix,  [[Image:draft_Vassoler Rutz_344011008-image102.png|42px]] is a 4x1 matrix and  [[Image:draft_Vassoler Rutz_344011008-image103.png|66px]] is a 1x1 matrix.
 
where  [[Image:draft_Vassoler Rutz_344011008-image99.png|18px]] is represented by a 4x4 identity matrix,  [[Image:draft_Vassoler Rutz_344011008-image100.png|36px]] is the 4x4 matrix,  [[Image:draft_Vassoler Rutz_344011008-image101.png|42px]] is a 1x4 matrix,  [[Image:draft_Vassoler Rutz_344011008-image102.png|42px]] is a 4x1 matrix and  [[Image:draft_Vassoler Rutz_344011008-image103.png|66px]] is a 1x1 matrix.
Line 456: Line 456:
 
|}
 
|}
  
 
+
The asymptotic convergence factor  [[Image:draft_Vassoler Rutz_344011008-image111.png|54px]] can be calculated by
== The asymptotic convergence factor  [[Image:draft_Vassoler Rutz_344011008-image111.png|54px]] can be calculated by==
+
  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
Line 490: Line 489:
 
|}
 
|}
  
 
+
The smoothing operator  [[Image:draft_Vassoler Rutz_344011008-image134.png|18px]] , according to [6], is given by
==The smoothing operator  [[Image:draft_Vassoler Rutz_344011008-image134.png|18px]] , according to [6], is given by==
+
  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
 
{| class="formulaSCP" style="width: 100%;margin: 1em auto 0.1em auto;width: 100%;text-align: center;"  
Line 519: Line 517:
 
For  <math display="inline">{\mbox{ILU}}_{\mbox{EN}}</math> ,  [[Image:draft_Vassoler Rutz_344011008-image138.png|234px]] and for  <math display="inline">{\mbox{ILU}}_{\mbox{NE}}</math> ,  [[Image:draft_Vassoler Rutz_344011008-image140.png|234px]] .
 
For  <math display="inline">{\mbox{ILU}}_{\mbox{EN}}</math> ,  [[Image:draft_Vassoler Rutz_344011008-image138.png|234px]] and for  <math display="inline">{\mbox{ILU}}_{\mbox{NE}}</math> ,  [[Image:draft_Vassoler Rutz_344011008-image140.png|234px]] .
  
4. Numerical Results
+
== 4. Numerical Results ==
 
+
 
In this work, the anisotropic diffusion equation was solved using 7-point ILU solver in different directions. Several restriction operators and two prolongation operators were employed. It is proposed an algorithm that presents the lowest asymptotic convergence factor values and the lowest computational cost for the multigrid method.
 
In this work, the anisotropic diffusion equation was solved using 7-point ILU solver in different directions. Several restriction operators and two prolongation operators were employed. It is proposed an algorithm that presents the lowest asymptotic convergence factor values and the lowest computational cost for the multigrid method.
  
Line 527: Line 524:
 
Section 4.1 presents the convergence analysis by means of LFA. Only the optimum components obtained via LFA will be used in the complexity analysis in section 4.2.
 
Section 4.1 presents the convergence analysis by means of LFA. Only the optimum components obtained via LFA will be used in the complexity analysis in section 4.2.
  
==4.1. Convergence Analysis==
+
===4.1. Convergence Analysis===
  
 
Fig. 3 depicts'' '' [[Image:draft_Vassoler Rutz_344011008-image148.png|24px]] , given by Eq. (22), with  <math display="inline">\mbox{ILU}</math> in the EN, NE, ES, SE directions, FW restriction, bilinear prolongation, number of inner iterations  [[Image:draft_Vassoler Rutz_344011008-image150.png|36px]] ,  [[Image:draft_Vassoler Rutz_344011008-image141.png|48px]] and  [[Image:draft_Vassoler Rutz_344011008-image151.png|54px]] , with  [[Image:draft_Vassoler Rutz_344011008-image152.png|42px]] .
 
Fig. 3 depicts'' '' [[Image:draft_Vassoler Rutz_344011008-image148.png|24px]] , given by Eq. (22), with  <math display="inline">\mbox{ILU}</math> in the EN, NE, ES, SE directions, FW restriction, bilinear prolongation, number of inner iterations  [[Image:draft_Vassoler Rutz_344011008-image150.png|36px]] ,  [[Image:draft_Vassoler Rutz_344011008-image141.png|48px]] and  [[Image:draft_Vassoler Rutz_344011008-image151.png|54px]] , with  [[Image:draft_Vassoler Rutz_344011008-image152.png|42px]] .
Line 610: Line 607:
 
Next section presents the complexity analysis of multigrid. For the analysis, multigrid was built with the components that had the best convergence factors, according to LFA. In addition, a comparison between partial weighting and full weighting will be presented.
 
Next section presents the complexity analysis of multigrid. For the analysis, multigrid was built with the components that had the best convergence factors, according to LFA. In addition, a comparison between partial weighting and full weighting will be presented.
  
==4.2. Complexity Analysis==
+
===4.2. Complexity Analysis===
  
 
In order to assess the effect of the number of unknowns on CPU time, optimum components obtained via LFA were used. Figures 8 (a) and 8 (b) show, for FW and  <math display="inline">{\mbox{PW}}_\mbox{x}</math> restrictions respectively, that for the anisotropic problem ( [[Image:draft_Vassoler Rutz_344011008-image195.png|36px]] ), the  <math display="inline">t_{CPU}</math> is not lower. As the problem becomes more anisotropic ( [[Image:draft_Vassoler Rutz_344011008-image156.png|66px]] or  [[Image:draft_Vassoler Rutz_344011008-image158.png|42px]] ), the  [[Image:draft_Vassoler Rutz_344011008-image197.png|24px]] decreases for every ''N ''value assessed. For every anisotropy coefficient assessed, the  [[Image:draft_Vassoler Rutz_344011008-image198.png|24px]] with FW restriction is very similar to the  [[Image:draft_Vassoler Rutz_344011008-image199.png|24px]] with  <math display="inline">{\mbox{PW}}_\mbox{x}</math> restriction, that is,  <math display="inline">t_{CPU}\left(FW\right)\approx t_{CPU}\left({\mbox{PW}}_\mbox{x}\right)</math> .
 
In order to assess the effect of the number of unknowns on CPU time, optimum components obtained via LFA were used. Figures 8 (a) and 8 (b) show, for FW and  <math display="inline">{\mbox{PW}}_\mbox{x}</math> restrictions respectively, that for the anisotropic problem ( [[Image:draft_Vassoler Rutz_344011008-image195.png|36px]] ), the  <math display="inline">t_{CPU}</math> is not lower. As the problem becomes more anisotropic ( [[Image:draft_Vassoler Rutz_344011008-image156.png|66px]] or  [[Image:draft_Vassoler Rutz_344011008-image158.png|42px]] ), the  [[Image:draft_Vassoler Rutz_344011008-image197.png|24px]] decreases for every ''N ''value assessed. For every anisotropy coefficient assessed, the  [[Image:draft_Vassoler Rutz_344011008-image198.png|24px]] with FW restriction is very similar to the  [[Image:draft_Vassoler Rutz_344011008-image199.png|24px]] with  <math display="inline">{\mbox{PW}}_\mbox{x}</math> restriction, that is,  <math display="inline">t_{CPU}\left(FW\right)\approx t_{CPU}\left({\mbox{PW}}_\mbox{x}\right)</math> .
Line 623: Line 620:
 
| [[Image:draft_Vassoler Rutz_344011008-image207.jpeg|center|282px]]
 
| [[Image:draft_Vassoler Rutz_344011008-image207.jpeg|center|282px]]
 
|}
 
|}
</div>
+
</div>Figure 8: CPU time versus number of nodes (N).
 
+
==Figure 8: CPU time versus number of nodes (N).==
+
  
 
In order to assess the performance of the multigrid with different anisotropy coefficients, a curve adjustment of the form  [[Image:draft_Vassoler Rutz_344011008-image208.png|72px]] [18] was made, where ''p ''represents the complexity order of the solver, ''N'' is the number of grid points and ''c ''is a constant that depends on the method. The closer the value of ''p ''is to one, the better the performance of method used. Ideally, multigrid presents ''p=1'', what means that the CPU time grows linearly with the increase of ''N''. Results are shown in Table 1 for both restrictions assessed (FW and  <math display="inline">{\mbox{PW}}_\mbox{x}</math> ).
 
In order to assess the performance of the multigrid with different anisotropy coefficients, a curve adjustment of the form  [[Image:draft_Vassoler Rutz_344011008-image208.png|72px]] [18] was made, where ''p ''represents the complexity order of the solver, ''N'' is the number of grid points and ''c ''is a constant that depends on the method. The closer the value of ''p ''is to one, the better the performance of method used. Ideally, multigrid presents ''p=1'', what means that the CPU time grows linearly with the increase of ''N''. Results are shown in Table 1 for both restrictions assessed (FW and  <math display="inline">{\mbox{PW}}_\mbox{x}</math> ).
Line 699: Line 694:
  
 
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
 
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
Table 2. Ratio between number of flops of a V-cycle and the number of points of the finest grid. </div>
+
Table 2. Ratio between number of flops of a V-cycle and the number of points of the finest grid.  
 +
 
 +
</div>
  
 
{| style="width: 76%;margin: 1em auto 0.1em auto;border-collapse: collapse;"  
 
{| style="width: 76%;margin: 1em auto 0.1em auto;border-collapse: collapse;"  
Line 730: Line 727:
  
 
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
 
<div class="center" style="width: auto; margin-left: auto; margin-right: auto;">
Table 3. Ratio between number of flops of the restriction and number of points of the finest grid.  </div>
+
Table 3. Ratio between number of flops of the restriction and number of points of the finest grid.   
 +
 
 +
</div>
  
 
According to Tab. 2, the multigrid cycle that requires the lowest number of flops is the cycle with  <math display="inline">{\mbox{PW}}_\mbox{x}</math> operator, except for the isotropic case ( [[Image:draft_Vassoler Rutz_344011008-image232.png|36px]] ). Considering only the restriction step, results presented in Tab. 3 show a great advantage of the  <math display="inline">{\mbox{PW}}_\mbox{x}</math> operator over the FW operator regarding the number of flops performed. For every case, the number of flops for  <math display="inline">{\mbox{PW}}_\mbox{x}</math> is roughly 75% lower than the number of flops for FW.
 
According to Tab. 2, the multigrid cycle that requires the lowest number of flops is the cycle with  <math display="inline">{\mbox{PW}}_\mbox{x}</math> operator, except for the isotropic case ( [[Image:draft_Vassoler Rutz_344011008-image232.png|36px]] ). Considering only the restriction step, results presented in Tab. 3 show a great advantage of the  <math display="inline">{\mbox{PW}}_\mbox{x}</math> operator over the FW operator regarding the number of flops performed. For every case, the number of flops for  <math display="inline">{\mbox{PW}}_\mbox{x}</math> is roughly 75% lower than the number of flops for FW.
Line 736: Line 735:
 
The remaining anisotropy coefficients assessed showed similar performances to those presented in Fig. 5 and Tables 2 and 3, what confirms the efficiency and robustness of the algorithm proposed, in addition to the low computational cost with the use of the  <math display="inline">{\mbox{PW}}_\mbox{x}</math> operator.
 
The remaining anisotropy coefficients assessed showed similar performances to those presented in Fig. 5 and Tables 2 and 3, what confirms the efficiency and robustness of the algorithm proposed, in addition to the low computational cost with the use of the  <math display="inline">{\mbox{PW}}_\mbox{x}</math> operator.
  
5. Conclusions
+
== 5. Conclusions ==
 
+
 
- Among the ILU directions assessed (EN, NE, ES, SE) for standard multigrid (FW restriction operator and bilinear prolongation), it is concluded that  <math display="inline">{\rho }_{loc}\approx 0.02</math> when using  <math display="inline">{\mbox{ILU}}_{\mbox{EN}}</math> for  [[Image:draft_Vassoler Rutz_344011008-image156.png|66px]] and  <math display="inline">{\mbox{ILU}}_{\mbox{NE}}</math> '' ''for [[Image:draft_Vassoler Rutz_344011008-image158.png|42px]] <span style="text-align: center; font-size: 75%;">.</span>
 
- Among the ILU directions assessed (EN, NE, ES, SE) for standard multigrid (FW restriction operator and bilinear prolongation), it is concluded that  <math display="inline">{\rho }_{loc}\approx 0.02</math> when using  <math display="inline">{\mbox{ILU}}_{\mbox{EN}}</math> for  [[Image:draft_Vassoler Rutz_344011008-image156.png|66px]] and  <math display="inline">{\mbox{ILU}}_{\mbox{NE}}</math> '' ''for [[Image:draft_Vassoler Rutz_344011008-image158.png|42px]] <span style="text-align: center; font-size: 75%;">.</span>
  

Revision as of 23:44, 9 April 2018


Abstract: In this work it is proposed an efficient and robust algorithm for the solution of a physical anisotropy problem. This algorithm is based on the choice of the most efficient restriction operator and on an incomplete LU decomposition suited for each anisotropy direction. Local Fourier analysis is carried out in order to assist the project of an efficient multigrid method. The model considered is pure diffusion with anisotropy aligned to the coordinate axis x. Finite difference method with uniform grid and second-order numerical scheme is used for the discretization of equations. Problems are solved with geometric multigrid method, correction scheme, V-cycle and standard coarsening ratio. The asymptotic convergence factor is calculated for different multigrid components, such as restriction operators, prolongation operators and solvers. Based on the optimum components obtained by LFA, experiments are carried out for the analysis of complexity and computational cost for the algorithm proposed. The main conclusion is that the methodology proposed is efficient for the resolution of problems with strong anisotropy.

Keywords: Physical anisotropy; Multigrid components; Diffusion; Local Fourier analysis.


1. Introduction

Computational Fluid Dynamics (CFD) is a branch of Computational Science that studies numerical methods for the simulation of fluid flows problems. It is known that these methods usually require a high computational cost. In general, this happens because the problems that have to be solved require the resolution of algebraic equation systems whose coefficients matrices are large and sparse [1].

Linear systems are obtained with discretization of the mathematical model, which consists in approximating, through algebraic equations, each term of the mathematical model for each grid node or point. This process leads to an algebraic equation system of the form

(1)

where is the coefficient matrix, is the variable of interest and is the independent vector.

In CFD, the methods traditionally employed in this process are: Finite Difference Method (FDM) [2], Finite Volume Method (FVM) [3] and Finite Element Method (FEM) [4].

The algebraic equation system given by Eq. (1) can be solved using direct or iterative methods. In this work, iterative methods were employed due to the aforementioned characteristics of linear systems in CFD (sparse and large coefficients matrices), for which iterative methods are recommended. The multigrid method is one of the most effective methods to accelerate the convergence of iterative methods for the resolution of linear or nonlinear systems, isotropic or anisotropic problems, among others [5-7].

Anisotropic problems are fairly common in Engineering and appear in many phenomena, such as in the case of a material with different heat conduction behaviors for different directions. In this case, the coefficients of the differential equations are distinct among themselves and generate, what is called, physical anisotropy. Anisotropies can also appear from discretization of grids with different spacing in each direction, for instance, boundary layer problems. This is called geometric anisotropy [5,6]. The efficiency of the multigrid method has not yet been fully achieved for problems with strong anisotropy, either physical or geometric [8].

Physical anisotropy problems have been investigated by Rabi and de Lemos [9], who discretized the two-dimensional advection-diffusion equation combining null and constant coefficients. Finite volume method was used in the discretization of the equations. Multigrid method was employed using correction scheme, Gauss-Seidel solver and V- and W-cycles. The authors presented a study of the different speed ranges (null and constant), number of grids, number of smoothing steps at each grid level and different solvers. They concluded that there was a significant reduction in the computational effort when the speed range values were increased.

Wienands and Joppich [10] presented an in-depth study on the Local Fourier Analysis (LFA) and its application on several problems, which included anisotropic problems. The authors calculated the convergence factor of the multigrid method for the anisotropic diffusion equation with different solvers and restriction and prolongation operators.

Johannsen [11] solved an anisotropic diffusion problem using finite volume method for discretizing the equations and 9-point incomplete LU (ILU) decomposition method for solving the linear equations systems. The author employed LFA to demonstrate the superior smoothing properties of ILU.

Oliveira et al. [8] assessed geometric anisotropy for different grids and aspect ratios. They also assessed some components of the multigrid method, such as: solvers, type of restriction, number of levels and number of inner iterations, among several coarsening algorithms. They concluded that Partial Weighting (PW) had a good performance.

Vinogradova and Krukier [12] solved a three-dimensional advection-diffusion problem with intermediate anisotropy using FDM for the discretization of the equations and ILU for the resolution of the linear equation systems. They concluded that the proposed methodology is efficient, but it has the disadvantage of having limitations in the coefficients of the mixed derivatives, which has no physical meaning.

Vassoler-Rutz et al. [11] analyzed the effect of physical anisotropy on multigrid method for two anisotropic diffusion problems. They used FAS scheme, V-cycle, and Modified Strongly Implicit (MSI) and Gauss-Seidel (GS) solvers. They concluded that, for strong anisotropies, the complexity order of the multigrid method is not suitable.

Several works on the implementation of the multigrid method found in the literature demonstrate that the choice of the multigrid components is crucial for the convergence or not of the method. Trottenberg et al. [6] state that these choices are difficult and thus small changes can considerably improve convergence. In this sense, LFA can assist these choices as it allows to predict the performance of the multigrid method, since it provides estimates of the converge rates based on the variation of the multigrid method components.

Pinto et. al. [14] solved the anisotropic diffusion problem using ILU in triangular grids. They used LFA to show the good smoothing properties of the solver and the asymptotic convergence of the multigrid.

Franco et. al. [13] performed LFA in transient problems and obtained the critical value of the parameter that represents the level of space-time anisotropy for the 1D and 2D Fourier equations.

In this work, an efficient and robust method for solving physical anisotropy problems using LFA is proposed. A two-dimensional diffusion mathematical model is considered, in which physical anisotropy appears in the coefficients and it will be denominated diffusion anisotropy. Equations were discretized using FDM with second-order central difference scheme.

The asymptotic convergence factor () of the multigrid method was calculated through assessment of the ILU solver in different directions [7] and several restriction and prolongation operators. The results obtained by means of LFA were used to assess the influence of physical anisotropy on computational cost by means of CPU time and number of operations in each V-cycle and at the restriction step.

The remainder of this work is organized as follows: section 2 presents the mathematical and numerical models. Section 3 discusses considerations regarding the LFA used. Section 4 presents the results of the convergence analysis and complexity analysis. Section 5 presents the conclusion.

2. Mathematical and Numerical Models

For the problem presented below, the calculus domain used is given by , and the discretization of the equations is done using a uniform grid with a number of points given by , where and are the number of points in the directions x and y, respectively, including the boundaries.

2.1. Mathematical Model and Discretization

The diffusion anisotropy problem will be assessed through the two-dimension diffusion equation given by Eq. (2) [5,6]

(2)


where T is the temperature, is the second derivative of T as a function of x, is the second derivative as a function of y and .

The source term S and the analytical solution T are given by

and .
(3)


Eq. (2) was discretized using FDM with CDS, resulting in

(4)


where is the system unknown.

Fig. 1(b) depicts the notation of the grid points in Fig. 1(a). The points P (central), W (west), E (east), N (north) and S (south) in Fig. 1(b) correspond to the points in Fig. 1(a), respectively.

Draft Vassoler Rutz 344011008-image23.png (a) (b)
Figure 1: Points of a uniform two-dimensional grid.

The discretization of Eq. (2) results in Eq. (4), and for the inner points, considering and

,

(5)

For the boundaries north, south, east and west, .

2.2. Multigrid method and computational details

The multigrid method accelerates the convergence rate of iterative methods. It consists in employing a group of grids with different refinement levels. At each grid refinement level, the more oscillatory errors are smoothed, and only low frequency errors remain. After passing to another grid, the remaining low frequency errors become more oscillatory. The efficiency of this process, called smoothing, depends on the choice of a suitable solver.

For employing multigrid, besides a solver with good smoothing properties, grid transfer operators are required (restriction and prolongation).

In [8] and [16], restriction through partial weighting in the directions and , henceforth denoted by and , respectively, applied in problems involving geometric anisotropy was proposed. These operators, are given in stencil notation as

 : Draft Vassoler Rutz 344011008-image34.png ,  : Draft Vassoler Rutz 344011008-image35.png .
(6)


In this work, such restriction was combined with 7-point incomplete LU decomposition (henceforth denoted by ILU). According to [17], this decomposition has a better convergence factor than 5-point incomplete LU decomposition for anisotropic problems.

In Eq. (2), discretized using FDM, the stencil for 5-point Laplacian operator is given by

Draft Vassoler Rutz 344011008-image36.png .
(7)

The ILU decomposition of this same operator will be given by

Draft Vassoler Rutz 344011008-image37.png ,
(8)


note that in this case, .

Thus, the ILU decomposition is represented by:

(9)

where is a lower triangular matrix, is an upper triangular matrix and is the residual matrix.

The iterative process for solving Eq. (4) can be:

(10)


Depending on the ordination of the grid points, different directions can be obtained for the ILU decomposition. In lexicographical order, [17], is given by

(11)


Another example of ordination for ILU, [17], is given by

.

(12)


The linear equation system given by Eq. (1) was solved using the geometric multigrid method [5,6] with correction scheme (CS), V-cycle and zero initial estimate.

The coarsening ratio is given by (standard coarsening) [17]. The grid transfer operators employed were: Injection Restriction (INJ), Half Weighting (HW), Full Weighting (FW) (see [6]), Partial Weighting in ( ), Partial Weighting in ( ) and prolongation by bilinear interpolation and 7-point interpolation. The equation systems obtained by means of discretization were resolved using 7-point ILU solvers in different directions ( , , among others).

The stop criterion used to interrupt the iterative process is based on the dimensionless residual norm. The residual of the algebraic equation system is defined by

(13)


where is the solution of the unknown in the iteration m.

Considering and , if the iterative process is interrupted if .

Double precision arithmetic was used for the simulations. The numerical codes were implemented in the Fortran 2003 language, using the Intel 9.1 Visual Fortran application. All numerical results were obtained in a computer with Intel Core i7 2.66 GHz processor, 16 GB RAM and Windows 8 operating system, 64-bit version.

3. Local Fourier Analysis

LFA allows to predict the performance of the multigrid method as it supplies estimates of the convergence rate of its components. Therefore, LFA becomes a powerful tool in quantitative analysis and for the project of efficient multigrid methods.

In order to perform the LFA, general discrete linear operators with constant coefficients are considered, which are defined in an infinite grid , where the influence of the boundaries can be dismissed [6].

Consider the grid functions of the form of

, with ,
(14)


where Draft Vassoler Rutz 344011008-image68.png varies in the infinite grid and Draft Vassoler Rutz 344011008-image70.png is a parameter that characterize the frequency of the function concerning to grid .

For , every function of the grid are eigenfunctions of a discrete operator that can be written by a stencil.

Thus,

(15)

where

and is the stencil notation of the operator.
(16)

For the smoothing and the analysis of two grids, it necessary to distinguish components of high and low frequency of in relation to .

It is known that [6] only if

with Draft Vassoler Rutz 344011008-image80.png ,
(17)


are visible in .

For each Draft Vassoler Rutz 344011008-image82.png , three other frequency components with Draft Vassoler Rutz 344011008-image84.png coincide in with Draft Vassoler Rutz 344011008-image86.png and are not visible in . Therefore, the low- and high-frequency components are defined as follows:

Draft Vassoler Rutz 344011008-image88.png is a low-frequency component  Draft Vassoler Rutz 344011008-image89.png ;
Draft Vassoler Rutz 344011008-image90.png is a high-frequency component  
Draft Vassoler Rutz 344011008-image91.png
Draft Vassoler Rutz 344011008-image92.png

. See Fig. 2.

Draft Vassoler Rutz 344011008-image93.png

Figure 2: Low- (inner white area) and high-frequency (hatched area) areas.

Considering the frequencies

.
(18)

The operator Draft Vassoler Rutz 344011008-image95.png is represented by a 4x4 matrix Draft Vassoler Rutz 344011008-image96.png , as follows

Draft Vassoler Rutz 344011008-image97.png , with Draft Vassoler Rutz 344011008-image98.png ,
(19)

where Draft Vassoler Rutz 344011008-image99.png is represented by a 4x4 identity matrix, Draft Vassoler Rutz 344011008-image100.png is the 4x4 matrix, Draft Vassoler Rutz 344011008-image101.png is a 1x4 matrix, Draft Vassoler Rutz 344011008-image102.png is a 4x1 matrix and Draft Vassoler Rutz 344011008-image103.png is a 1x1 matrix.

A representation for Draft Vassoler Rutz 344011008-image104.png can be obtained by a matrix Draft Vassoler Rutz 344011008-image105.png of the form

Draft Vassoler Rutz 344011008-image106.png ,
(20)


where Draft Vassoler Rutz 344011008-image107.png is given by Eq. (19) and Draft Vassoler Rutz 344011008-image108.png is a 4x4 matrix and represents Draft Vassoler Rutz 344011008-image109.png .

Let it be

.
(21)

The asymptotic convergence factor Draft Vassoler Rutz 344011008-image111.png can be calculated by

Draft Vassoler Rutz 344011008-image112.png ,
(22)


where Draft Vassoler Rutz 344011008-image113.png is the spectral radius of the 4x4 matrix Draft Vassoler Rutz 344011008-image114.png .

In this work, LFA was used to determine the asymptotic convergence factor of the multigrid method ( Draft Vassoler Rutz 344011008-image115.png ) combining ILU solvers in several directions (such as and ), FW, HW, INJ, and restriction operators and bilinear and 7-point prolongation operators.

For the FW restriction operator, Draft Vassoler Rutz 344011008-image120.png , for the , Draft Vassoler Rutz 344011008-image122.png and for the , Draft Vassoler Rutz 344011008-image124.png . For the other restriction operators, see [4].

For the bilinear prolongation operator, Draft Vassoler Rutz 344011008-image125.png and for the 7-point prolongation operator, Draft Vassoler Rutz 344011008-image126.png [10].

In order to perform the LFA using the solver, the auto-functions of Draft Vassoler Rutz 344011008-image128.png , Draft Vassoler Rutz 344011008-image129.png , Draft Vassoler Rutz 344011008-image130.png and Draft Vassoler Rutz 344011008-image131.png are given by

Draft Vassoler Rutz 344011008-image132.png with Draft Vassoler Rutz 344011008-image133.png .
(23)

The smoothing operator Draft Vassoler Rutz 344011008-image134.png , according to [6], is given by

Draft Vassoler Rutz 344011008-image135.png , Draft Vassoler Rutz 344011008-image133.png ,
(24)


with

Draft Vassoler Rutz 344011008-image136.png .
(25)


For , Draft Vassoler Rutz 344011008-image138.png and for , Draft Vassoler Rutz 344011008-image140.png .

4. Numerical Results

In this work, the anisotropic diffusion equation was solved using 7-point ILU solver in different directions. Several restriction operators and two prolongation operators were employed. It is proposed an algorithm that presents the lowest asymptotic convergence factor values and the lowest computational cost for the multigrid method.

Eq. (2) was assessed for Draft Vassoler Rutz 344011008-image141.png and Draft Vassoler Rutz 344011008-image142.png , with Draft Vassoler Rutz 344011008-image143.png . When Draft Vassoler Rutz 344011008-image144.png or Draft Vassoler Rutz 344011008-image145.png in this work, there is symmetric anisotropy. For instance, Draft Vassoler Rutz 344011008-image146.png is an anisotropy symmetric to Draft Vassoler Rutz 344011008-image147.png .

Section 4.1 presents the convergence analysis by means of LFA. Only the optimum components obtained via LFA will be used in the complexity analysis in section 4.2.

4.1. Convergence Analysis

Fig. 3 depicts Draft Vassoler Rutz 344011008-image148.png , given by Eq. (22), with in the EN, NE, ES, SE directions, FW restriction, bilinear prolongation, number of inner iterations Draft Vassoler Rutz 344011008-image150.png , Draft Vassoler Rutz 344011008-image141.png and Draft Vassoler Rutz 344011008-image151.png , with Draft Vassoler Rutz 344011008-image152.png .

Draft Vassoler Rutz 344011008-image153.jpeg

Figure 3: Draft Vassoler Rutz 344011008-image148.png versus anisotropy coefficients ( Draft Vassoler Rutz 344011008-image154.png ) with in different directions.

It is noticed that for Draft Vassoler Rutz 344011008-image156.png , has a good performance and for

Draft Vassoler Rutz 344011008-image158.png

has a good performance as well, that is, Draft Vassoler Rutz 344011008-image160.png . By using solvers in the ES and SE directions, the multigrid did not present a good performance for any of the anisotropy coefficients studied.

For the analyses presented below, tests were carried out using only the solvers that had the best performances in the previous analysis.

Fig. 4 presents Draft Vassoler Rutz 344011008-image148.png using as solvers, for Draft Vassoler Rutz 344011008-image156.png and for Draft Vassoler Rutz 344011008-image158.png ; Draft Vassoler Rutz 344011008-image164.png ; FW restriction; 7-point and bilinear prolongation; Draft Vassoler Rutz 344011008-image141.png and Draft Vassoler Rutz 344011008-image151.png , with Draft Vassoler Rutz 344011008-image165.png .

Draft Vassoler Rutz 344011008-image166.jpeg

Figure 4: Draft Vassoler Rutz 344011008-image148.png versus anisotropy coefficients ( Draft Vassoler Rutz 344011008-image154.png ) with different interpolation operators.

It is observed that the restriction|prolongation combinations FW|bilinear and FW|7-points had a good performance ( Draft Vassoler Rutz 344011008-image160.png ) and presented very similar convergence factors.

Bilinear prolongation operator was used in the following analysis as it is easy to program and demands fewer memory resources. The asymptotic convergence factors were compared with different restriction operators.

Fig. 5 presents Draft Vassoler Rutz 344011008-image148.png using as solvers, for Draft Vassoler Rutz 344011008-image156.png and for Draft Vassoler Rutz 344011008-image158.png ; Draft Vassoler Rutz 344011008-image164.png ; FW, HW, INJ, and restriction; bilinear prolongation; Draft Vassoler Rutz 344011008-image141.png and Draft Vassoler Rutz 344011008-image151.png , with Draft Vassoler Rutz 344011008-image165.png .

Draft Vassoler Rutz 344011008-image171.jpeg

Figure 5: Draft Vassoler Rutz 344011008-image148.png versus anisotropy coefficients ( Draft Vassoler Rutz 344011008-image154.png ) with different restriction operators.

Fig. 5 shows that for Draft Vassoler Rutz 344011008-image172.png and Draft Vassoler Rutz 344011008-image173.png , with Draft Vassoler Rutz 344011008-image165.png (symmetric anisotropies), Draft Vassoler Rutz 344011008-image174.png presents very similar values. It is noted that for anisotropic problems ( Draft Vassoler Rutz 344011008-image175.png ), the lowest values for Draft Vassoler Rutz 344011008-image174.png are achieved with FW and restriction, which present very similar values.

Based on the results shown, it is proposed Algorithm 1, which combines ILU solver in different directions with FW and restrictions. The algorithm will be presented now. The abbreviation REST, used in the algorithm, represents any of the restrictions (FW or ) previously defined.

Algorithm1:

___________________________________________________________________________

if Draft Vassoler Rutz 344011008-image179.png then

Apply smoothing with REST restriction

else if Draft Vassoler Rutz 344011008-image181.png then

Apply smoothing with FW restriction

else

Apply REST restriction

end if

___________________________________________________________________________

Next, it is presented the asymptotic convergence factor Draft Vassoler Rutz 344011008-image184.png , calculated via LFA, and the empiric asymptotic convergence factor Draft Vassoler Rutz 344011008-image185.png , for different anisotropy coefficients.

Fig. 6 shows Algorithm 1 with REST , Draft Vassoler Rutz 344011008-image164.png and bilinear prolongation.

Draft Vassoler Rutz 344011008-image187.jpeg

Figure 6: Draft Vassoler Rutz 344011008-image188.png and Draft Vassoler Rutz 344011008-image189.png versus anisotropy coefficients ( Draft Vassoler Rutz 344011008-image154.png ).

It is observed that Draft Vassoler Rutz 344011008-image190.png for every anisotropy coefficients assessed. Moreover, Draft Vassoler Rutz 344011008-image184.png calculated via LFA is in accordance with Draft Vassoler Rutz 344011008-image185.png calculated experimentally.

Fig. 7 depicts the numerical asymptotic convergence factor Draft Vassoler Rutz 344011008-image184.png calculated via LFA and the experimental asymptotic convergence factor Draft Vassoler Rutz 344011008-image185.png , for different anisotropy coefficients for different grids. Algorithm 1 with REST = , Draft Vassoler Rutz 344011008-image164.png and bilinear prolongation was used. As the grid becomes more refined, Draft Vassoler Rutz 344011008-image192.png for every anisotropy coefficient analyzed, what demonstrates the robustness of the methodology assessed.

Draft Vassoler Rutz 344011008-image193.jpeg

Figure 7: Draft Vassoler Rutz 344011008-image188.png and Draft Vassoler Rutz 344011008-image189.png versus anisotropy coefficients ( Draft Vassoler Rutz 344011008-image154.png ) for different number of grid points.

Next section presents the complexity analysis of multigrid. For the analysis, multigrid was built with the components that had the best convergence factors, according to LFA. In addition, a comparison between partial weighting and full weighting will be presented.

4.2. Complexity Analysis

In order to assess the effect of the number of unknowns on CPU time, optimum components obtained via LFA were used. Figures 8 (a) and 8 (b) show, for FW and restrictions respectively, that for the anisotropic problem ( Draft Vassoler Rutz 344011008-image195.png ), the is not lower. As the problem becomes more anisotropic ( Draft Vassoler Rutz 344011008-image156.png or Draft Vassoler Rutz 344011008-image158.png ), the Draft Vassoler Rutz 344011008-image197.png decreases for every N value assessed. For every anisotropy coefficient assessed, the Draft Vassoler Rutz 344011008-image198.png with FW restriction is very similar to the Draft Vassoler Rutz 344011008-image199.png with restriction, that is, .

It is also observed for symmetric anisotropies ( Draft Vassoler Rutz 344011008-image202.png and Draft Vassoler Rutz 344011008-image203.png ) with Draft Vassoler Rutz 344011008-image204.png , that the values of Draft Vassoler Rutz 344011008-image205.png obtained are extremely similar.

Draft Vassoler Rutz 344011008-image206.jpeg
Draft Vassoler Rutz 344011008-image207.jpeg
Figure 8: CPU time versus number of nodes (N).

In order to assess the performance of the multigrid with different anisotropy coefficients, a curve adjustment of the form Draft Vassoler Rutz 344011008-image208.png [18] was made, where p represents the complexity order of the solver, N is the number of grid points and c is a constant that depends on the method. The closer the value of p is to one, the better the performance of method used. Ideally, multigrid presents p=1, what means that the CPU time grows linearly with the increase of N. Results are shown in Table 1 for both restrictions assessed (FW and ).

One can observe in Tab. 1 that, for every anisotropy employed, the multigrid method has a good performance, since p ≈ 1 in every case. These results prove the efficiency and robustness of Algorithm 1, proposed in this work.

Draft Vassoler Rutz 344011008-image210.png Draft Vassoler Rutz 344011008-image211.png Draft Vassoler Rutz 344011008-image212.png
Draft Vassoler Rutz 344011008-image213.png 1.07747 1.07733
Draft Vassoler Rutz 344011008-image214.png 1.05023 1.05894
1 1.05940 1.04380
Draft Vassoler Rutz 344011008-image215.png 1.07255 1.06199
Draft Vassoler Rutz 344011008-image216.png 1.07627 1.07775


Table 1. Complexity order (p) for different anisotropy coefficients ( Draft Vassoler Rutz 344011008-image210.png ).

The values obtained for the convergence factor, presented in section 4.1, and for the complexity order concerning to the and FW restriction operators (Tab. 1) are quite similar and thus, insufficient to decide which one results in a more efficient algorithm.

To complement the analysis, the number of arithmetical operations performed in each V-cycle and at the restriction step for each one of the operators was assessed.

These arithmetical operations concern to floating point operations (flops) performed during the iterative process and are not affect by the hardware used. Each addition, multiplication and division operation correspond to 1 flop.

Tests were carried out for Draft Vassoler Rutz 344011008-image218.png ( Draft Vassoler Rutz 344011008-image219.png is the number of points of the finest grid, considering a problem whose maximum number of levels is Draft Vassoler Rutz 344011008-image220.png ) and for some values of Draft Vassoler Rutz 344011008-image221.png . Tab. 2 presents the ratio between number of flops of a V-cycle and number of points of the finest grid Draft Vassoler Rutz 344011008-image222.png . Tab. 3 shows the ratio between number of flops performed in each restriction and the number of points of the finest grid Draft Vassoler Rutz 344011008-image222.png .

Draft Vassoler Rutz 344011008-image210.png Draft Vassoler Rutz 344011008-image223.png
Draft Vassoler Rutz 344011008-image214.png 1510.221554 1543.296057
Draft Vassoler Rutz 344011008-image225.png 1798.299366 1837.988770
Draft Vassoler Rutz 344011008-image226.png 2368.277388 2126.896785
Draft Vassoler Rutz 344011008-image227.png 1860.997604 1900.687008
Draft Vassoler Rutz 344011008-image228.png 1564.958717 1598.033220


Table 2. Ratio between number of flops of a V-cycle and the number of points of the finest grid.

Draft Vassoler Rutz 344011008-image210.png
Draft Vassoler Rutz 344011008-image214.png 11.57607615 44.65057942
Draft Vassoler Rutz 344011008-image225.png 13.89129130 53.58069530
Draft Vassoler Rutz 344011008-image226.png 18.37833483 62.02688007
Draft Vassoler Rutz 344011008-image227.png 13.89129137 53.58069530
Draft Vassoler Rutz 344011008-image228.png 11.57607615 44.65057942


Table 3. Ratio between number of flops of the restriction and number of points of the finest grid.

According to Tab. 2, the multigrid cycle that requires the lowest number of flops is the cycle with operator, except for the isotropic case ( Draft Vassoler Rutz 344011008-image232.png ). Considering only the restriction step, results presented in Tab. 3 show a great advantage of the operator over the FW operator regarding the number of flops performed. For every case, the number of flops for is roughly 75% lower than the number of flops for FW.

The remaining anisotropy coefficients assessed showed similar performances to those presented in Fig. 5 and Tables 2 and 3, what confirms the efficiency and robustness of the algorithm proposed, in addition to the low computational cost with the use of the operator.

5. Conclusions

- Among the ILU directions assessed (EN, NE, ES, SE) for standard multigrid (FW restriction operator and bilinear prolongation), it is concluded that when using for Draft Vassoler Rutz 344011008-image156.png and for Draft Vassoler Rutz 344011008-image158.png .

- With FW restriction, for Draft Vassoler Rutz 344011008-image156.png and

Draft Vassoler Rutz 344011008-image158.png

, it can be concluded that the 7-point and bilinear prolongation operators presented a similar performance, and for every value of Draft Vassoler Rutz 344011008-image241.png assessed, Draft Vassoler Rutz 344011008-image160.png was obtained.

- With bilinear interpolation, for Draft Vassoler Rutz 344011008-image156.png and

Draft Vassoler Rutz 344011008-image158.png

, it can be stated that the lowest values of Draft Vassoler Rutz 344011008-image174.png are obtained with FW and restriction operators, and that the values of Draft Vassoler Rutz 344011008-image174.png with these operators are very similar.

- Using Algorithm 1, Draft Vassoler Rutz 344011008-image190.png for every anisotropy coefficient assessed and Draft Vassoler Rutz 344011008-image192.png as the grid becomes more refined.

- The for every value of Draft Vassoler Rutz 344011008-image241.png assessed.

- The complexity order p of the multigrid method with Algorithm 1 is close to one for every anisotropy assessed. For Draft Vassoler Rutz 344011008-image246.png , for example, p=1.07747 with this algorithm.

- The computational cost of multigrid depends on the number of flops of the restriction in a V-cycle. Using Algorithm 1 with restriction, the computational cost is 75% lower than with FW restriction.

- The Algorithm 1 with restriction proposed in this work is efficient, robust and has low computational cost.

References

[1] T. Gradl, K. Klamroth, U. Rüde, U. Optimizing the number of multigrid cycles in the full multigrid algorithm. Numerical Linear Algebra with Applications. Vol. 17 (2010) 199–210.
[2] R. H. Pletcher, J. C. Tannehill, D. A. Anderson, Computational Fluid Mechanics and Heat Transfer, 3ª ed. CRC Press, USA, 2013.
[3] G. H. Golub, J. M. Ortega, Scientific Computing and Differential Equations: an Introduction to Numerical Methods, Academic Press, 1992.
[4] Y. Saad, Iterative Methods for Sparse Linear Systems, 2ªed., PWS, Philadelphia, 2003.
[5] W. L. Briggs, V.E. Henson, S.F. Mccormick, A Multigrid Tutorial, 2ª ed., SIAM, USA, 2000.
[6] U. Trottenberg, C. Oosterlee, A. Schüller, Multigrid, Academic Press, England, 2001.
[7] P. Wesseling, An Introduction to Multigrid Methods, John Wiley & Sons, England, 1992.
[8] F. Oliveira, M. A. V. Pinto, C. H. Marchi, L. K. Araki. Optimized partial semicoarsening multigrid algorithm for heat diffusion problems and anisotropic grids. Applied Mathematical Modelling. Vol. 36 (2012) 4665–4676.
[9] J. A. Rabi, M. J. S. de Lemos. Optimization of convergence acceleration in multigrid numerical solutions of conductive-convective problems. Applied Mathematics and Computational. Vol. 124 (2001) 215-226.
[10] R. Wienands, W. Joppich, Practical Fourier Analysis for Multigrid Methods, CRC Press, USA, 2005.
[11] K. Johannsen. A robust 9-point ILU smoother for anisotropic problems, IWR Preprint, University of Heidelberg, 2005.
[12] S. A. Vinogradova, L. A. Krukier, The use of incomplete LU decomposition in modeling convection-diffusion processes in an anisotropic medium. Mathematical Models and Computer Simulations. Vol. 5 (2013) 190-197.
[13] G. Vassoler-Rutz, M. A. V. Pinto, R. Suero, Comparison of the physical anisotropy of multigrid method for two-dimensional diffusion equation, in: Proceedings of COBEM, Rio de Janeiro, Brazil, Dezembro 6-11.
[14] M. A. V. Pinto, C. Rodrigo, F. J. Gaspar, C.W. Oosterlee. On the robustness of ILU smoothers on triangular grids, Applied Numerical Mathematics. Vol. 106 (2016), 37-52.
[15] S. R. Franco, F. J. Gaspar, M. A. V. Pinto, C. Rodrigo. Multigrid method based on a space-time approach with standard coarsening for parabolic problems. Applied Mathematics and Computation. Vol. 317 (2018) 25-34.
[16] F. Oliveira. Effect of Two-Dimensional Anisotropic Meshes on the Performance of the Geometric Multigrid Method. Tese de doutorado, UFPR, 2010. In portuguese.
[17] P. Wesseling, C. W. Oosterlee. Geometric Multigrid with Applications to Computational Fluid Dynamics. Journal of Computation and Applied Mathematics. Vol. 128 (2001) 311-334.
[18] R. L. Burden, J.D. Faires, Numerical Analysis, 9ª ed., Cengage Learning, USA, 2010.
Back to Top

Document information

Published on 11/03/19
Accepted on 07/03/19
Submitted on 09/04/18

Volume 35, Issue 1, 2019
DOI: 10.23967/j.rimni.2019.03.001
Licence: CC BY-NC-SA license

Document Score

0

Views 109
Recommendations 0

Share this document