Mesh setting in Moose: replicated and distributed


Shohei Ogawa

unread,
Jun 13, 2018, 5:47:35 PM6/13/18
to moose...@googlegroups.com
Hello,

I wrote down my understanding of the mesh settings in MOOSE, along with some questions. I hope that, as long as it is correct, the explanation below will be helpful to others who use large meshes.

To begin with, there are two meshing options: replicated and distributed. These can be selected via the parallel_type parameter in the Mesh block of your input file.

The default value of parallel_type is DEFAULT. This makes MOOSE use the replicated mesh unless you do something special to invoke the distributed mesh.
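For reference, the parameter can be set like this in the input file (a minimal sketch; GeneratedMesh and its parameters here are only illustrative placeholders for your own mesh setup):

```
[Mesh]
  type = GeneratedMesh          # illustrative; any mesh type works
  dim = 2
  nx = 100
  ny = 100
  # parallel_type = replicated  # the default behavior
  parallel_type = distributed   # opt in to the distributed mesh
[]
```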

1. Replicated mesh format
With the replicated mesh, every core in a simulation holds the entire mesh. If you do `mpiexec -n 64 ...`, there are 64 complete copies of the mesh data, rather than mesh data stored separately for each node. Is this correct? The typical benefit from this is that an implementation of some features is easier. I believe some existing features in MOOSE don't support the distributed mesh; in other words, those features work only with the replicated mesh. Examples of such features can be found:

Using the replicated mesh simplifies things, but if your mesh is too large, too much memory is consumed by redundant mesh data. In the past, I ran some simulations with 10 to 20 million elements on about 60 cores. I observed very high memory usage during the simulations, particularly at the beginning. I didn't pay attention to the mesh setting, so I was using the replicated mesh, and a large portion of the memory went to mesh data.

If the number of elements is more than 1 million, you can benefit from the distributed mesh format, as discussed in the thread below.

2. Distributed mesh format
Each core owns only the part of the mesh it needs. There are (at least) two ways to use the distributed mesh. One is simply adding --distributed-mesh to the command you execute. The other is pre-splitting the mesh, as described here.
Pre-splitting is useful if your mesh is too large to fit into the RAM of your machine or node without it.
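As a sketch of the pre-splitting workflow (assuming an application binary named my_app-opt, which is a placeholder; check the MOOSE mesh-splitting documentation for the exact flags on your version):

```
# Split the mesh into 64 pieces ahead of time (can be a small, separate job):
./my_app-opt -i input.i --split-mesh 64 --split-file mesh

# Then run the real job on the pre-split mesh:
mpiexec -n 64 ./my_app-opt -i input.i --use-split --split-file mesh
```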

You can use the Nemesis output format, which creates one Exodus file per process. Those files can be visualized in ParaView with ease. Also, is it right that even with a distributed mesh we can export data as Exodus to put all the data into one file? This would be helpful for me.

It would be great if you could give me feedback.

Thank you,
Shohei Ogawa

Wang, Yaqi

unread,
Jun 13, 2018, 6:05:12 PM6/13/18
to moose-users
Very good summary, Shohei!

A few minor comments.

On Wed, Jun 13, 2018 at 3:46 PM, Shohei Ogawa <ogawa...@gmail.com> wrote:
Hello,

I wrote down my understanding of the mesh settings in MOOSE, along with some questions. I hope that, as long as it is correct, the explanation below will be helpful to others who use large meshes.

To begin with, there are two meshing options: replicated and distributed. These can be selected via the parallel_type parameter in the Mesh block of your input file.

The default value of parallel_type is DEFAULT. This makes MOOSE use the replicated mesh unless you do something special to invoke the distributed mesh.

What exactly do you mean by 'something special'?
The parallel_type parameter will be overridden by the command-line argument --distributed-mesh, if I understand correctly. --distributed-mesh will also be passed to MultiApps. Also, parallel_type is possibly not needed when the mesh type is Nemesis. It should not currently be set to DISTRIBUTED when pre-splitting the mesh. (I did not verify these very carefully, though.) MOOSE needs to protect this parameter from wrong usage, imo.
 

1. Replicated mesh format
With the replicated mesh, every core in a simulation holds the entire mesh. If you do `mpiexec -n 64 ...`, there are 64 complete copies of the mesh data, rather than mesh data stored separately for each node. Is this correct? The typical benefit from this is that an implementation of

for each node => for each core. I believe the understanding is correct.
 
some features is easier. I believe some existing features in MOOSE don't support the distributed mesh; in other words, those features work only with the replicated mesh. Examples of such features can be found:

Using the replicated mesh simplifies things, but if your mesh is too large, too much memory is consumed by redundant mesh data. In the past, I ran some simulations with 10 to 20 million elements on about 60 cores. I observed very high memory usage during the simulations, particularly at the beginning. I didn't pay attention to the mesh setting, so I was using the replicated mesh, and a large portion of the memory went to mesh data.

If the number of elements is more than 1 million, you can benefit from the distributed mesh format, as discussed in the thread below.

2. Distributed mesh format
Each core owns only the part of the mesh it needs. There are (at least) two ways to use the distributed mesh. One is simply adding --distributed-mesh to the command you execute. The other is pre-splitting the mesh, as described here.
Pre-splitting is useful if your mesh is too large to fit into the RAM of your machine or node without it.

You can use the Nemesis output format, which creates one Exodus file per process. Those files can be visualized in ParaView with ease. Also, is it right that even with a distributed mesh we can export data as Exodus to put all the data into one file? This would be helpful for me.


I do not know if there is a tool to do this.


 

--
You received this message because you are subscribed to the Google Groups "moose-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to moose-users+unsubscribe@googlegroups.com.
Visit this group at https://groups.google.com/group/moose-users.
To view this discussion on the web visit https://groups.google.com/d/msgid/moose-users/CAHfiw0e60Fpu5AMgsSw13p8EQ%3Dsfm__%3DO3x9Bi%2BXMWA_dxhAnw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Shohei Ogawa

unread,
Jun 13, 2018, 6:27:30 PM6/13/18
to moose-users
Hi Yaqi,

Thank you for the reply. Below are my comments on your comments.

Shohei

On Wednesday, June 13, 2018 at 6:05:12 PM UTC-4, yaqi.wang wrote:
Very good summary, Shohei!

Few minor comments.

On Wed, Jun 13, 2018 at 3:46 PM, Shohei Ogawa <ogawa...@gmail.com> wrote:
Hello,

I wrote down my understanding of the mesh settings in MOOSE, along with some questions. I hope that, as long as it is correct, the explanation below will be helpful to others who use large meshes.

To begin with, there are two meshing options: replicated and distributed. These can be selected via the parallel_type parameter in the Mesh block of your input file.

The default value of parallel_type is DEFAULT. This makes MOOSE use the replicated mesh unless you do something special to invoke the distributed mesh.

What exactly do you mean by 'something special'?
The parallel_type parameter will be overridden by the command-line argument --distributed-mesh, if I understand correctly. --distributed-mesh will also be passed to MultiApps. Also, parallel_type is possibly not needed when the mesh type is Nemesis. It should not currently be set to DISTRIBUTED when pre-splitting the mesh. (I did not verify these very carefully, though.) MOOSE needs to protect this parameter from wrong usage, imo.
Your understanding is right. By 'something special', I meant adding a command-line option such as --distributed-mesh or using a pre-split mesh.
 

1. Replicated mesh format
With the replicated mesh, every core in a simulation holds the entire mesh. If you do `mpiexec -n 64 ...`, there are 64 complete copies of the mesh data, rather than mesh data stored separately for each node. Is this correct? The typical benefit from this is that an implementation of

for each node => for each core. I believe the understanding is correct.
This makes a lot of sense. In my previous simulations, I noticed that if I increased the number of cores, the memory usage jumped up, so I had to keep the core count reasonably low. I will use the distributed mesh format next time.
 
some features is easier. I believe some existing features in MOOSE don't support the distributed mesh; in other words, those features work only with the replicated mesh. Examples of such features can be found:

Using the replicated mesh simplifies things, but if your mesh is too large, too much memory is consumed by redundant mesh data. In the past, I ran some simulations with 10 to 20 million elements on about 60 cores. I observed very high memory usage during the simulations, particularly at the beginning. I didn't pay attention to the mesh setting, so I was using the replicated mesh, and a large portion of the memory went to mesh data.

If the number of elements is more than 1 million, you can benefit from the distributed mesh format, as discussed in the thread below.

2. Distributed mesh format
Each core owns only the part of the mesh it needs. There are (at least) two ways to use the distributed mesh. One is simply adding --distributed-mesh to the command you execute. The other is pre-splitting the mesh, as described here.
Pre-splitting is useful if your mesh is too large to fit into the RAM of your machine or node without it.

You can use the Nemesis output format, which creates one Exodus file per process. Those files can be visualized in ParaView with ease. Also, is it right that even with a distributed mesh we can export data as Exodus to put all the data into one file? This would be helpful for me.


I do not know if there is a tool to do this.
If I have exodus = true in my Outputs block, I get one Exodus file with the --distributed-mesh option or when using a pre-split mesh. The Exodus file looks fine to me, although I didn't inspect the data much. I also got the warning below.
Warning:  This MeshOutput subclass only supports meshes which have been serialized!
So I was wondering if MOOSE or libMesh has a feature to combine data from all the processors into one Exodus file. If you run a large problem with very many cores, it makes sense not to combine the output into one file. I believe this is inefficient, or even impossible, when the data is too large for the RAM available to one processor.
 


 

Wang, Yaqi

unread,
Jun 13, 2018, 6:36:42 PM6/13/18
to moose-users
Maybe we can have MOOSE provide a functionality like --merge-mesh. Or such a capability may already exist somewhere else. When we use a distributed mesh with Exodus output, the master processor has a memory spike during output, which can destroy your calculation if you are very tight on memory.
 


 

Cody Permann

unread,
Jun 13, 2018, 6:53:07 PM6/13/18
to moose...@googlegroups.com
"exodus = true" will recombine the mesh for you; that is exactly what it does, even for a distributed mesh.

It could also run you out of RAM on processor 0, so be careful. I'd recommend using Nemesis for speed and space.
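A sketch of what that choice looks like in the Outputs block (parameter names as I understand them; verify against the MOOSE Outputs documentation):

```
[Outputs]
  # One file per MPI rank; avoids serializing the whole mesh to rank 0:
  nemesis = true
  # exodus = true  # recombines everything into a single file,
  #                # at the cost of a rank-0 memory spike
[]
```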

--
You received this message because you are subscribed to the Google Groups "moose-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to moose-users...@googlegroups.com.
Visit this group at https://groups.google.com/group/moose-users.

Shohei Ogawa

unread,
Jun 13, 2018, 6:56:14 PM6/13/18
to moose-users
Thank you very much for the information.

Shohei