New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
performance_ex1
with triangles
#4202
Comments
Your suggested fixes look good to me. Thanks @aschaf! |
I opened #4206 to resolve the issue. A related question: if I want to further speed up the assembly (eg. on my laptop) should I use the MPI parallel version to utilize all available cores, or would it make sense to somehow extend the TBilinearform to utilize eg. OpenMP for the assembly loop? |
The MPI-parallel version should be easier to try -- just look at how |
I noticed another issue I somewhat missed in the pull request. When turning on SIMD, it again crashes at line 103 in Lines 94 to 106 in 7c296d0
specifically because at some point el+j+i*vsize is greater than the number of elements. A quick and dirty fix was just to replace line 103 with
|
@aschaf, you are right -- this is a bug. In the respective situation in Line 214 in 7c296d0
A better approach for here and for |
@v-dobrev Thank you for the explanation, I will open a PR once this is fixed. |
Hi,
I was playing around with the performance miniapp
performance_ex1
, and in order to be able to run the program with the-perf
and-asm
option on triangular meshes, I had to apply a small fix totbilinearform.hpp
. Atmfem/fem/tbilinearform.hpp
Lines 505 to 507 in 37e03ab
I replaced line 506 with
which then allowed the program to run with
-perf -ams
. Second, I also tried using aTPiecewiseConstCoefficient
and I found an error inmfem/fem/teltrans.hpp
Lines 94 to 106 in 37e03ab
where the increment in the inner loop increases the wrong variable. If these fixes are reasonable, I would open a pull request fixing those issues.
Best,
Andreas
The text was updated successfully, but these errors were encountered: