Implement new oren nayar model from OpenPBR #1817

fpsunflower · 2024-05-18T02:56:33Z

Description

This implements the "EON" energy conserving flavor of Oren-Nayar proposed to be part of OpenPBR and MaterialX. I've added it to the existing closure for now, but it would be easy to split it off as its own closure (the amount of shared logic is not that much, and we can easily use the same parameter struct). This is mainly to get the ball rolling on implementation.

Tests

I've tweaked the existing furnace test to test both variants. As expected, the new closure passes the furnace test prefectly while the old one had some issues.

Checklist:

I have read the contribution guidelines.
I have updated the documentation, if applicable.
I have ensured that the change is tested somewhere in the testsuite (adding new test cases if necessary).
My code follows the prevailing code style of this project. If I haven't
already run clang-format v17 before submitting, I definitely will look at
the CI test that runs clang-format and fix anything that it highlights as
being nonconforming.

… arg for backwards compatibility) Signed-off-by: Chris Kulla <ckulla@gmail.com>

…eep checking the old mode as well) Signed-off-by: Chris Kulla <ckulla@gmail.com>

fpsunflower · 2024-05-18T02:57:05Z

@jstone-lucasfilm and @portsmouth may be interested in this.

jstone-lucasfilm · 2024-05-19T00:49:44Z

This is great timing, thanks @fpsunflower, and I'm posting a link to my initial implementation of EON Diffuse in GLSL:

AcademySoftwareFoundation/MaterialX#1822

Let's compare notes on these two implementations in upcoming days, and I'm happy to borrow elements from your version where we can improve alignment.

lgritz · 2024-05-20T15:57:19Z

Needs clang-format but otherwise LGTM. I'm leaving it for others to check the math.

Oh, maybe update the docs where the material closures are described: src/doc/stdlib.md

jstone-lucasfilm · 2024-05-20T16:44:28Z

@fpsunflower Two initial notes on differences between the current GLSL and OSL, where we should discuss the best approach on which to harmonize:

In GLSL, I've named the new option energy_compensation, based on the naming convention I've seen for GGX energy compensation flags in some production renderers, while in OSL the new option is named energy_conservation. I don't have a strong opinion on this, and am open to preferences from the group.
In GLSL, I'm using the original definitions of the A and B terms, independent of whether energy compensation is enabled, while in OSL the definitions of A and B change when energy compensation is enabled. As above, I'm open to preferences from the group as to which approach is preferred.

Signed-off-by: Chris Kulla <ckulla@gmail.com>

fpsunflower · 2024-05-20T17:52:45Z

energy_compensation sounds better. I've updated.

For the A/B terms, I'm following Jamie's recommendation to switch to the Fujii terms as well.

It is worth considering if it wouldn't just be simpler to have a second BSDF since what we are doing here is really more than just compensating the previous one.

portsmouth · 2024-05-20T17:55:13Z

I agree with "energy compensation" over "energy conservation".

As strictly speaking, the latter just means that energy is not gained. While the model actually ensures that energy is not lost in the white albedo case (i.e. energy preservation) -- via the compensation.

portsmouth · 2024-05-20T18:00:54Z

Note that we're working on improved importance sampling, using an LTC lobe. We get some decent reduction in variance from that, though we're still testing (and talking with @shill-lucasfilm about possible further improvements):

A self-contained GLSL implementation of the sampling function is below, if you want to try it. Of course, since this is WIP, for this PR you probably will want to just use regular cosine-weighted sampling and leave this improvement for later. Just thought it worth mentioning.

vec4 sample_cosine_lobe(inout uint rndSeed)
{
    float rh = sqrt(rand(rndSeed));
    float phi_h = 2.0 * PI * rand(rndSeed);
    float x = rh * cos(phi_h); float y = rh * sin(phi_h);
    float z = sqrt(max(0.0, 1.0 - x*x - y*y));
    return vec4(x, y, z, abs(z) / PI);
}

vec4 sample_EON(in vec3 wo, float roughness, inout uint rndSeed)
{
    vec4 wh = sample_cosine_lobe(rndSeed);         // wh sample
    float sinO = sqrt(1.0 - wo.z*wo.z);            // for LTC elements
    float a = roughness*pow(sinO, 50.0)*0.5 + 1.0; // LTC M element a
    float b = roughness*pow(sinO, 4.0)*sqrt(wo.z); // LTC M element b
    vec3 wi = vec3(a*wh.x + b*wh.z, a*wh.y, wh.z); // wi = M wh
    float l = 1.0/length(wi);                      // 1/|M wh| = |M^-1 wi|
    wi *= l;                                       // normalize wi
    float inv_det = 1.0/(a*a);                     // |M^-1|
    float J = inv_det / (l*l*l);                   // |M^-1| / |M^-1 wi|^3
    float pdf_i = wh.w * J;                        // PDF D(wi), Eqn.1 LTC paper
    float phi_o = atan(wo.y, wo.x);                // find azimuth of wo plane
    float C = cos(phi_o); float S = sin(phi_o);    // 2d rot matrix
    mat2 R = mat2(C, S, -S, C);                    // 2d rot matrix
    wi = vec3(R*wi.xz, wi.z);                      // rotate wi
    return vec4(wi, pdf_i);
}

jstone-lucasfilm · 2024-05-20T18:00:55Z

@fpsunflower @portsmouth I'm open to the idea that EON is a completely distinct diffuse model, especially if it uses an independent set of A and B terms, though I'd like to work through the additional infrastructure that this would require of renderers.

If we maintain independent A and B terms for EON and QON, I'm imagining that this would require independent directional albedo lookups for the two models as well (e.g. in the context of pre-integrated irradiance maps for real-time shading).

portsmouth · 2024-05-20T18:04:42Z

@fpsunflower @portsmouth I'm open to the idea that EON is a completely distinct diffuse model, especially if it uses an independent set of A and B terms, though I'd like to work through the additional infrastructure that this would require of renderers.

👍

If we maintain independent A and B terms for EON and QON, I'm imagining that this would require independent directional albedo lookups for the two models as well (e.g. in the context of pre-integrated irradiance maps for real-time shading).

Note that the directional albedo for EON is analytical, as given below in GLSL, so should be pretty easy to add:

const float pi = 3.14159265f;
const float constant1_FON = 0.5f - 2.0f / (3.0f * pi);
const float constant2_FON = 2.0f / 3.0f - 28.0f / (15.0f * pi);

  // FON directional albedo (approx.)
  float E_FON_approx(float mu, float roughness)
  {
      float sigma = roughness; // FON sigma prime
      float mucomp = 1.0f - mu;
      float mucomp2 = mucomp * mucomp;
      const mat2 Gcoeffs = mat2(0.0571085289f, -0.332181442f,
                                0.491881867f,   0.0714429953f);
      float GoverPi = dot(Gcoeffs * vec2(mucomp, mucomp2), vec2(1.0f, mucomp2));
      return (1.0f + sigma * GoverPi) / (1.0f + constant1_FON * sigma);
  }

  // EON directional albedo (approx)
  vec3 E_EON(vec3 rho, float roughness, vec3 wi_local)
  {
      float sigma = roughness;                           // FON sigma prime
      float mu_i = wi_local.z;                           // input angle cos
      float AF = 1.0f / (1.0f + constant1_FON * sigma);  // FON A coeff.
      float EF = E_FON_approx(mu_i, sigma);              // FON wi albedo (approx)
      float avgEF = AF * (1.0f + constant2_FON * sigma); // average albedo
      vec3 rho_ms = (rho * rho) * avgEF / (vec3(1.0f) - rho * (1.0f - avgEF));
      return rho * EF + rho_ms * (1.0f - EF);
  }

jstone-lucasfilm · 2024-05-20T18:14:04Z

@portsmouth This sounds reasonable to me, and I'll give this a try in MaterialX.

From a naming perspective, I'd lean towards continuing to use the name oren_nayar_diffuse_bsdf for this shading model, following in the tradition of continuing to use the name generalized_schlick_bsdf when Naty's F82 improvements were included.

I'd be interested in thoughts from others on the thread, and I'm open to counter-proposals for naming conventions if you have a compelling suggestion.

portsmouth · 2024-05-20T18:22:33Z

If it's going to be a single BSDF with non-energy-preserving (legacy) and energy-preserving modes, I think it's fine to keep the name as oren_nayar_diffuse_bsdf (as all we're doing is tweaking/improving the original BSDF really, and the name is a famous one so no need to reinvent it).

If you want to make it a separate BSDF, then some other name is needed, but I think that's more confusing.

fpsunflower · 2024-05-20T18:33:22Z

Maybe we should simply call it portsmouth_diffuse_brdf? or portsmouth_fujii_diffuse_brdf?

I would argue that tweaking fresnel is a different case than modifying the shape of the lobe. Also, IIRC the F82 change was mostly "backwards compatible" in that a default of white for F82 didn't change the look that much.

Here, there are basically 3 improvements we are taking:

Fujii's "dark ring" fix
Fujii's new A/B terms
Jamie's analytical energy compensation lobe

In my implementation there weren't that many lines of code that could be shared in practice (even less so if we derive a custom sampling routine for this bsdf). So I think it probably would be cleanest to just make it a new brdf.

lgritz · 2024-05-20T18:37:30Z

Aside: I like how everybody seems to be allowed to have just one BSDF named after them per lifetime. Choose wisely!

jstone-lucasfilm · 2024-05-21T00:13:15Z

@fpsunflower @portsmouth I've updated the MaterialX GLSL implementation to use Fujii's constants and fixes when energy_compensation is enabled. Take a look when you have a chance, and we can continue comparing notes on these two initial implementations of EON.

etheory · 2024-05-23T22:05:41Z

Hi all, I tried to look for the Slack channel where this discussion started (only just became aware of it in the OSL TSC) but here was a spreadsheet I made a while back that does the same thing, not sure if it's the same, but it's also closed-form and very simple: https://www.desmos.com/calculator/extaj6cnzr (and reciprocal).

fpsunflower · 2024-05-24T04:40:58Z

@etheory it looks like your math aligns with what @portsmouth wrote up. Very cool :)

The MaterialX change was just merged, I will make a few final tweaks to handle rho correctly (had a TODO about this in the code) and merge this one as well.

etheory · 2024-05-24T05:19:14Z

src/testrender/shading.cpp

+                      + constant1_FON * sigma);  // Fujii model A coefficient
+        float BF = sigma * AF;                   // Fujii model B coefficient
+        float Si = sqrtf(std::max(0.0f, 1.0f - mu * mu));
+        float G  = Si * (OIIO::fast_acos(mu) - Si * mu)


Why is the fast_acos required here?
If you look at my version: https://www.desmos.com/calculator/extaj6cnzr the E3(x) function I define can take mu directly as a parameter and provides a very high quality polynomial fit to give the same result.
It's quite a lot faster numerically. Curious if you would be interested in testing the performance. It also doesn't require special handling as it has no issues at either 0 or 1. It should provide a significant performance improvement over calling acos.

i.e.

You can then use E3(mu) in place of E(acos(mu)) and get a significant performance improvement with an absolutely tiny error and no need for special edge-case handling.

@etheory Thanks for your notes, that's helpful. Very interesting that you also found a similar functional form.

We also did a fit to that G function, which has maximum fraction error of 6e-4 over all directions. We can compare to your formulation as well.

If it turns out to be better, we'd be happy to credit you in the write-up, if you don't mind sharing your real name.

Sure thing. My name is Luke Emrose. Engineering Manager for Rendering at Animal Logic. More than happy for you to use it if it's useful.

Thanks Luke! Obviously if we find something in your notes that we use in the write-up, we will let you know and credit you in the acknowledgements.

Yeah I went for the vanilla analytic formula for now to have as a reference. I fully expect swapping it for one of the approximations in a second pass.

etheory · 2024-05-24T05:24:44Z

src/testrender/shading.cpp

+                   / (1.0f
+                      + constant1_FON * sigma);  // Fujii model A coefficient
+        float BF = sigma * AF;                   // Fujii model B coefficient
+        float Si = sqrtf(std::max(0.0f, 1.0f - mu * mu));


Is mu truly bounded to be <=1 given the small discrepancies of vector normalization in float?
I'd say probably not, and if it isn't, you'll get issues.
I suggest doing a 0,1 clamp on mu just to make sure. If Si >1, then G could become negative, which would be VERY bad.

mu is guaranteed to be >0 here. We check for the >=1 case by clamping the term inside the sqrt for Si.

…ler. Take into account the rho parameter as described in the paper. Signed-off-by: Chris Kulla <ckulla@gmail.com>

Signed-off-by: Chris Kulla <ckulla@gmail.com>

…n#1817) * Implement new EON mode for Oren-Nayar closures (via an opt-in keyword arg for backwards compatibility) * Add test for energy conserving Oren-Nayar (on half the sphere so we keep checking the old mode as well) Signed-off-by: Chris Kulla <ckulla@gmail.com>

fpsunflower added 2 commits May 17, 2024 17:36

Implement new EON mode for Oren-Nayar closures (via an opt-in keyword…

68547e8

… arg for backwards compatibility) Signed-off-by: Chris Kulla <ckulla@gmail.com>

Add test for energy conserving Oren-Nayar (on half the sphere so we k…

775b904

…eep checking the old mode as well) Signed-off-by: Chris Kulla <ckulla@gmail.com>

Rename to energy_compensation to match MaterialX PR, run clang-foramt

f0d355e

Signed-off-by: Chris Kulla <ckulla@gmail.com>

fpsunflower mentioned this pull request May 20, 2024

Initial support for EON diffuse AcademySoftwareFoundation/MaterialX#1822

Merged

etheory reviewed May 24, 2024

View reviewed changes

fpsunflower added 2 commits May 24, 2024 16:26

Split EON BSDF into its own BSDF to make the renderer side logic simp…

ff061b9

…ler. Take into account the rho parameter as described in the paper. Signed-off-by: Chris Kulla <ckulla@gmail.com>

Run clang-format

4805388

Signed-off-by: Chris Kulla <ckulla@gmail.com>

fpsunflower merged commit 9e3d7e4 into AcademySoftwareFoundation:main May 25, 2024
23 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement new oren nayar model from OpenPBR #1817

Implement new oren nayar model from OpenPBR #1817

fpsunflower commented May 18, 2024

fpsunflower commented May 18, 2024

jstone-lucasfilm commented May 19, 2024

lgritz commented May 20, 2024

jstone-lucasfilm commented May 20, 2024

fpsunflower commented May 20, 2024

portsmouth commented May 20, 2024

portsmouth commented May 20, 2024 •

edited

jstone-lucasfilm commented May 20, 2024

portsmouth commented May 20, 2024 •

edited

jstone-lucasfilm commented May 20, 2024

portsmouth commented May 20, 2024 •

edited

fpsunflower commented May 20, 2024

lgritz commented May 20, 2024

jstone-lucasfilm commented May 21, 2024

etheory commented May 23, 2024 •

edited

fpsunflower commented May 24, 2024

etheory May 24, 2024

etheory May 24, 2024 •

edited

portsmouth May 24, 2024

etheory May 24, 2024

portsmouth May 24, 2024

fpsunflower May 24, 2024

etheory May 24, 2024 •

edited

fpsunflower May 24, 2024

Implement new oren nayar model from OpenPBR #1817

Implement new oren nayar model from OpenPBR #1817

Conversation

fpsunflower commented May 18, 2024

Description

Tests

Checklist:

fpsunflower commented May 18, 2024

jstone-lucasfilm commented May 19, 2024

lgritz commented May 20, 2024

jstone-lucasfilm commented May 20, 2024

fpsunflower commented May 20, 2024

portsmouth commented May 20, 2024

portsmouth commented May 20, 2024 • edited

jstone-lucasfilm commented May 20, 2024

portsmouth commented May 20, 2024 • edited

jstone-lucasfilm commented May 20, 2024

portsmouth commented May 20, 2024 • edited

fpsunflower commented May 20, 2024

lgritz commented May 20, 2024

jstone-lucasfilm commented May 21, 2024

etheory commented May 23, 2024 • edited

fpsunflower commented May 24, 2024

etheory May 24, 2024

Choose a reason for hiding this comment

etheory May 24, 2024 • edited

Choose a reason for hiding this comment

portsmouth May 24, 2024

Choose a reason for hiding this comment

etheory May 24, 2024

Choose a reason for hiding this comment

portsmouth May 24, 2024

Choose a reason for hiding this comment

fpsunflower May 24, 2024

Choose a reason for hiding this comment

etheory May 24, 2024 • edited

Choose a reason for hiding this comment

fpsunflower May 24, 2024

Choose a reason for hiding this comment

portsmouth commented May 20, 2024 •

edited

portsmouth commented May 20, 2024 •

edited

portsmouth commented May 20, 2024 •

edited

etheory commented May 23, 2024 •

edited

etheory May 24, 2024 •

edited

etheory May 24, 2024 •

edited