batch: suppress warning for SIMD loop vectorization failure with clang #2032

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

lgritz wants to merge 1 commit into AcademySoftwareFoundation:main

from lgritz:lg-simdloop

Open

batch: suppress warning for SIMD loop vectorization failure with clang #2032

lgritz wants to merge 1 commit into AcademySoftwareFoundation:main from lgritz:lg-simdloop

+83 −11

Conversation

@lgritz

Copy link

Collaborator

@lgritz lgritz commented Oct 25, 2025 •

edited

Loading

Background: We knew all along that some of the omp SIMD loops that worked
fine with icc, icx, and even gcc, failed to parallelize the loop for clang,
and the warnings issued as a result would break the build in CI where we
use compile options to turn warnings into hard errors.

So we made a OSL_OMP_COMPLEX_SIMD_LOOP macro that would issue the omp
simd pragma for other compilers, but not clang. These were set up by Alex
Wells, who through trial and error found the loops that clang wouldn't vectorize
and so marked them. This was a few years ago, so probably involved clang 14
or so? We hoped that in the future, new clang releases would be better at
vectorizing those loops and we could eventually reduce the number of places
we used the special "anywhere but clang" macro.

After adding a "latest dependencies with clang" test that test against clang 18
(sorry, that's not exactly state of the art, but still several releases newer
than when Alex did this work), it sure seems that FEWER such loops vectorize
properly, so it breaks the build with errors that look like this

/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/build/src/liboslexec/wide/wide_opstring_b8_AVX2.cpp:94:1: error: loop not vectorized: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Werror,-Wpass-failed=transform-warning]
 94 | __OSL_MASKED_OP2(hash, Wi, Ws)(void* wr_, void* ws_, unsigned int mask_value)
 | ^
/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/src/liboslexec/wide/define_opname_macros.h:44:5: note: expanded from macro '__OSL_MASKED_OP2'
 44 | __OSL_CONCAT7(osl_, __OSL_LIBRARY_SELECTOR, NAME, _, A, B, _masked)
 | ^
/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/build/include/OSL/oslversion.h:114:44: note: expanded from macro '__OSL_CONCAT7'
 114 | #define __OSL_CONCAT7(A, B, C, D, E, F, G) __OSL_CONCAT(__OSL_CONCAT6(A,B,C,D,E,F),G)
 | ^
/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/build/include/OSL/oslversion.h:109:28: note: expanded from macro '__OSL_CONCAT'
 109 | #define __OSL_CONCAT(A, B) __OSL_CONCAT_INDIRECT(A,B)
 | ^
/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/build/include/OSL/oslversion.h:108:37: note: expanded from macro '__OSL_CONCAT_INDIRECT'
 108 | #define __OSL_CONCAT_INDIRECT(A, B) A ## B
 | ^
<scratch space>:336:1: note: expanded from here
 336 | osl_b8_AVX2_hash_WiWs_masked
 | ^
1 error generated.

First, I tried using OSL_OMP_COMPLEX_SIMD_LOOP in more places, but eventually
decided that rather than not even try to vectorize (and somehow have to discern
exactly which clang versions could or could not do it with each loop), a
better strategy was to try to vectorize, but simply disable the warning that is
issued when we can't vectorize.

@lgritz lgritz requested a review from AlexMWells

October 25, 2025 19:28

@lgritz lgritz added build / testing / port / CI batch shading labels

Oct 25, 2025

AlexMWells

AlexMWells reviewed

Oct 27, 2025

View reviewed changes

src/liboslexec/wide/wide_opalgebraic.cpp

OSL_OMP_COMPLEX_SIMD_LOOP(simdlen(__OSL_WIDTH))

for (int lane = 0; lane < __OSL_WIDTH; ++lane) {

Vec3 V = wV[lane];

if (wr.mask()[lane]) {

Copy link

Contributor

@AlexMWells AlexMWells Oct 27, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These hardly seem complex, just turning around and calling Vec3::length(), for your build which Vec3 implementation is being used?

Does this version of clang not have a vectorized version of sqrt?
Perhaps there is a required clang compiler flag to allow the SIMD version of sqrt?

If not, perhaps better to conditional dependency around those specific usages for this specific clang version.
Can you reproduce in a godbolt.org?

Copy link

Collaborator Author

@lgritz lgritz Nov 3, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I changed strategy as noted in my PR update. Try to vectorize, but suppress the warning for those loops that were problematic.

@lgritz


 batch: relax SIMD loop for certain operations

53c15bb

When building with a newer version of clang, I uncovered a few more
SIMD pragma loops fail to vectorize on clang.
Since it seems to depend on the compiler version, and I don't want to
disable attempted vectorization, I thought a better strategy was just
to disable the warnings when it fails to vectorize in those specific
places.
Add a "latest dependencies with clang" test, that's how I stumbled
across these in the first place.
Signed-off-by: Larry Gritz <lg@larrygritz.com>

@lgritz lgritz force-pushed the lg-simdloop branch from 4f81bab to 53c15bb Compare

November 3, 2025 00:53

@lgritz

Copy link

Collaborator Author

lgritz commented Nov 3, 2025

I've amended the PR to change strategy -- instead of not trying to vectorize those loops, instead I just suppress the warning that was breaking the build when vectorization failed.

The new description of the PR is as follows:

/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/build/src/liboslexec/wide/wide_opstring_b8_AVX2.cpp:94:1: error: loop not vectorized: the optimizer was unable to perform the requested transformation; the transformation might be disabled or specified as part of an unsupported transformation ordering [-Werror,-Wpass-failed=transform-warning]
 94 | __OSL_MASKED_OP2(hash, Wi, Ws)(void* wr_, void* ws_, unsigned int mask_value)
 | ^
/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/src/liboslexec/wide/define_opname_macros.h:44:5: note: expanded from macro '__OSL_MASKED_OP2'
 44 | __OSL_CONCAT7(osl_, __OSL_LIBRARY_SELECTOR, NAME, _, A, B, _masked)
 | ^
/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/build/include/OSL/oslversion.h:114:44: note: expanded from macro '__OSL_CONCAT7'
 114 | #define __OSL_CONCAT7(A, B, C, D, E, F, G) __OSL_CONCAT(__OSL_CONCAT6(A,B,C,D,E,F),G)
 | ^
/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/build/include/OSL/oslversion.h:109:28: note: expanded from macro '__OSL_CONCAT'
 109 | #define __OSL_CONCAT(A, B) __OSL_CONCAT_INDIRECT(A,B)
 | ^
/home/runner/work/OpenShadingLanguage/OpenShadingLanguage/build/include/OSL/oslversion.h:108:37: note: expanded from macro '__OSL_CONCAT_INDIRECT'
 108 | #define __OSL_CONCAT_INDIRECT(A, B) A ## B
 | ^
<scratch space>:336:1: note: expanded from here
 336 | osl_b8_AVX2_hash_WiWs_masked
 | ^
1 error generated.

@lgritz lgritz changed the title ~~(削除) batch: relax SIMD loop for certain operations (削除ここまで)~~ (追記) batch: suppress warning for SIMD loop vectorization failure with clang (追記ここまで)

Nov 3, 2025

Labels

batch shading build / testing / port / CI

2 participants

@lgritz @AlexMWells

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

batch: suppress warning for SIMD loop vectorization failure with clang #2032

Are you sure you want to change the base?

batch: suppress warning for SIMD loop vectorization failure with clang #2032

Conversation

@lgritz lgritz commented Oct 25, 2025 •

edited

Loading

Uh oh!

Uh oh!

@AlexMWells AlexMWells Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

@lgritz lgritz Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

lgritz commented Nov 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

batch: suppress warning for SIMD loop vectorization failure with clang #2032

Are you sure you want to change the base?

batch: suppress warning for SIMD loop vectorization failure with clang #2032

Conversation

@lgritz lgritz commented Oct 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

@AlexMWells AlexMWells Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

@lgritz lgritz Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

lgritz commented Nov 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

@lgritz lgritz commented Oct 25, 2025 •

edited

Loading