Inlining + target_feature broken in powerpc64 #60637
Labels
A-LLVM
Area: Code generation parts specific to LLVM. Both correctness bugs and optimization-related issues.
A-SIMD
Area: SIMD (Single Instruction Multiple Data)
C-bug
Category: This is a bug.
O-PowerPC
Target: PowerPC processors
T-compiler
Relevant to the compiler team, which will review and decide on the PR/issue.
Comments
This is an issue on the LLVM side. See this comment. I think this happens on all architectures except ARM and x86.
This was resolved in llvm/llvm-project#117493 and llvm/llvm-project#126562.
sayantn added a commit to sayantn/stdarch that referenced this issue (Apr 13, 2025)
sayantn added a commit to sayantn/stdarch that referenced this issue (Apr 13, 2025)
sayantn added a commit to sayantn/stdarch that referenced this issue (Apr 16, 2025)
sayantn added a commit to sayantn/stdarch that referenced this issue (Apr 17, 2025)
sayantn added a commit to sayantn/stdarch that referenced this issue (Apr 21, 2025)
sayantn added a commit to sayantn/stdarch that referenced this issue (Apr 21, 2025)
sayantn added a commit to sayantn/stdarch that referenced this issue (Apr 21, 2025)
sayantn added a commit to sayantn/stdarch that referenced this issue (Apr 21, 2025)
There are still some optimization issues on Darwin and iOS.
godbolt
With #[inline], that code produces a function call within splat_u32x4 (b example::u32x4::splat) to u32x4::splat, which is not eliminated, even though this method is module private. With #[inline(always)], u32x4::splat is inlined into splat_u32x4, and no code for u32x4::splat is generated. #[inline] should not be needed here, much less #[inline(always)], yet without #[inline(always)] this produces bad codegen.

Removing the #[target_feature] attribute from splat_u32x4 fixes the issue, no #[inline] necessary: godbolt. So there must be some interaction between inlining and target features going on here.
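
The original code sample did not survive on this page, so here is a minimal sketch of the pattern the report describes. It is not the original reproducer. The names u32x4, splat and splat_u32x4 come from the text above; the struct layout, the return type and the choice of feature are assumptions. The feature is only applied on x86_64 (sse2) so that the sketch builds on stable Rust; on powerpc64 one would presumably enable a PowerPC feature such as altivec instead, which needs a nightly compiler.

```rust
// Hypothetical reconstruction of the reported pattern, not the original code.
#[allow(non_camel_case_types)]
#[derive(Clone, Copy)]
struct u32x4([u32; 4]); // assumed layout; the real type may be a SIMD vector

impl u32x4 {
    // Module-private helper. Per the report, #[inline] alone left a call to
    // this method in the caller on powerpc64; #[inline(always)] removed it.
    #[inline]
    fn splat(x: u32) -> Self {
        u32x4([x; 4])
    }
}

// The caller carries #[target_feature]; per the report, removing the attribute
// made the call to u32x4::splat disappear without any #[inline] hint.
// Assumption: sse2 on x86_64 only, so that this builds on stable Rust.
#[cfg_attr(target_arch = "x86_64", target_feature(enable = "sse2"))]
pub unsafe fn splat_u32x4(x: u32) -> [u32; 4] {
    u32x4::splat(x).0
}

fn main() {
    // SAFETY: sse2 is part of the x86_64 baseline; on other architectures
    // no target feature is applied to splat_u32x4 in this sketch.
    let v = unsafe { splat_u32x4(7) };
    println!("{:?}", v);
}
```

To look for the behaviour described above, one could compile with optimizations for a powerpc64 target and check whether the assembly for splat_u32x4 still contains a branch to u32x4::splat.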