-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wrong result for miniMDock #726
Comments
My guess is the masked __shfl() which, I think, is currently not supported by chipStar. |
chipStar’s warp size is 32 by default. It does not seem that miniMDock’s HIP code accounts this size. For example, the WARPMINIMUMEXCHANGE call here will definitely compute out-of-bounds warp indices for If the Intel-PVC supports 64 wide subgroups in Level Zero or OpenCL, you could try changing chipStar’s warp size to it with FYI, I had trouble compiling miniMDock for chipStar because of |
I think it calls ballot which is not supported in v1.2, but is supported by Intel's driver regardless. I've patched that in the chipStar LLVM-SPIRV branch so it generates it regardless and hopes for the best: CHIP-SPV/SPIRV-LLVM-Translator@0d66986 @mathialakan for the masked shfl, currently there is no easy workaround, but just try to reorganize the client code to use uniform shuffles. |
@pjaaskel should we close this and/or open a new issue with for implementing masked shfl support? |
Hi
The HIP code of miniMDock (https://github.com/ORNL-PE/miniMDock/tree/sycl_dev ) is working perfectly in AMD systems, but on Intel-PVC systems, it builds successfully but giving wrong results. The results obtained for the 7cpa test case ( -lfile ./input/7cpa/7cpa_ligand.pdbqt -nrun 10) on Arognne's sunspot using iprof is attached here,
chipstar_iprof_sunspot.pdf
The correct result should be look like this, 7cpa_results.pdf
The code is using warp-level shuffle intrinsics for reduction (device/hip/kernels.cpp) and the code is able to setup warp/wavefront/subgroup size based on the running-system.
Would you please help me to resolve the issues related to this build.
Thanks
Mathi
The text was updated successfully, but these errors were encountered: