Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cutlass 3.x gemm on sm90 #9398

Open
wants to merge 10 commits into
base: develop
Choose a base branch
from
Open

cutlass 3.x gemm on sm90 #9398

wants to merge 10 commits into from

Conversation

ckl117
Copy link
Contributor

@ckl117 ckl117 commented Nov 8, 2024

PR types

Performance optimization

PR changes

Others

Description

增加cutlass 3.x FP8 GEMM 代码生成和调优;
默认不开启调优,通过环境变量FLAGS_use_cutlass_device_best_config_path=tune控制cutlass FP8 gemm调优功能,如果未设置变量、为空或者defalut则使用默认配置,否则使用指定的json文件配置,增加易用性;
去除nvcc编译选项-O3,该选项或造成部分cutlass 配置失效;

Copy link

paddle-bot bot commented Nov 8, 2024

Thanks for your contribution!

@DrownFish19
Copy link
Collaborator

安装方式已经变化,setup_cuda中cutalss已经通过submodule添加

Copy link

codecov bot commented Nov 8, 2024

Codecov Report

Attention: Patch coverage is 0% with 7 lines in your changes missing coverage. Please review.

Project coverage is 52.94%. Comparing base (4b02477) to head (a4f85e7).

Files with missing lines Patch % Lines
...erimental/transformers/fused_transformer_layers.py 0.00% 7 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9398      +/-   ##
===========================================
+ Coverage    52.93%   52.94%   +0.01%     
===========================================
  Files          688      688              
  Lines       109379   109384       +5     
===========================================
+ Hits         57899    57917      +18     
+ Misses       51480    51467      -13     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@DrownFish19
Copy link
Collaborator

PaddleNLP-CI跳过了此处测试,合入需验证版本升级带来影响。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants