Wrench-Git
Contributor

This change removes a redundant transformation, mul_tensor->mul_scalar->mul_tensor, and adds a faster BinaryOpInferrer.
With it, aten::mul is almost as fast as torch's in average CPU time.
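
To make the redundant chain concrete, here is a toy Python model of one possible reading of it. It is not the DIPU code: `mul_roundtrip`, `mul_direct`, and `_elementwise` are hypothetical names, and lists stand in for tensors, with a one-element list playing the role of a scalar tensor. It only shows that dropping the detour through a scalar entry point leaves the result unchanged while shortening the call path.

```python
# Toy model only: the names below are hypothetical, not DIPU or DIOPI APIs.

def _elementwise(a, b):
    """Multiply two flat lists; a one-element `b` is broadcast over `a`."""
    if len(b) == 1:
        return [x * b[0] for x in a]
    return [x * y for x, y in zip(a, b)]


def mul_roundtrip(a, b):
    """Assumed 'before' shape: tensor entry -> scalar entry -> tensor kernel."""
    trace = ["mul_tensor"]            # entered through the tensor overload
    if len(b) == 1:                   # operand looks like a scalar tensor
        trace.append("mul_scalar")    # unwrap it and take the scalar path
        s = b[0]
        trace.append("mul_tensor")    # re-wrap and re-enter the tensor path
        return _elementwise(a, [s]), trace
    return _elementwise(a, b), trace


def mul_direct(a, b):
    """Assumed 'after' shape: the tensor kernel handles both cases itself."""
    trace = ["mul_tensor"]
    return _elementwise(a, b), trace


old_result, old_trace = mul_roundtrip([2.0, 3.0, 4.0], [0.5])
new_result, new_trace = mul_direct([2.0, 3.0, 4.0], [0.5])
print(old_trace, new_trace, old_result == new_result)
```

In the real change the saving would come from skipped dispatch and shape-inference work on the host side, which this sketch models only as the length of `trace`.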

@Wrench-Git Wrench-Git added enhancement New feature or request DIPU DIPU related labels Jul 11, 2024
@Wrench-Git Wrench-Git force-pushed the faster_mul branch 2 times, most recently from a51cfbb to ad2e7e0 Compare July 11, 2024 05:40
@fandaoyi fandaoyi requested a review from yangbofun July 12, 2024 10:45
@lljbash lljbash changed the title faster aten::mul in cuda perf(dipu): faster aten::mul in cuda & muxi Jul 16, 2024
@mrdanielw mrdanielw merged commit 8738b0c into DeepLink-org:main Jul 16, 2024
26 of 29 checks passed
Wrench-Git added a commit to DeepLink-org/deeplink.framework.dev that referenced this pull request Jul 16, 2024
* faster aten::mul in cuda

* improve the code format

* loosen the check for scalar tensor

* improve the code

* let the logic of mul differ across devices.

* Update autogen_diopi_wrapper.py

* Update diopi_functions.yaml

* Update OpUtils.hpp
zhangzefeng92 pushed a commit to DeepLink-org/deeplink.framework.dev that referenced this pull request Jul 18, 2024
update ../../impl/ascend/device_configs.py
4 participants