-
Notifications
You must be signed in to change notification settings - Fork 66
[release/2.5] aten::copy optimization (revised) #2032
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: release/2.5
Are you sure you want to change the base?
Conversation
Jenkins build for 616d924f4f85aee911536bcc24184c774e190957 commit finished as FAILURE Detected error during Pytorch building:
|
Jenkins build for d5407e4531d0137b2d02d1c4a9c1a51616e3b05b commit finished as FAILURE Detected error during Pytorch building:
|
I just confirmed that this builds locally. |
Jenkins build for d5407e4531d0137b2d02d1c4a9c1a51616e3b05b commit finished as FAILURE Detected error during Pytorch building:
|
This PR cherry-picks all the aten::copy optimization from 2.5_perf_fix branch into 2.5 release branch. cc: @coderfeli --------- Co-authored-by: feli <[email protected]> Co-authored-by: coderfeli <[email protected]>
This PR replaces several calls to `at::cuda::getCurrentDeviceProperties()->gcnArchName` and `at::cuda::getDeviceProperties(device_index)->gcnArchName` when checking to see if the GPU architecture is in a certain list. Pull Request resolved: pytorch#150473 Approved by: https://github.com/jeffdaily, https://github.com/cyyever
Jenkins build for 6b8f6cf245754ac855302db81a2685ae0ecc737a commit finished as FAILURE |
Upstream this change only for MI300 or only MI hardware. |
Jenkins build for 6b8f6cf245754ac855302db81a2685ae0ecc737a commit finished as FAILURE |
Jenkins build for 6b8f6cf245754ac855302db81a2685ae0ecc737a commit finished as FAILURE |
Jenkins build for 6b8f6cf245754ac855302db81a2685ae0ecc737a commit finished as FAILURE |
Jenkins build for ae9b843fda825a78eedef6dca97bc1852a73e40e commit is in progress |
Jenkins build for ae9b843fda825a78eedef6dca97bc1852a73e40e commit is in progress |
This PR is the same as #1862 with an extra condition to make sure the new code path is not taken on an gfx1100 machine which failed during tests.