Commit 8f01149
* Set MXCC_OVERRIDE_OPTIONS in compile script
Add MXCC_OVERRIDE_OPTIONS for metax GPU compilation.
* Add MXCC_OVERRIDE_OPTIONS for Metax GPU
* Update flash_attn_grad_kernel.cu
* Update compile.sh
* [Metax][feat] add top_p_sampling.patch. (#225)
* [Metax] Fix add flags
---------
Co-authored-by: duqimeng <77875733+duqimeng@users.noreply.github.com>
Co-authored-by: MingkunZhang <39252862+StareAtYou@users.noreply.github.com>
1 parent 8f3743f commit 8f01149
2 files changed
Lines changed: 30 additions & 1 deletion
- cmake/cupti.cmake+12-2
- paddle/common/flags.cc+30
- paddle/fluid/pybind/eager.h+13
- paddle/fluid/pybind/eager_py_layer.cc+185-14
- paddle/phi/CMakeLists.txt+7-2
- paddle/phi/backends/dynload/rocm_driver.cc+2
- paddle/phi/kernels/cpu/elementwise.h+1-19
- paddle/phi/kernels/funcs/distribution_helper.h+29-14
- paddle/phi/kernels/funcs/dropout_impl.cu.h+26-13
- paddle/phi/kernels/funcs/rng_launch_config.h+58
- paddle/phi/kernels/fusion/gpu/fused_dropout_add_utils.h+24-11
- paddle/phi/kernels/fusion/xpu/fused_rope_utils.h+10-4
- paddle/phi/kernels/gpu/interpolate_grad_kernel.cu+2-1
- paddle/phi/kernels/gpu/interpolate_kernel.cu+2-1
- paddle/phi/kernels/gpu/layer_norm_kernel.cu+3-2
- paddle/phi/kernels/gpu/rms_norm_cuda_kernel.h+144-10
- paddle/phi/kernels/stride/matmul_stride_kernel.cu+1-1
- python/paddle/distributed/fleet/meta_optimizers/muon_sharding_optimizer.py+15-5
- python/paddle/distributed/fleet/meta_parallel/pipeline_parallel.py+129-31
- python/paddle/distributed/fleet/meta_parallel/pp_utils/p2p_communication.py+21-4
- python/paddle/distributed/fleet/recompute/recompute.py+33-84
- python/paddle/distributed/fleet/recompute/recompute_hybrid.py+2-6
- python/paddle/optimizer/muon.py+72-55
- python/paddle/tensor/ops.py+66
- test/collective/fleet/hybrid_parallel_sharding_muon_model.py+87-3
- test/collective/fleet/test_parallel_dygraph_muon.py+46
- test/legacy_test/test_pylayer_clear_dataptr.py+700
- test/legacy_test/test_recompute_with_tuple_input.py+48-453
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
116 | 116 | | |
117 | 117 | | |
118 | 118 | | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
119 | 148 | | |
120 | 149 | | |
121 | 150 | | |
| |||
0 commit comments