
| index | Title |
| 1 | Memory-efficient tensor parallelism for long-sequence Transformer training Author(s):Peng LIANG, Linbo QIAO, Yanqi SHI,... Clicked:1649 Download:3044 Cited:0 <Full Text> <PPT> 946 Frontiers of Information Technology & Electronic Engineering 2025 Vol.26 No.5 P.770-787 DOI:10.1631/FITEE.2400602 |