Result Number | Material Type | Add to My Shelf Action | Record Details and Options |
---|---|---|---|
11 | Article | | Improved Multi-GPU parallelization of a Lagrangian Transport Model. Bolarinwa, Saheed. arXiv.org, 2022-11. Ithaca: Cornell University Library, arXiv.org. Full text available. |
12 | Article | | Expression Acceleration: Seamless Parallelization of Typed High-Level Languages. Hummelgren, Lars; Wikman, John; Eriksson, Oscar; Haller, Philipp; Broman, David. arXiv.org, 2024-07. Ithaca: Cornell University Library, arXiv.org. Full text available. |
13 | Article | | The Solution for the AIGC Inference Performance Optimization Competition. Pan, Sishun; Xu, Haonan; Wan, Zhonghua; Yang, Yang. arXiv.org, 2024-07. Ithaca: Cornell University Library, arXiv.org. Full text available. |
14 | Article | | Parm: Efficient Training of Large Sparsely-Activated Models with Dedicated Schedules. Pan, Xinglin; Lin, Wenxiang; Shi, Shaohuai; Chu, Xiaowen; Sun, Weinong; Li, Bo. arXiv.org, 2024-07. Ithaca: Cornell University Library, arXiv.org. Full text available. |
15 | Article | | On the Performance and Memory Footprint of Distributed Training: An Empirical Study on Transformers. Lu, Zhengxian; Wang, Fangyu; Xu, Zhiwei; Yang, Fei; Li, Tao. arXiv.org, 2024-07. Ithaca: Cornell University Library, arXiv.org. Full text available. |
16 | Article | | GPU-friendly Stroke Expansion. Levien, Raph; Uguray, Arman. arXiv.org, 2024-06. Ithaca: Cornell University Library, arXiv.org. Full text available. |
17 | Article | | Speculative Path Planning. Bakhshalipour, Mohammad; Qadri, Mohamad; Guri, Dominic. arXiv.org, 2021-02. Ithaca: Cornell University Library, arXiv.org. Full text available. |
18 | Article | | LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism. Gu, Diandian; Sun, Peng; Hu, Qinghao; Huang, Ting; Chen, Xun; Xiong, Yingtong; Wang, Guoteng; Chen, Qiaoling; Zhao, Shangchun; Fang, Jiarui; Wen, Yonggang; Zhang, Tianwei; Jin, Xin; Liu, Xuanzhe. arXiv.org, 2024-06. Ithaca: Cornell University Library, arXiv.org. Full text available. |
19 | Article | | On the Confluence of Directed Graph Reductions Preserving Feedback Vertex Set Minimality. Abdenbi, Moussa; Blondin Massé, Alexandre; Goupil, Alain; Marcotte, Odile. arXiv.org, 2024-06. Ithaca: Cornell University Library, arXiv.org. Full text available. |
20 | Article | | Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve. Agrawal, Amey; Kedia, Nitin; Panwar, Ashish; Mohan, Jayashree; Kwatra, Nipun; Gulavani, Bhargav S; Tumanov, Alexey; Ramjee, Ramachandran. arXiv.org, 2024-06. Ithaca: Cornell University Library, arXiv.org. Full text available. |