Chuhao Xu, Yiyu Liu, Zijun Li, Quan Chen, Han Zhao, Deze Zeng, Qian Peng, Xueqi Wu, Haifeng Zhao, Senbo Fu, Minyi Guo (2024). Improving the Multi-Tenancy GPU Performance through Adaptive Bubbleless Spatial-Temporal Sharing. In ASPLOS 2024 (CCF-A) (Accepted).
Weihao Cui, Han Zhao, Quan Chen, Ningxin Zheng, Jingwen Leng, Jieru Zhao, Zhuo Song, Tao Ma, Yong Yang, Chao Li, Minyi Guo (2021). Enable Simultaneous DNN Services Based on Deterministic Operator Overlap and Precise Latency Prediction. In SC 2021 (CCF-A).