Associate Professor
School of Computer Science & Engineering

Sun Yat-sen University

Guangzhou, China 51005

Email: zhangxw79 AT mail.sysu.edu.cn

> About

Xianwei is an Associate Professor (2020 - ) at Sun Yat-sen University. During 2017-2020, he worked at AMD Inc. (Research & RTG) on hardware and software designs for compute-optimized GPUs. He completed his Ph.D. (2017) in the Computer Science Department at the University of Pittsburgh, and obtained his Bachelor's degree (2011) from Northwestern Polytechnical University. More info can be found on LinkedIn.

> Research

Topics: GPU, Compiling, Memory System, HPC, Intelligent Computing, Simulation/Modeling/Profiling
Xianwei's research interests lie broadly in hardware/software co-design to improve the performance and efficiency of computing systems, with a particular emphasis on GPU computing and memory system design around the critical aspects of latency, energy, and bandwidth.

You are welcome to join arcSYSu (ARChitecture and SYStem Upscaling @ SYSU) [see Q&A for details]

[people @ arcSYSu, refining computing system uses] (#: co-advise)

2023 Mengyue Xi Han Huang# Wenyuan Liang Hengzhong Liang
Wenxuan Pan Aoyuan Sun Zhongchun Zheng
2022/21 Xuanteng Huang[phd]# Zejia Lin[phd] Tianyu Guo[phd] Yuhao Gu[phd]#
Tianyi Zhang Zhaowen Shan Chun-yu Chen Kan Wu[phd]#
Alum. Tianao Ge (ms22, phd@Hkust-gz) Zewei Mo (ms22, Intel->phd@upitt)
Yue Weng (ms23, Nvidia)# Yinchuan Guo (ms24, Huawei)
Lianghong Huang (ms24, MetaX)

> Teaching

- Undergraduate
§ DCS290/292 - Compilation Principle & Construction, [24s, 23s, 22s, 21s] (SYsU-lang).
§ DCS3013 - Computer Architecture, [22f].
- Graduate
§ DCS5637/6207 - Advanced Computer Architecture, [23f, 22f, 21f].

> Publications

[ see full publication list ]
§ [DAC'24]. T. Guo, X. Huang, K. Wu, X. Zhang and N. Xiao, SMILE: LLC-based Shared Memory Expansion to Improve GPU Thread Level Parallelism
§ [ICCD'23]. Z. Lin, Z. Mo, X. Huang, X. Zhang and Y. Lu, KeSCo: Compiler-based Kernel Scheduling for Multi-task GPU Applications
§ [LCTES'22]. T. Ge, Z. Mo, K. Wu, X. Zhang and Y. Lu, RollBin: Reducing Code-size via Loop Rerolling at Binary Level
§ [MEMSYS'20]. X. Zhang and E. Shcherbakov, DELTA: Validate GPU Memory Profiling with Microbenchmarks
§ [IISWC'19]. T. Ta, X. Zhang, A. Gutierrez and B. Beckmann, Autonomous Data-Race-Free GPU Testing
§ [HPCA'18]. A. Gutierrez, B. Beckmann, A. Dutu, J. Gross, M. LeBeane, J. Kalamatianos, O. Kayiran, M. Poremba, B. Potter, S. Puthoor, M. Sinclair, M. Wyse, J. Yin, X. Zhang, A. Jain and T. Rogers, Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level
§ [PACT'17]. X. Zhang, Y. Zhang, B. Childers and J. Yang, DrMP: Mixed Precision-aware DRAM for High Performance Approximate and Precise Computing
§ [HPCA'16]. X. Zhang, Y. Zhang, B. Childers and J. Yang, Restore Truncation for Performance Improvement in Future DRAM Systems

> Miscellaneous

- Honors/Awards
§ [2019] AMD® Spotlight Award
§ [2016] Andrew Mellon Fellowship
§ [2013] Best Paper Award of ISLPED
§ [2009] Tencent® Technology Excellence Scholarship
- Services
§ [ERC] MICRO (IEEE/ACM Int'l Symp. on Microarchitecture) - 2020
§ [TPC] ICCD (IEEE Int'l Conf. on Computer Design) - 2020, 2019, 2018