> About
Hi there! I'm Xianwei, an Associate Professor (2020 - ) in the School of Computer Science & Engineering at Sun Yat-sen University,
researching computer architecture and systems for high-performance and intelligent computing.
During 2017-2020, I worked at AMD Inc. (Research & RTG) on architecture and software designs for compute-optimized GPUs.
Previously, I completed my Ph.D. (2017) in the Computer Science Department at the University of Pittsburgh, and
obtained my Bachelor's degree (2011) in Software Engineering from Northwestern Polytechnical University.
More info can be found on LinkedIn.
> Research
Topics: GPU, Compiling, Memory System, HPC, Intelligent Computing, Simulation/Modeling/Profiling
Grants: National Key R&D Program, NSFC Program, CCF-Tencent®/Huawei®/Phytium® Funds
My research interests lie broadly in hardware/software co-design to improve the performance and efficiency of computing systems.
A particular emphasis is on GPU computing and memory system design, working across architecture, compiler, and runtime to address latency, energy, and bandwidth.
I currently lead the arcSYSu (ARChitecture and SYStem Upscaling @ SYSU) research team, and am fortunate to work with a group of wonderful graduate and undergraduate researchers/interns on computing systems.
[⭐️ Hiring!] You are welcome to join us! [See Q&A for details]
[people @ arcSYSu, refining computing system uses] (#: co-advised)
2024 | Hongxin Xu | Tengyang Zheng# | Gaojin Sun | Lu Wu | Jingyi He | Bingjie Liu
2023 | Mengyue Xi | Han Huang# | Wenyuan Liang | Hengzhong Liang | Wenxuan Pan | Aoyuan Sun | Zhongchun Zheng
2022/21 | Xuanteng Huang[phd]# | Zejia Lin[phd] | Tianyu Guo[phd] | Yuhao Gu[phd]# | Tianyi Zhang | Zhaowen Shan | Chun-yu Chen | Kan Wu[phd]#
Ug/RA | Guanyi Chen | Xianjie Chen | Junru Chen | Yunhao Han | Yibin Luo | Zheng Zhou | Haoquan Chen | Yipeng Ouyang
Alum. | Tianao Ge (ms22, phd@Hkust-gz) | Zewei Mo (ms22, Intel->phd@upitt) | Yue Weng (ms23, Nvidia)# | Yinchuan Guo (ms24, Huawei) | Lianghong Huang (ms24, MetaX)
> Publications
[ see full publication list ]
§ [DAC'24]. T. Guo, X. Huang, K. Wu, X. Zhang and N. Xiao, SMILE: LLC-based Shared Memory Expansion to Improve GPU Thread Level Parallelism
§ [ICCD'23]. Z. Lin, Z. Mo, X. Huang, X. Zhang and Y. Lu, KeSCo: Compiler-based Kernel Scheduling for Multi-task GPU Applications
§ [LCTES'22]. T. Ge, Z. Mo, K. Wu, X. Zhang and Y. Lu, RollBin: Reducing Code-size via Loop Rerolling at Binary Level
§ [MEMSYS'20]. X. Zhang and E. Shcherbakov, DELTA: Validate GPU Memory Profiling with Microbenchmarks
§ [IISWC'19]. T. Ta, X. Zhang, A. Gutierrez and B. Beckmann, Autonomous Data-Race-Free GPU Testing
§ [HPCA'18]. A. Gutierrez, B. Beckmann, A. Dutu, J. Gross, M. LeBeane, J. Kalamatianos, O. Kayiran, M. Poremba, B. Potter, S. Puthoor, M. Sinclair, M. Wyse, J. Yin, X. Zhang, A. Jain and T. Rogers, Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level
§ [PACT'17]. X. Zhang, Y. Zhang, B. Childers and J. Yang, DrMP: Mixed Precision-aware DRAM for High Performance Approximate and Precise Computing
§ [HPCA'16]. X. Zhang, Y. Zhang, B. Childers and J. Yang, Restore Truncation for Performance Improvement in Future DRAM Systems
> Teaching
- Undergraduate
§ DCS290/292 - Compilation Principle & Construction, [24s, 23s, 22s, 21s] (SYsU-lang).
§ DCS3013 - Computer Architecture, [22f].
- Graduate
§ DCS5637/6207 - Advanced Computer Architecture, [24f, 23f, 22f, 21f].
> Miscellaneous
- Honors/Awards
§ [2019] AMD® Spotlight Award
§ [2016] Andrew Mellon Fellowship
§ [2013] Best Paper Award of ISLPED
- Services
§ [ERC] MICRO (IEEE/ACM Int'l Sym. on Microarchitecture) - 2020
§ [TPC] ICCD (IEEE Int’l Conf. on Computer Design) - 2020, 2019, 2018