Table of Contents

Wenlai Zhao

photo.jpeg

Education Background

Time Degree University Supervisor
2011 - 2018 Ph.D in Computer Science & Technology Tsinghua University Prof.Guangwen Yang & Prof.Haohuan Fu
2015 - 2016 Visiting Ph.D in Custom Computing Imperial College London Prof. Wayne Luk
2007 - 2011 Bachelor in Computer Science & Technology Tsinghua University

Work & Research Experience

Time Institute Position Research Area
2020 - now Department of Computer Science, Tsinghua University Assistant Professor High Performance AI Systems & Applications
2018 - now National Supercomputing Center in Wuxi (NSCCWX) Deputy Director of AI R&D Department AI Supercomputer Software Stack
2018 - 2020 Department of Computer Science, Tsinghua University Postdoc Research Fellow High Performance AI Systems
2016 - 2018 National Supercomputing Center in Wuxi (NSCCWX) Leader of AI R&D Group AI Platform on the Sunway TaihuLight Supercomputer

Academic Service

Time Organization Position
2018 ACM Transactions on Reconfigurable Technology and Systems (TRETS) Invited Reviewer
2019 International Conference on Computational Science (ICCS2019) Program Committee Member (Multiscale Modelling and Simulation Workshop)
2020 The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC 2020) Program Committee Member (Machine Learning and System)
2020 IEEE Transactions on Parallel and Distributed Systems (TPDS) Special Section on AI/ML/DL Invited Reviewer

Publications

More details on Google Scholar or dblp.

  1. Teng Yu, Wenlai Zhao*, Pan Liu, etc., "Large-Scale Automatic K-Means Clustering for Heterogeneous Many-Core Supercomputer"[J]. IEEE Transactions on Parallel and Distributed Systems (TPDS) 2020
  2. Yixue Hao, Min Chen, Donggang Cao, Wenlai Zhao, Ivan Petrov, Vitaly Antonenko, Ruslan Smeliansky, "Cognitive-Caching: Cognitive Wireless Mobile Caching by Learning Fine-Grained Caching-Aware Indicators"[J], IEEE Wireless Communications 2020
  3. Liang Qiao, Hongkun Yu, Kunpeng Wang, Ruixin Sun, Wenlai Zhao*, Guangwen Yang, "Large-scale Parallel Design for Cryo-EM Structure Determination on Heterogeneous Many-core Architectures"[C]. Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2019
  4. Ouyi Li, Wenlai Zhao*, Xuancheng Huang, etc., "Scaling the Training of Recurrent Neural Networks on Sunway TaihuLight Supercomputer"[C]. International Conference on Computational Science (ICCS) 2019
  5. Wei Gao, Jiarui Fang, Wenlai Zhao, Jinzhe Yang, etc., "swATOP: Automatically optimizing deep learning operators on SW26010 many-core processor"[C]. Proceedings of the 48th International Conference on Parallel Processing (ICPP) 2019
  6. Kunpeng Wang, Shizhen Xu, Haohuan Fu, Hongkun Yu, Wenlai Zhao, Guangwen Yang, Parallelizing Cryo-EM 3D Reconstruction on GPU Cluster with A Partitioned and Streamed Model[C]. Proceedings of the ACM International Conference on Supercomputing (ICS) 2019
  7. Wenlai Zhao, Haohuan Fu, Jiarui Fang, etc., "Optimizing Convolutional Neural Networks on Sunway TaihuLight Supercomputer"[J], ACM Transactions on Architecture and Code Optimization (TACO) 2018
  8. Liandeng Li, Teng Yu, Wenlai Zhao, Haohuan Fu, etc., "Large-Scale Hierarchical k-means for Heterogeneous Many-Core Supercomputers"[C], Supercomputing (SC) 2018
  9. Jiarui Fang, Haohuan Fu, Wenlai Zhao, Bingwei Chen, Weijie Zheng, Guangwen Yang, "swDNN: A Library for Accelerating Deep Learning Applications on Sunway TaihuLight Supercomputer"[C], 31st IEEE International Parallel \& Distributed Processing Symposium (IPDPS) 2017
  10. Wenlai Zhao, Haohuan Fu, Wayne Luk, and etc. "F-CNN: An FPGA-based Framework for Training Convolutional Neural Networks"[C], Application-specific Systems, Architectures and Processors (ASAP) 2016
  11. Wenlai Zhao, Haohuan Fu, Wayne Luk and Guangwen Yang, "Patra: Parallel Tree-reweighted Message Passing Architecture"[C], Field Programmable Logic and Applications (FPL) 2014
  12. Wenlai Zhao, Haohuan Fu and Guangwen Yang, "A Fully-Pipelined FPGA Design for Tree-reweighted Message Passing Algorithm"This is the caption[C], Field-Programmable Custom Computing Machines (FCCM) 2014