=====Wenlai Zhao===== {{my:photo.jpeg?100*100 }} * **Email**: * **Address**: Rm. S814, Mengminwei Tech. Building, Tsinghua University, Beijing, China * **Linkedin**: https://www.linkedin.com/in/cryinlaugh/ ====Education Background==== ^ Time ^ Degree ^ University ^ Supervisor ^ | 2011 - 2018 | Ph.D in Computer Science & Technology | Tsinghua University | Prof.Guangwen Yang & Prof.Haohuan Fu | | 2015 - 2016 | Visiting Ph.D in Custom Computing | Imperial College London | Prof. Wayne Luk | | 2007 - 2011 | Bachelor in Computer Science & Technology | Tsinghua University | | ====Work & Research Experience==== ^ Time ^ Institute ^ Position ^ Research Area ^ | 2020 - now | Department of Computer Science, Tsinghua University | Assistant Professor | High Performance AI Systems & Applications | | 2018 - now | National Supercomputing Center in Wuxi (NSCCWX) | Deputy Director of AI R&D Department | AI Supercomputer Software Stack | | 2018 - 2020 | Department of Computer Science, Tsinghua University | Postdoc Research Fellow | High Performance AI Systems | | 2016 - 2018 | National Supercomputing Center in Wuxi (NSCCWX) | Leader of AI R&D Group | AI Platform on the Sunway TaihuLight Supercomputer | ====Academic Service==== ^ Time ^ Organization ^ Position ^ | 2018 | ACM Transactions on Reconfigurable Technology and Systems (TRETS) | Invited Reviewer | | 2019 | International Conference on Computational Science (ICCS2019) | Program Committee Member (Multiscale Modelling and Simulation Workshop) | | 2020 | The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC 2020) | Program Committee Member (Machine Learning and System) | | 2020 | IEEE Transactions on Parallel and Distributed Systems (TPDS) Special Section on AI/ML/DL | Invited Reviewer | ====Publications==== More details on [[https://scholar.google.com/citations?user=owctMYYAAAAJ|Google Scholar]] or [[https://dblp.org/pers/hd/z/Zhao:Wenlai | dblp]]. - Teng Yu, **Wenlai Zhao***, Pan Liu, etc., {{ my:2020-tpds_autokmeans.pdf | "Large-Scale Automatic K-Means Clustering for Heterogeneous Many-Core Supercomputer"}}[J]. IEEE Transactions on Parallel and Distributed Systems (TPDS) 2020 - Yixue Hao, Min Chen, Donggang Cao, **Wenlai Zhao**, Ivan Petrov, Vitaly Antonenko, Ruslan Smeliansky, {{ my:2020-wirless.pdf | "Cognitive-Caching: Cognitive Wireless Mobile Caching by Learning Fine-Grained Caching-Aware Indicators"}}[J], IEEE Wireless Communications 2020 - Liang Qiao, Hongkun Yu, Kunpeng Wang, Ruixin Sun, **Wenlai Zhao***, Guangwen Yang,{{ my:2019-bibm_swthunder.pdf | "Large-scale Parallel Design for Cryo-EM Structure Determination on Heterogeneous Many-core Architectures"}}[C]. Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2019 - Ouyi Li, **Wenlai Zhao***, Xuancheng Huang, etc., {{ my : 2-5-iccs2019.pdf | "Scaling the Training of Recurrent Neural Networks on Sunway TaihuLight Supercomputer"}}[C]. International Conference on Computational Science (ICCS) 2019 - Wei Gao, Jiarui Fang, **Wenlai Zhao**, Jinzhe Yang, etc., {{ my : icpp19-swatop.pdf | "swATOP: Automatically optimizing deep learning operators on SW26010 many-core processor"}}[C]. Proceedings of the 48th International Conference on Parallel Processing (ICPP) 2019 - Kunpeng Wang, Shizhen Xu, Haohuan Fu, Hongkun Yu, **Wenlai Zhao**, Guangwen Yang, {{my:2019-ics-thundergpu.pdf | Parallelizing Cryo-EM 3D Reconstruction on GPU Cluster with A Partitioned and Streamed Model}}[C]. Proceedings of the ACM International Conference on Supercomputing (ICS) 2019 - **Wenlai Zhao**, Haohuan Fu, Jiarui Fang, etc., {{ my:2-3-taco18.pdf | "Optimizing Convolutional Neural Networks on Sunway TaihuLight Supercomputer"}}[J], ACM Transactions on Architecture and Code Optimization (TACO) 2018 - Liandeng Li, Teng Yu, **Wenlai Zhao**, Haohuan Fu, etc., {{ my:2-4-kmeans.pdf | "Large-Scale Hierarchical k-means for Heterogeneous Many-Core Supercomputers"}}[C], Supercomputing (SC) 2018 - Jiarui Fang, Haohuan Fu, **Wenlai Zhao**, Bingwei Chen, Weijie Zheng, Guangwen Yang, {{ my:2017-ipdps-swdnn.pdf | "swDNN: A Library for Accelerating Deep Learning Applications on Sunway TaihuLight Supercomputer"}}[C], 31st IEEE International Parallel \& Distributed Processing Symposium (IPDPS) 2017 - **Wenlai Zhao**, Haohuan Fu, Wayne Luk, and etc. {{ my:2-fcnn.pdf | "F-CNN: An FPGA-based Framework for Training Convolutional Neural Networks"}}[C], Application-specific Systems, Architectures and Processors (ASAP) 2016 - **Wenlai Zhao**, Haohuan Fu, Wayne Luk and Guangwen Yang, {{ my:1-patra.pdf | "Patra: Parallel Tree-reweighted Message Passing Architecture"}}[C], Field Programmable Logic and Applications (FPL) 2014 - **Wenlai Zhao**, Haohuan Fu and Guangwen Yang, {{ my:fccm2014poster.pdf |"A Fully-Pipelined FPGA Design for Tree-reweighted Message Passing Algorithm"This is the caption}}[C], Field-Programmable Custom Computing Machines (FCCM) 2014