THU FASTsys Research Group

    

Introduction

  • Guangyan Zhang is an associate professor in the Department of Computer Science and Technology, where he joined since July 2008. He obtained his Ph.D degree from Tsinghua University under the guidance of Prof. Weimin Zheng and Prof. Jiwu Shu. Before that, he received the bachelor’s and master’s degrees in computer science from Jilin University in 2000 and 2003.
  • He is a Professional Member of the ACM, a member of the CCF technical committee of information storage technology, a Communication Member of CCF Task Force on Big Data.

Education

  • Bachelor of Computer Science, Jilin University, China, 2000
  • Master of Computer Science, Jilin University, China, 2003
  • Ph.D. in Computer Science, Tsinghua University, China, 2008

Research

— His current research focuses on Big Data Computing, Storage Systems, and Distributed Systems, especially in:

  • Cloud Storage
  • RAID and Erasure Codes
  • Flash and PCM Storage
  • Storage Virtualization
  • Distributed File Systems
  • Deep Learning
  • Graph Computing
  • Stream Computing
  • Benchmarking

Courses

— He teaches three courses, one of which is for graduate students, and the others are for undergraduate students.

  • CS 70240013: Advanced Computer Architecture, for Ph.D and master students,
  • CS 40240443: Computer Architecture, for undergraduate students, and
  • CS 30240522: Program Design and Training, for undergraduate students.

Publications

  • Flash-oriented Coded Storage: Research Status and Future Directions

    [PDF]

    Zhiyue Li, Guangyan Zhang, Yang Wang.

    Accepted by ACM Transactions on Storage (TOS),

  • Understanding Silent Data Corruption in Processors for Mitigating its Effects

    [PDF]

    Shaobu Wang, Guangyan Zhang, Junyu Wei, Yang Wang, Jiesheng Wu, and Qingchao Luo.

    ACM Transactions on Architecture and Code Optimization (TACO),

  • StreamCache: Revisiting Page Cache for File Scanning on Fast Storage Devices

    [PDF]

    Zhiyue Li, Guangyan Zhang

    In the Proceedings of the 2024 USENIX Annual Technical Conference (ATC’24), Santa Clara, CA, USA, 2024, Pages 1119-1134

  • FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning with Partitioning and Parallelism of Search Space

    [PDF]

    Xiaqing Li, Qi Guo, Guangyan Zhang, Siwei Ye, Guanhua He, Rui Zhang, Yifan Hao, Zidong Du, and Weimin Zheng

    Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 7, Pages 1174-1188 (2024).

  • Exploiting Data-pattern-aware Vertical Partitioning to Achieve Fast and Low-cost Cloud Log Storage

    [PDF]

    Junyu Wei, Guangyan Zhang, Junchao Chen, Yang Wang, Weimin Zheng, Tingtao Sun, Jiesheng Wu, Jiangwei Jiang.

    ACM Transactions on Storage (TOS), Volume 20, Issue 2, Pages 1-35 (2024).

  • Understanding Silent Data Corruptions in a Large Production CPU Population

    [PDF]

    Shaobu Wang, Guangyan Zhang, Junyu Wei, Yang Wang, Jiesheng Wu, and Qingchao Luo.

    In the Proceedings of the 29th ACM Symposium on Operating Systems Principles (SOSP ‘23), Koblenz, Germany, Oct, 2023. Pages 216-230

  • LogGrep: Fast and Cheap Cloud Log Storage by Exploiting both Static and Runtime Patterns

    [PDF]

    Junyu Wei, Guangyan Zhang, Junchao Chen, Yang Wang, Weimin Zheng, Tingtao Sun, Jiesheng Wu, Jiangwei Jiang.

    In the Proceedings of the 18th European Conference on Computer Systems (EuroSys’23), Roma, Italy, May 2023.

  • A Survey on Design and Application of Open-Channel SSDs

    [PDF]

    Junchao Chen, Guangyan Zhang, Junyu Wei.

    Frontiers of Information Technology & Electronic Engineering (FITEE), Volume 24, Pages 637–658 (2023).

  • A globally shared resource paradigm for encoded storage systems in the public cloud

    [PDF]

    Zhiyue Li, Guangyan Zhang.

    Fundamental Research.

  • ShortTail: Taming Tail Latency for Erasure-code Based In-memory Systems

    [PDF]

    Yun Teng, Zhiyue Li, Jing Huang, Guangyan Zhang.

    Frontiers of Information Technology & Electronic Engineering (FITEE), Volume 23, Pages 1646–1657 (2022).

  • Aurogon: Taming Aborts in All Phases for Distributed In-Memory Transactions

    [PDF]

    Tianyang Jiang, Guangyan Zhang, Zhiyue Li, Weimin Zheng.

    In the Proceedings of the 20th USENIX Conference on File and Storage Technologies(FAST’22), Santa Clara, CA, February 2022. Pages 217-232.

  • FusionRAID: Achieving Consistent Low Latency for Commodity SSD Arrays

    [PDF]

    Tianyang Jiang, Guangyan Zhang, Zican Huang, Xiaosong Ma, Junyu Wei, Zhiyue Li, Weimin Zheng.

    In the Proceedings of the 19th USENIX Conference on File and Storage Technologies(FAST’21), Santa Clara, CA, February 2021. Pages 355-370.

  • On the Feasibility of Parser-based Log Compression in Large-Scale Cloud Systems

    [PDF]

    Junyu Wei, Guangyan Zhang, Yang Wang, Zhiwei Liu, Zhanyang Zhu, Junchao Chen, Tingtao Sun, Qi Zhou.

    In the Proceedings of the 19th USENIX Conference on File and Storage Technologies(FAST’21), Santa Clara, CA, February 2021. Pages 249-262.

  • SmartTuning: Selecting HyperParameters of a ConvNet System for Fast Training and Small Working Memory

    [PDF]

    Xiaqing Li, Guangyan Zhang, Weimin Zheng.

    IEEE Transactions on Parallel and Distributed Systems, Volume 32, Issue 7, Pages 1690-1701 (2021)

  • Boafft: Distributed Deduplication for Big Data Storage in the Cloud

    [PDF]

    Shengmei Luo, Guangyan Zhang, Chengwen Wu, Samee U. Khan, and Keqin Li.

    IEEE Transactions on Cloud Computing, Volume 8, Pages 1199-1211 (2020).

  • Determining Data Distribution for Large Disk Enclosures with 3-D Data Templates

    [PDF]

    Guangyan Zhang, Zhufan Wang, Xiaosong Ma, Songlin Yang, Zican Huang, Weimin Zheng.

    ACM Transactions on Storage, Volume 15, Issue 4, Pages 1-38 (2019).

  • Dayu: Fast and Low-interference Data Recovery in Very-large Storage Systems

    [PDF]

    Zhufan Wang, Guangyan Zhang, Yang Wang, Qinglin Yang, Jiaji Zhu.

    In the Proceedings of the 2019 USENIX Annual Technical Conference (ATC’19), Renton, WA, July 2019. Pages 993-1007.

  • Redio: Accelerating Disk-based Graph Processing by Reducing Disk I/Os

    [PDF]

    Chengwen Wu, Guangyan Zhang, Yang Wang, Xinyang Jiang, Weimin Zheng.

    IEEE Transactions on Computers, Volume 68, Issue 3, Pages 414 - 425 (2019).

  • HyConv: Accelerating Multi-phase CNN Computation by Fine-grained Policy Selection

    [PDF]

    Xiaqing Li, Guangyan Zhang, Zhufan Wang, Weimin Zheng.

    IEEE Transactions on Parallel and Distributed Systems, Volume 30, Issue 2, Pages 388 - 399 (2019).

  • RAID+: Deterministic and Balanced Data Distribution for Large Disk Enclosures

    [PDF]

    Guangyan Zhang, Zican Huang, Xiaosong Ma, Songlin Yang, Zhufan Wang, Weimin Zheng.

    In the Proceedings of the 16th USENIX Conference on File and Storage Technologies (FAST’18), Oakland, CA, February 2018. Pages 279-293.

  • Accelerating Breadth-First Graph Search on a Single Server by Dynamic Edge Trimming

    [PDF]

    Guangyan Zhang, Shuhan Cheng, Jiwu Shu, Qingda Hu, and Weimin Zheng.

    Journal of Parallel and Distributed Computing (JPDC), Volume 120, Pages 383-394, (2018).

  • Building a Fault Tolerant Framework with Deadline Guarantee in Big Data Stream Computing Environments

    [PDF]

    Dawei Sun, Guangyan Zhang, Chengwen Wu, Keqin Li, and Weimin Zheng.

    Journal of Computer and System Sciences, Volume 89, Pages 4-23 (2017).

  • Xscale: Online X-code RAID-6 Scaling Using Lightweight Data Reorganization

    [PDF]

    Guangyan Zhang, Guiyong Wu, Yu Lu, Jie Wu, and Weimin Zheng.

    IEEE Transactions on Parallel and Distributed Systems, Volume 27, Issue 12, Pages 3687 - 3700 (2016).

  • AsyncStripe: I/O Efficient Asynchronous Graph Computing on a Single Server

    [PDF]

    Shuhan Cheng, Guangyan Zhang, Jiwu Shu, and Weimin Zheng.

    In the Proceedings of the 2016 International Conference on Hardware - Software Codesign and System Synthesis (CODES+ISSS 2016), Pittsburgh, USA, October 2016.

  • Performance Analysis of GPU-based Convolutional Neural Networks

    [PDF]

    Xiaqing Li, Guangyan Zhang, H. Howie Huang, Zhufan Wang and Weimin Zheng.

    In the Proceedings of the 45th International Conference on Parallel Processing (** ICPP-2016**), Philadelphia, PA USA, August 2016.

  • FastBFS: Fast Breadth-First Graph Search on a Single Server

    [PDF]

    Shuhan Cheng, Guangyan Zhang, Jiwu Shu, Qingda Hu, and Weimin Zheng.

    In the Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS’16), Chicago, Illinois USA, May 2016.

  • Rethinking Computer Architectures and Software Systems for Phase Change Memory

    [PDF]

    Chengwen Wu,Guangyan Zhang, and Keqin Li.

    Journal of Emerging Technologies in Computing Systems, Volume 12, Issue 4, Article No. 33 (2016).

  • Reconsidering Single Disk Failure Recovery for Erasure Coded Storage Systems: Optimizing Load Balancing in Stack-Level

    [PDF]

    Yingxun Fu, Jiwu Shu, Zhirong Shen, and Guangyan Zhang.

    IEEE Transactions on Parallel and Distributed Systems, Volume 27, Issue 5, Pages 1457-1469 (2016).

  • CaCo: An Efficient Cauchy Coding Approach for Cloud Storage Systems

    [PDF]

    Guangyan Zhang, Guiyong Wu, Shupeng Wang, Jiwu Shu, Weimin Zheng, and Keqin Li.

    IEEE Transactions on Computers, Volume 65, Issue 2, Pages 435-447 (2016).

  • Re-Stream: real-time and energy-efficient resource scheduling in big data stream computing environments

    [PDF]

    Dawei Sun, Guangyan Zhang, Songlin Yang, Weimin Zheng, Samee Khan, and Keqin Li.

    Information Sciences, Volume 319, Pages 92-112 (2015).

  • FastRAQ: A Fast Approach to Range-Aggregate Queries in Big Data Environments

    [PDF]

    Xiaochun Yun, Guangjun Wu, Guangyan Zhang, Keqin Li, and Shupeng Wang,

    IEEE Transactions on Cloud Computing, Volume 3, Issue 2, Pages 206-218 (2015).

  • Redistribute Data to Regain Load Balance during RAID-4 Scaling

    [PDF]

    Guangyan Zhang, Jigang Wang, Keqin Li, Jiwu Shu, and Weimin Zheng

    IEEE Transactions on Parallel and Distributed Systems, Volume 26, Issue 1, Pages 219-229 (2015).

  • Accelerate RDP RAID-6 Scaling by Reducing Disk I/Os and XOR Operations

    [PDF]

    Guangyan Zhang, Keqin Li, Jingzhe Wang, Weimin Zheng

    IEEE Transactions on Computers, Volume 64, Issue 1, Pages 32-44 (2015).

  • Rethinking RAID-5 Data Layout for Better Scalability

    [PDF]

    Guangyan Zhang, Weimin Zheng, Keqin Li

    IEEE Transactions on Computers, Volume 63 , Issue 11, Pages 2816-2828 (2014).

  • Design and evaluation of a new approach to RAID-0 scaling

    [PDF]

    Guangyan Zhang, Weimin Zheng, and Keqin Li

    ACM Transactions on Storage, 9, 4, Article 11 (November 2013), 31 pages.

  • AIP: A Tool for Flexible and Transparent Data Managemen

    [PDF]

    Guangyan Zhang, Jianping Qiu, Jiwu Shu, Weimin Zheng.

    SCIENCE CHINA, Information Sciences, 2013, 56(5):052114(11).

  • FastScale: Accelerate RAID Scaling by Minimizing Data Migration

    [PDF]

    Weimin Zheng, Guangyan Zhang

    In the Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST’11), San Jose, CA, February 2011.

  • SOPA: Selecting the Optimal Policy Adaptively

    [PDF]

    Yang Wang, Jiwu Shu, Guangyan Zhang, Wei Xue, Weimin Zheng.

    ACM Transactions on Storage, Volume 6, Issue 2 (2010).

  • ALV: A New Data Redistribution Approach to RAID-5 Scaling

    [PDF]

    Guangyan Zhang, Weimin Zheng, and Jiwu Shu.

    IEEE Transactions on Computers, Volume 59, Pages 345-357 (2010).

  • Design and Implementation of an Out-of-Band Virtualization System for Large SANs

    [PDF]

    Guangyan Zhang, Jiwu Shu, Wei Xue, and Weimin Zheng.

    IEEE Transactions on Computers, Volume 56, Pages 1654-1665 (2007).

  • SLAS: An efficient approach to scaling round-robin striped volumes

    [PDF]

    Guangyan Zhang, Jiwu Shu, Wei Xue, and Weimin Zheng.

    ACM Transactions on Storage, Volume 3, Issue 1, Pages 1-39 (2007).