Welcome, and join us!

Our group is led by Prof. Guangyan Zhang.
We are one of the first teams in China to work on storage systems. Our group has produced around a hundred publications at international conferences such as FAST, USENIX ATC, and EuroSys, as well as in authoritative journals including IEEE TC, IEEE TPDS, and ACM TOS.

Activity

News and our daily activities

News

· 2023-12-29

· 2023-07-17

· 2022-08-12

· 2022-04-01

· 2021-12-11

· 2020-12-10

· 2020-12-10


Research

Current research focuses on Big Data Computing, Storage Systems, and Distributed Systems, especially in:

Cloud Storage

  • Boafft: Distributed Deduplication for Big Data Storage in the Cloud.

RAID and Erasure Codes

  • RAID+: Deterministic and Balanced Data Distribution for Large Disk Enclosures.

Flash and PCM Storage

  • FusionRAID: Achieving Consistent Low Latency for Commodity SSD Arrays.

Storage Virtualization

  • Design and Implementation of an Out-of-Band Virtualization System for Large SANs.

Distributed File Systems

  • Determining Data Distribution for Large Disk Enclosures with 3-D Data Templates.

Deep Learning

  • Performance Analysis of GPU-based Convolutional Neural Networks.

Graph Computing

  • Large graph computing systems.

Stream Computing

  • Building a Fault Tolerant Framework with Deadline Guarantee in Big Data Stream Computing Environments.

Publications

Selected publications include:

  • 2024

    FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning with Partitioning and Parallelism of Search Space
    [PDF]

    Xiaqing Li, Qi Guo, Guangyan Zhang, Siwei Ye, Guanhua He, Rui Zhang, Yifan Hao, Zidong Du, and Weimin Zheng

    Accepted by IEEE Transactions on Parallel and Distributed Systems (TPDS).

  • 2023

    Exploiting Data-pattern-aware Vertical Partitioning to Achieve Fast and Low-cost Cloud Log Storage
    [PDF]

    Junyu Wei, Guangyan Zhang, Junchao Chen, Yang Wang, Weimin Zheng, Tingtao Sun, Jiesheng Wu, Jiangwei Jiang.

    Accepted by ACM Transactions on Storage (TOS), Volume 20, Issue 2, Pages 1-35 (2024).

  • 2023

    Understanding Silent Data Corruptions in a Large Production CPU Population
    [PDF]

    Shaobu Wang, Guangyan Zhang, Junyu Wei, Yang Wang, Jiesheng Wu, and Qingchao Luo.

    In the Proceedings of the 29th ACM Symposium on Operating Systems Principles (SOSP’23), Koblenz, Germany, October 2023. Pages 216-230.

  • 2023

    LogGrep: Fast and Cheap Cloud Log Storage by Exploiting both Static and Runtime Patterns
    [PDF]

    Junyu Wei, Guangyan Zhang, Junchao Chen, Yang Wang, Weimin Zheng, Tingtao Sun, Jiesheng Wu, Jiangwei Jiang.

    In the Proceedings of the 18th European Conference on Computer Systems (EuroSys’23), Rome, Italy, May 2023.

  • 2023

    A Survey on Design and Application of Open-Channel SSDs
    [PDF]

    Junchao Chen, Guangyan Zhang, Junyu Wei.

    Frontiers of Information Technology & Electronic Engineering (FITEE), Volume 24, Pages 637–658 (2023).

  • 2023

    A globally shared resource paradigm for encoded storage systems in the public cloud
    [PDF]

    Zhiyue Li, Guangyan Zhang.

    Fundamental Research.

  • 2022

    ShortTail: Taming Tail Latency for Erasure-code Based In-memory Systems
    [PDF]

    Yun Teng, Zhiyue Li, Jing Huang, Guangyan Zhang.

    Frontiers of Information Technology & Electronic Engineering (FITEE), Volume 23, Pages 1646–1657 (2022).

  • 2022

    Aurogon: Taming Aborts in All Phases for Distributed In-Memory Transactions
    [PDF]

    Tianyang Jiang, Guangyan Zhang, Zhiyue Li, Weimin Zheng.

    In the Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST’22), Santa Clara, CA, February 2022. Pages 217-232.

  • 2021

    FusionRAID: Achieving Consistent Low Latency for Commodity SSD Arrays
    [PDF]

    Tianyang Jiang, Guangyan Zhang, Zican Huang, Xiaosong Ma, Junyu Wei, Zhiyue Li, Weimin Zheng.

    In the Proceedings of the 19th USENIX Conference on File and Storage Technologies (FAST’21), Santa Clara, CA, February 2021. Pages 355-370.

  • 2021

    On the Feasibility of Parser-based Log Compression in Large-Scale Cloud Systems
    [PDF]

    Junyu Wei, Guangyan Zhang, Yang Wang, Zhiwei Liu, Zhanyang Zhu, Junchao Chen, Tingtao Sun, Qi Zhou.

    In the Proceedings of the 19th USENIX Conference on File and Storage Technologies (FAST’21), Santa Clara, CA, February 2021. Pages 249-262.

  • 2021

    SmartTuning: Selecting HyperParameters of a ConvNet System for Fast Training and Small Working Memory
    [PDF]

    Xiaqing Li, Guangyan Zhang, Weimin Zheng.

    IEEE Transactions on Parallel and Distributed Systems, Volume 32, Issue 7, Pages 1690-1701 (2021)

  • 2020

    Boafft: Distributed Deduplication for Big Data Storage in the Cloud
    [PDF]

    Shengmei Luo, Guangyan Zhang, Chengwen Wu, Samee U. Khan, and Keqin Li.

    IEEE Transactions on Cloud Computing, Volume 8, Pages 1199-1211 (2020).

  • 2019

    Determining Data Distribution for Large Disk Enclosures with 3-D Data Templates
    [PDF]

    Guangyan Zhang, Zhufan Wang, Xiaosong Ma, Songlin Yang, Zican Huang, Weimin Zheng.

    ACM Transactions on Storage, Volume 15, Issue 4, Pages 1-38 (2019).

  • 2019

    Dayu: Fast and Low-interference Data Recovery in Very-large Storage Systems
    [PDF]

    Zhufan Wang, Guangyan Zhang, Yang Wang, Qinglin Yang, Jiaji Zhu.

    In the Proceedings of the 2019 USENIX Annual Technical Conference (ATC’19), Renton, WA, July 2019. Pages 993-1007.

  • 2019

    Redio: Accelerating Disk-based Graph Processing by Reducing Disk I/Os
    [PDF]

    Chengwen Wu, Guangyan Zhang, Yang Wang, Xinyang Jiang, Weimin Zheng.

    IEEE Transactions on Computers, Volume 68, Issue 3, Pages 414-425 (2019).

  • 2019

    HyConv: Accelerating Multi-phase CNN Computation by Fine-grained Policy Selection
    [PDF]

    Xiaqing Li, Guangyan Zhang, Zhufan Wang, Weimin Zheng.

    IEEE Transactions on Parallel and Distributed Systems, Volume 30, Issue 2, Pages 388-399 (2019).

  • 2018

    RAID+: Deterministic and Balanced Data Distribution for Large Disk Enclosures
    [PDF]

    Guangyan Zhang, Zican Huang, Xiaosong Ma, Songlin Yang, Zhufan Wang, Weimin Zheng.

    In the Proceedings of the 16th USENIX Conference on File and Storage Technologies (FAST’18), Oakland, CA, February 2018. Pages 279-293.

  • 2018

    Accelerating Breadth-First Graph Search on a Single Server by Dynamic Edge Trimming
    [PDF]

    Guangyan Zhang, Shuhan Cheng, Jiwu Shu, Qingda Hu, and Weimin Zheng.

    Journal of Parallel and Distributed Computing (JPDC), Volume 120, Pages 383-394 (2018).

  • 2017

    Building a Fault Tolerant Framework with Deadline Guarantee in Big Data Stream Computing Environments
    [PDF]

    Dawei Sun, Guangyan Zhang, Chengwen Wu, Keqin Li, and Weimin Zheng.

    Journal of Computer and System Sciences, Volume 89, Pages 4-23 (2017).

  • 2016

    Xscale: Online X-code RAID-6 Scaling Using Lightweight Data Reorganization
    [PDF]

    Guangyan Zhang, Guiyong Wu, Yu Lu, Jie Wu, and Weimin Zheng.

    IEEE Transactions on Parallel and Distributed Systems, Volume 27, Issue 12, Pages 3687-3700 (2016).

  • 2016

    AsyncStripe: I/O Efficient Asynchronous Graph Computing on a Single Server
    [PDF]

    Shuhan Cheng, Guangyan Zhang, Jiwu Shu, and Weimin Zheng.

    In the Proceedings of the 2016 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS 2016), Pittsburgh, USA, October 2016.

  • 2016

    Performance Analysis of GPU-based Convolutional Neural Networks
    [PDF]

    Xiaqing Li, Guangyan Zhang, H. Howie Huang, Zhufan Wang and Weimin Zheng.

    In the Proceedings of the 45th International Conference on Parallel Processing (ICPP-2016), Philadelphia, PA, USA, August 2016.

  • 2016

    FastBFS: Fast Breadth-First Graph Search on a Single Server
    [PDF]

    Shuhan Cheng, Guangyan Zhang, Jiwu Shu, Qingda Hu, and Weimin Zheng.

    In the Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS’16), Chicago, Illinois, USA, May 2016.

  • 2016

    Rethinking Computer Architectures and Software Systems for Phase Change Memory
    [PDF]

    Chengwen Wu, Guangyan Zhang, and Keqin Li.

    ACM Journal on Emerging Technologies in Computing Systems, Volume 12, Issue 4, Article No. 33 (2016).

  • 2016

    Reconsidering Single Disk Failure Recovery for Erasure Coded Storage Systems: Optimizing Load Balancing in Stack-Level
    [PDF]

    Yingxun Fu, Jiwu Shu, Zhirong Shen, and Guangyan Zhang.

    IEEE Transactions on Parallel and Distributed Systems, Volume 27, Issue 5, Pages 1457-1469 (2016).

  • 2016

    CaCo: An Efficient Cauchy Coding Approach for Cloud Storage Systems
    [PDF]

    Guangyan Zhang, Guiyong Wu, Shupeng Wang, Jiwu Shu, Weimin Zheng, and Keqin Li.

    IEEE Transactions on Computers, Volume 65, Issue 2, Pages 435-447 (2016).

  • 2015

    Re-Stream: real-time and energy-efficient resource scheduling in big data stream computing environments
    [PDF]

    Dawei Sun, Guangyan Zhang, Songlin Yang, Weimin Zheng, Samee Khan, and Keqin Li.

    Information Sciences, Volume 319, Pages 92-112 (2015).

  • 2015

    FastRAQ: A Fast Approach to Range-Aggregate Queries in Big Data Environments
    [PDF]

    Xiaochun Yun, Guangjun Wu, Guangyan Zhang, Keqin Li, and Shupeng Wang.

    IEEE Transactions on Cloud Computing, Volume 3, Issue 2, Pages 206-218 (2015).

  • 2015

    Redistribute Data to Regain Load Balance during RAID-4 Scaling
    [PDF]

    Guangyan Zhang, Jigang Wang, Keqin Li, Jiwu Shu, and Weimin Zheng

    IEEE Transactions on Parallel and Distributed Systems, Volume 26, Issue 1, Pages 219-229 (2015).

  • 2015

    Accelerate RDP RAID-6 Scaling by Reducing Disk I/Os and XOR Operations
    [PDF]

    Guangyan Zhang, Keqin Li, Jingzhe Wang, Weimin Zheng

    IEEE Transactions on Computers, Volume 64, Issue 1, Pages 32-44 (2015).

  • 2014

    Rethinking RAID-5 Data Layout for Better Scalability
    [PDF]

    Guangyan Zhang, Weimin Zheng, Keqin Li

    IEEE Transactions on Computers, Volume 63, Issue 11, Pages 2816-2828 (2014).

  • 2013

    Design and evaluation of a new approach to RAID-0 scaling
    [PDF]

    Guangyan Zhang, Weimin Zheng, and Keqin Li

    ACM Transactions on Storage, Volume 9, Issue 4, Article 11 (November 2013), 31 pages.

  • 2013

    AIP: A Tool for Flexible and Transparent Data Management
    [PDF]

    Guangyan Zhang, Jianping Qiu, Jiwu Shu, Weimin Zheng.

    Science China Information Sciences, Volume 56, Issue 5, Article 052114 (2013), 11 pages.

  • 2011

    FastScale: Accelerate RAID Scaling by Minimizing Data Migration
    [PDF]

    Weimin Zheng, Guangyan Zhang

    In the Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST’11), San Jose, CA, February 2011.

  • 2010

    SOPA: Selecting the Optimal Policy Adaptively
    [PDF]

    Yang Wang, Jiwu Shu, Guangyan Zhang, Wei Xue, Weimin Zheng.

    ACM Transactions on Storage, Volume 6, Issue 2 (2010).

  • 2010

    ALV: A New Data Redistribution Approach to RAID-5 Scaling
    [PDF]

    Guangyan Zhang, Weimin Zheng, and Jiwu Shu.

    IEEE Transactions on Computers, Volume 59, Pages 345-357 (2010).

  • 2007

    Design and Implementation of an Out-of-Band Virtualization System for Large SANs
    [PDF]

    Guangyan Zhang, Jiwu Shu, Wei Xue, and Weimin Zheng.

    IEEE Transactions on Computers, Volume 56, Pages 1654-1665 (2007).

  • 2007

    SLAS: An efficient approach to scaling round-robin striped volumes
    [PDF]

    Guangyan Zhang, Jiwu Shu, Wei Xue, and Weimin Zheng.

    ACM Transactions on Storage, Volume 3, Issue 1, Pages 1-39 (2007).

Faculty

Guangyan Zhang

Associate Professor

Advisees

Current Students

Junyu Wei

Ph.D.

Qinglin Yang

Ph.D.

Zhiyue Li

Ph.D.

Yun Teng

Master

Shaobu Wang

Ph.D.

Shipeng Hu

Ph.D.

Xiao Niu

Master

Sijie Cai

Ph.D.

Xinyuan Zhu

Ph.D.

Maomao Wang

Master

Yuqi Zhou

Master

Alumni

Zican Huang

Master

Zhanyang Zhu

Master

Xiaqing Li

Ph.D.

Zhufan Wang

Master

Songlin Yang

Master

Guiyong Wu

Master

Chengwen Wu

Master

Hanyu He

Master

Qi Wu

Master

Tianyang Jiang

Ph.D.

Junchao Chen

Master

Several doctoral and master's degree positions are available each year. If you are interested in our research fields, please feel free to contact Prof. Guangyan Zhang via email.

Resources

Top conferences in our areas of interest

Top journals in our areas of interest