Research Interests


I am a researcher in the field of high performance computing with a focus on data compression and sparse matrix storage. My research interests include: HPC, data compression, sparse matrix storage, sparse algorithms, CUDA programming, and performance portability. I am currently working on the development of high performance compression algorithms for the FZ Project and the IVSparse Library. I have also had experience as an undergraduate research intern at Sandia National Laboratories working on the E3SM atmospheric model, HOMME (High-Order Method Modeling Environment).

Projects


FZ

The FZ Project is a lossy error-bound lossy compression framework funded by the NSF. These compressors see use in many fields such as AI, quantum simulation, scientific computing, and instrumentation.

I am currently working on the development team of the IU FZ Team under the advisement of Dr. Fengguang Song. Thus far I’ve developed the FZModules library, a modular framework for constructing bespoke lossy compression pipelines. FZModules is designed for use by domain scientists and compression researchers to develop compression pipelines for specific applications.

FZModules Github

FZModules Publication

Rate distortion curves for different compression pipelines highlighted in the paper.

cuSZ Github

FZ Website

IVSparse (Primary Co-Contributor)

The IVSparse Library is a C++ sparse matrix library optimized for the compression of sparse data in cases where non-zero values are highly redundant. IVSparse supports three main compression formats.

  • CSC: Compressed Sparse Column - The standard sparse matrix format that stores the column indices of non-zero values and the values themselves.
  • VCSC: Value Compressed Sparse Column - A format that takes advantage of the redundancy in the values of the matrix by storing unique values and their counts.
    • ~2.25x fold compression over CSC (for redundant data)
  • IVCSC: Indexed Value Compressed Sparse Column - A format that further compresses the index storage of VCSC by using positive-delta encoding and then bytepacking indices for each unique value in a column.
    • ~7.5x fold compression over CSC (for redundant data)
Sizes in GB of a random large matrix for different compression formats.

IVSparse Github

BigData ‘24 Slides

Publications


  • [SC Workshops ‘25 (DRBSD)] Skyler Ruiter, Jiannan Tian, Fengguang Song. “FZModules: A Heterogenous Computing Framework for Customizable Data Compression Pipelines.” 2025 IEEE/ACM The 11th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD). St Louis, MO, USA. November 16-21. arxiv

  • [BigData ‘24] Seth Wolfgang, Skyler Ruiter, Marc Tunnell, Timothy Triche Jr, Erin Carrier, Zachary DeBruine. “Value-Compressed Sparse Column (VCSC): Sparse Matrix Storage for Single-cell Omics Data.” 2024 IEEE International Conference on Big Data (BigData). Washington, DC, USA. December 15-18. arxiv

Talks and Pictures


SC25 Photo. Seth Wolfgang (left), Jack Dongarra (middle), Me (right).
FZ ZF Joint Workshop at Washington DC. I'm pictured back middle left.
Dr. Song's research group out for dinner.
FZ ZF Joint Workshop at Sarasota FL. I'm pictured top right.
GVSU Computing Seminar Series IVSparse Talk
Seth Wolfgang (right) and I (left) preseting IVSparse at the 2024 Data Compression Conference (DCC).
Seth Wolfgang (right) and I (left) preseting IVSparse at the 2024 Winter GVSU Student Scholars Day to the university president.
Seth Wolfgang (left) and I (right) preseting IVSparse at the 2024 Grand Rapids Tech Week.