Paper Search Results


AuthorId: 2122804649
Limit: 10
Sort by: score
Embedding: s2_recommendations

Author(s): Shaoyi Huang
citationCount | Paper | Authors | year
82 | Accelerating Transformer-based Deep Learning Models on FPGAs using Column Balanced Block Pruning | Hongwu Peng, Shaoyi Huang, ..., Caiwen Ding | 2021
48 | A length adaptive algorithm-hardware co-design of transformer on FPGA through sparse attention and dynamic pipelining | Hongwu Peng, Shaoyi Huang, ..., Caiwen Ding | 2022
48 | Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization | Panjie Qi, E. Sha, ..., Bingbing Li | 2021
40 | Accommodating Transformer onto FPGA: Coupling the Balanced Model Compression and FPGA-Implementation Optimization | Panjie Qi, Yuhong Song, ..., E. Sha | 2021
29 | AutoReP: Automatic ReLU Replacement for Fast Private Network Inference | Hongwu Peng, Shaoyi Huang, ..., Caiwen Ding | 2023
29 | LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference | Hongwu Peng, Ran Ran, ..., Caiwen Ding | 2023
28 | E.T.: Re-Thinking Self-Attention for Transformer Models on GPUs | Shiyang Chen, Shaoyi Huang, ..., Hang Liu | 2021
26 | MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training | Hongwu Peng, Xi Xie, ..., Caiwen Ding | 2023
26 | Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm | Shaoyi Huang, Dongkuan Xu, ..., Caiwen Ding | 2021
25 | Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks | Xiaoru Xie, Hongwu Peng, ..., Caiwen Ding | 2023
