no code implementations • 21 Mar 2024 • Zeyu Han, Chao GAO, Jinyang Liu, Jeff Zhang, Sai Qian Zhang
In addition to the algorithmic perspective, we overview various real-world system designs to investigate the implementation costs associated with different PEFT algorithms.
no code implementations • 2 Dec 2023 • Kiran Thorat, Jiahui Zhao, Yaotian Liu, Hongwu Peng, Xi Xie, Bin Lei, Jeff Zhang, Caiwen Ding
The increasing use of Advanced Language Models (ALMs) in diverse sectors, particularly due to their impressive capability to generate top-tier content following linguistic instructions, forms the core of this investigation.
1 code implementation • 18 May 2021 • Udit Gupta, Samuel Hsia, Jeff Zhang, Mark Wilkening, Javin Pombra, Hsien-Hsin S. Lee, Gu-Yeon Wei, Carole-Jean Wu, David Brooks
Thus, we design RecPipeAccel (RPAccel), a custom accelerator that jointly optimizes quality, tail-latency, and system throughput.
no code implementations • 2 Jul 2018 • Jeff Zhang, Siddharth Garg
FATE proposes two novel ideas: (i) DelayNet, a DNN based timing model for MAC units; and (ii) a statistical sampling methodology that reduces the number of MAC operations for which timing simulations are performed.
no code implementations • 11 Feb 2018 • Jeff Zhang, Kartheek Rangineni, Zahra Ghodsi, Siddharth Garg
Hardware accelerators are being increasingly deployed to boost the performance and energy efficiency of deep neural network (DNN) inference.
no code implementations • 11 Feb 2018 • Jeff Zhang, Tianyu Gu, Kanad Basu, Siddharth Garg
Due to their growing popularity and computational cost, deep neural networks (DNNs) are being targeted for hardware acceleration.
no code implementations • 27 Jun 2017 • Peter Wang, Zhongxia Yan, Jeff Zhang
Video prediction has been an active topic of research in the past few years.