Search Results for author: Jeff Zhang

Found 7 papers, 1 papers with code

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

no code implementations • 21 Mar 2024 • Zeyu Han, Chao GAO, Jinyang Liu, Jeff Zhang, Sai Qian Zhang

In addition to the algorithmic perspective, we overview various real-world system designs to investigate the implementation costs associated with different PEFT algorithms.

Paper
Add Code

Advanced Large Language Model (LLM)-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis

no code implementations • 2 Dec 2023 • Kiran Thorat, Jiahui Zhao, Yaotian Liu, Hongwu Peng, Xi Xie, Bin Lei, Jeff Zhang, Caiwen Ding

The increasing use of Advanced Language Models (ALMs) in diverse sectors, particularly due to their impressive capability to generate top-tier content following linguistic instructions, forms the core of this investigation.

Language Modelling Large Language Model

Paper
Add Code

RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance

1 code implementation • 18 May 2021 • Udit Gupta, Samuel Hsia, Jeff Zhang, Mark Wilkening, Javin Pombra, Hsien-Hsin S. Lee, Gu-Yeon Wei, Carole-Jean Wu, David Brooks

Thus, we design RecPipeAccel (RPAccel), a custom accelerator that jointly optimizes quality, tail-latency, and system throughput.

Recommendation Systems Scheduling

Paper
Code

FATE: Fast and Accurate Timing Error Prediction Framework for Low Power DNN Accelerator Design

no code implementations • 2 Jul 2018 • Jeff Zhang, Siddharth Garg

FATE proposes two novel ideas: (i) DelayNet, a DNN based timing model for MAC units; and (ii) a statistical sampling methodology that reduces the number of MAC operations for which timing simulations are performed.

General Classification