Search Results for author: Thierry Tambe

Found 5 papers, 0 papers with code

CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning

no code implementations • 4 May 2023 • Sai Qian Zhang, Thierry Tambe, Nestor Cuevas, Gu-Yeon Wei, David Brooks

To minimize the occurrence of expensive eDRAM refresh operations, it is beneficial to shorten the lifetime of stored data during the training process.

Paper
Add Code

Quantifying and Maximizing the Benefits of Back-End Noise Adaption on Attention-Based Speech Recognition Models

no code implementations • 3 May 2021 • Coleman Hooper, Thierry Tambe, Gu-Yeon Wei

This work analyzes how attention-based Bidirectional Long Short-Term Memory (BLSTM) models adapt to noise-augmented speech.

speech-recognition Speech Recognition

Paper
Add Code

EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference

no code implementations • 28 Nov 2020 • Thierry Tambe, Coleman Hooper, Lillian Pentecost, Tianyu Jia, En-Yu Yang, Marco Donato, Victor Sanh, Paul N. Whatmough, Alexander M. Rush, David Brooks, Gu-Yeon Wei

Transformer-based language models such as BERT provide significant accuracy improvement for a multitude of natural language processing (NLP) tasks.

Edge-computing Network Pruning +2

Paper
Add Code

AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference

no code implementations • 29 Sep 2019 • Thierry Tambe, En-Yu Yang, Zishen Wan, Yuntian Deng, Vijay Janapa Reddi, Alexander Rush, David Brooks, Gu-Yeon Wei

Conventional hardware-friendly quantization methods, such as fixed-point or integer, tend to perform poorly at very low word sizes as their shrinking dynamic ranges cannot adequately capture the wide data distributions commonly seen in sequence transduction models.

Quantization

Paper
Add Code

MASR: A Modular Accelerator for Sparse RNNs

no code implementations • 23 Aug 2019 • Udit Gupta, Brandon Reagen, Lillian Pentecost, Marco Donato, Thierry Tambe, Alexander M. Rush, Gu-Yeon Wei, David Brooks

The architecture is enhanced by a series of dynamic activation optimizations that enable compact storage, ensure no energy is wasted computing null operations, and maintain high MAC utilization for highly parallel accelerator designs.

speech-recognition Speech Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.