Search Results for author: Charith Mendis

Found 10 papers, 7 papers with code

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

1 code implementation NeurIPS 2023 Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Kaidi Cao, Bahare Fatemi, Mike Burrows, Charith Mendis, Bryan Perozzi

TpuGraphs provides 25x more graphs than the largest graph property prediction dataset (with comparable graph sizes), and 770x larger graphs on average compared to existing performance prediction datasets on machine learning programs.

Graph Property Prediction Property Prediction

FLuRKA: Fast fused Low-Rank & Kernel Attention

no code implementations27 Jun 2023 Ahan Gupta, Yueming Yuan, Yanqi Zhou, Charith Mendis

FLuRKA provide sizable performance gains over these approximate techniques and are of high quality.

SENSEi: Input-Sensitive Compilation for Accelerating GNNs

no code implementations27 Jun 2023 Damitha Lenadora, Vimarsh Sathia, Gerasimos Gerogiannis, Serif Yesil, Josep Torrellas, Charith Mendis

We leverage this observation to propose SENSEi, a system that exposes different sparse and dense matrix primitive compositions based on different matrix re-associations of GNN computations and selects the best among them based on input attributes.

Graph Attention Graph Embedding

Learning Large Graph Property Prediction via Graph Segment Training

1 code implementation NeurIPS 2023 Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi

Here we propose Graph Segment Training (GST), a general framework that utilizes a divide-and-conquer approach to allow learning large graph property prediction with a constant memory footprint.

Graph Property Prediction Property Prediction

COMET: Neural Cost Model Explanation Framework

1 code implementation14 Feb 2023 Isha Chaudhary, Alex Renda, Charith Mendis, Gagandeep Singh

We generate and compare COMET's explanations for the popular neural cost model, Ithemal against those for an accurate CPU simulation-based cost model, uiCA.

GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation

1 code implementation8 Oct 2022 Ondrej Sykora, Phitchaya Mangpo Phothilimthana, Charith Mendis, Amir Yazdanbakhsh

In this paper, we introduce GRANITE, a new machine learning model that estimates the throughput of basic blocks across different microarchitectures.

Decoder Multi-Task Learning

DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates

2 code implementations8 Oct 2020 Alex Renda, Yishen Chen, Charith Mendis, Michael Carbin

In this paper we present DiffTune, a system for learning the parameters of x86 basic block CPU simulators from coarse-grained end-to-end measurements.

Scheduling

Compiler Auto-Vectorization with Imitation Learning

1 code implementation NeurIPS 2019 Charith Mendis, Cambridge Yang, Yewen Pu, Dr.Saman Amarasinghe, Michael Carbin

We show that the learnt policy produces a vectorization scheme which is better than industry standard compiler heuristics both in terms of static measures and runtime performance.

Imitation Learning

Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks

3 code implementations21 Aug 2018 Charith Mendis, Alex Renda, Saman Amarasinghe, Michael Carbin

Predicting the number of clock cycles a processor takes to execute a block of assembly instructions in steady state (the throughput) is important for both compiler designers and performance engineers.

Cannot find the paper you are looking for? You can Submit a new open access paper.