Search Results for author: Ismail Tutar

Found 3 papers, 0 papers with code

Catalog Phrase Grounding (CPG): Grounding of Product Textual Attributes in Product Images for e-commerce Vision-Language Applications

no code implementations • 30 Aug 2023 • Wenyi Wu, Karim Bouyarmane, Ismail Tutar

We present Catalog Phrase Grounding (CPG), a model that can associate product textual data (title, brands) into corresponding regions of product images (isolated product region, brand logo region) for e-commerce vision-language applications.

Decoder object-detection +2

Paper
Add Code

Solving Price Per Unit Problem Around the World: Formulating Fact Extraction as Question Answering

no code implementations • 12 Apr 2022 • Tarik Arici, Kushal Kumar, Hayreddin Çeker, Anoop S V K K Saladi, Ismail Tutar

Our model architecture consists of two subnetworks for the two subtasks: a classifier to predict UoM type (or the question) and an extractor to extract the relevant quantities.

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

MLIM: Vision-and-Language Model Pre-training with Masked Language and Image Modeling

no code implementations • 24 Sep 2021 • Tarik Arici, Mehmet Saygin Seyfioglu, Tal Neiman, Yi Xu, Son Train, Trishul Chilimbi, Belinda Zeng, Ismail Tutar

Vision-and-Language Pre-training (VLP) improves model performance for downstream tasks that require image and text inputs.

Image Reconstruction Language Modelling +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.