no code implementations • 28 Oct 2022 • Yichao Zhou, James B. Wendt, Navneet Potti, Jing Xie, Sandeep Tata
A key bottleneck in building automatic extraction models for visually rich documents like invoices is the cost of acquiring the several thousand high-quality labeled documents that are needed to train a model with acceptable accuracy.
no code implementations • 7 Jan 2022 • Beliz Gunel, Navneet Potti, Sandeep Tata, James B. Wendt, Marc Najork, Jing Xie
Automating information extraction from form-like documents at scale is a pressing need due to its potential impact on automating business workflows across many industries like financial services, insurance, and healthcare.
1 code implementation • ACL 2020 • Bodhisattwa Majumder, Navneet Potti, Sandeep Tata, James B. Wendt, Qi Zhao, Marc Najork
We propose a novel approach using representation learning for tackling the problem of extracting structured information from form-like document images.