no code implementations • 3 Feb 2023 • Dongsheng Xu, Qingbao Huang, Feng Shuang, Yi Cai
One possible reason is that current studies mainly focus on constructing plane-level geometric relationships among scene text, without using depth information.
Image Captioning • Optical Character Recognition (OCR) • +1