Search Results for author: Xiaofan Zhang

Found 44 papers, 22 papers with code

Tool Calling: Enhancing Medication Consultation via Retrieval-Augmented Large Language Models

no code implementations • 27 Apr 2024 • Zhongzhen Huang, Kui Xue, Yongqi Fan, Linjie Mu, Ruoyu Liu, Tong Ruan, Shaoting Zhang, Xiaofan Zhang

With experimental results, we show that our framework brings notable performance improvements and surpasses the previous counterparts in the evidence retrieval process in terms of evidence retrieval accuracy.

Answer Generation Question Answering +1

Paper
Add Code

Grounded Knowledge-Enhanced Medical VLP for Chest X-Ray

no code implementations • 23 Apr 2024 • Qiao Deng, Zhongzhen Huang, Yunqi Wang, Zhichuan Wang, Zhao Wang, Xiaofan Zhang, Qi Dou, Yeung Yu Hui, Edward S. Hui

Medical vision-language pre-training has emerged as a promising approach for learning domain-general representations of medical image and text.

Medical Visual Question Answering Question Answering +1

Paper
Add Code

CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation

no code implementations • 26 Mar 2024 • Yongrui Yu, HanYu Chen, Zitian Zhang, Qiong Xiao, Wenhui Lei, Linrui Dai, Yu Fu, Hui Tan, Guan Wang, Peng Gao, Xiaofan Zhang

To address these problems, we present a pipeline that integrates the conditional diffusion model for lymph node generation and the nnU-Net model for lymph node segmentation to improve the segmentation performance of abdominal lymph nodes through synthesizing a diversity of realistic abdominal lymph node data.

Denoising Image Generation +4

Paper
Add Code

PathoTune: Adapting Visual Foundation Model to Pathological Specialists

no code implementations • 25 Mar 2024 • Jiaxuan Lu, Fang Yan, Xiaofan Zhang, Yue Gao, Shaoting Zhang

As natural image understanding moves towards the pretrain-finetune era, research in pathology imaging is concurrently evolving.

Paper
Add Code

VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification

1 code implementation • 23 Mar 2024 • Lanfeng Zhong, Xin Liao, Shaoting Zhang, Xiaofan Zhang, Guotai Wang

To address this issue, we introduce VLM-CPL, a novel approach based on consensus pseudo labels that integrates two noisy label filtering techniques with a semi-supervised learning strategy.

Image Classification Zero-Shot Learning

Paper
Code

GuideGen: A Text-guided Framework for Joint CT Volume and Anatomical structure Generation

1 code implementation • 12 Mar 2024 • Linrui Dai, Rongzhao Zhang, Zhongzhen Huang, Xiaofan Zhang

Secondly, our Conditional Image Generator autoregressively generates CT slices conditioned on a corresponding mask slice to incorporate both style information and anatomical guidance.

Paper
Code

Modality-Aware and Shift Mixer for Multi-modal Brain Tumor Segmentation

no code implementations • 4 Mar 2024 • Zhongzhen Huang, Linda Wei, Shaoting Zhang, Xiaofan Zhang

Combining images from multi-modalities is beneficial to explore various information in computer vision, especially in the medical domain.

Brain Tumor Segmentation Segmentation +1

Paper
Add Code

OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine

no code implementations • 28 Feb 2024 • Xiaosong Wang, Xiaofan Zhang, Guotai Wang, Junjun He, Zhongyu Li, Wentao Zhu, Yi Guo, Qi Dou, Xiaoxiao Li, Dequan Wang, Liang Hong, Qicheng Lao, Tong Ruan, Yukun Zhou, Yixue Li, Jie Zhao, Kang Li, Xin Sun, Lifeng Zhu, Shaoting Zhang

The emerging trend of advancing generalist artificial intelligence, such as GPTv4 and Gemini, has reshaped the landscape of research (academia and industry) in machine learning and many other research areas.

Transfer Learning

Paper
Add Code

USFM: A Universal Ultrasound Foundation Model Generalized to Tasks and Organs towards Label Efficient Image Analysis

no code implementations • 30 Dec 2023 • Jing Jiao, Jin Zhou, Xiaokang Li, Menghua Xia, Yi Huang, Lihong Huang, Na Wang, Xiaofan Zhang, Shichong Zhou, Yuanyuan Wang, Yi Guo

In this paper, we present a universal US foundation model, named USFM, generalized to diverse tasks and organs towards label efficient US image analysis.

Image Enhancement

Paper
Add Code

PathoDuet: Foundation Models for Pathological Slide Analysis of H&E and IHC Stains

1 code implementation • 15 Dec 2023 • Shengyi Hua, Fang Yan, Tianle Shen, Xiaofan Zhang

In this work, we present PathoDuet, a series of pretrained models on histopathological images, and a new self-supervised learning framework in histopathology.

Self-Supervised Learning

162

Paper
Code

ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting

1 code implementation • 7 Dec 2023 • Yankai Jiang, Zhongzhen Huang, Rongzhao Zhang, Xiaofan Zhang, Shaoting Zhang

The long-tailed distribution problem in medical image analysis reflects a high prevalence of common conditions and a low prevalence of rare ones, which poses a significant challenge in developing a unified model capable of identifying rare or novel tumor categories not encountered during training.

Organ Segmentation Segmentation +1

Paper
Code

AG-CRC: Anatomy-Guided Colorectal Cancer Segmentation in CT with Imperfect Anatomical Knowledge

1 code implementation • 7 Oct 2023 • Rongzhao Zhang, Zhian Bai, Ruoying Yu, Wenrao Pang, Lingyun Wang, Lifeng Zhu, Xiaofan Zhang, huan zhang, Weiguo Hu

When delineating lesions from medical images, a human expert can always keep in mind the anatomical structure behind the voxels.

Anatomy Computed Tomography (CT) +6

Paper
Code

Automatic lobe segmentation using attentive cross entropy and end-to-end fissure generation

no code implementations • 24 Jul 2023 • Qi Su, Na Wang, Jiawen Xie, Yinan Chen, Xiaofan Zhang

Therefore, we propose a new automatic lung lobe segmentation framework, in which we urge the model to pay attention to the area around the pulmonary fissure during the training process, which is realized by a task-specific loss function.

Segmentation

Paper
Add Code

Efficient Subclass Segmentation in Medical Images

1 code implementation • 1 Jul 2023 • Linrui Dai, Wenhui Lei, Xiaofan Zhang

One feasible way to reduce the cost is to annotate with coarse-grained superclass labels while using limited fine-grained annotations as a complement.

Segmentation Semantic Segmentation

Paper
Code

MedLSAM: Localize and Segment Anything Model for 3D CT Images

1 code implementation • 26 Jun 2023 • Wenhui Lei, Xu Wei, Xiaofan Zhang, Kang Li, Shaoting Zhang

Our findings are twofold: 1) MedLAM is capable of directly localizing any anatomical structure using just a few template scans, yet its performance surpasses that of fully supervised models; 2) MedLSAM not only aligns closely with the performance of SAM and its specialized medical adaptations with manual prompts but achieves this with minimal reliance on extreme point annotations across the entire dataset.

Image Segmentation Semantic Segmentation

464

Paper
Code

KiUT: Knowledge-injected U-Transformer for Radiology Report Generation

no code implementations • CVPR 2023 • Zhongzhen Huang, Xiaofan Zhang, Shaoting Zhang

Radiology report generation aims to automatically generate a clinically accurate and coherent paragraph from the X-ray image, which could relieve radiologists from the heavy burden of report writing.

Clinical Knowledge Decoder

Paper
Add Code

Augmenting Hessians with Inter-Layer Dependencies for Mixed-Precision Post-Training Quantization

no code implementations • 8 Jun 2023 • Clemens JS Schaefer, Navid Lambert-Shirzad, Xiaofan Zhang, Chiachen Chou, Tom Jablin, Jian Li, Elfie Guo, Caitlin Stanton, Siddharth Joshi, Yu Emma Wang

To address this challenge, we propose a mixed-precision post training quantization (PTQ) approach that assigns different numerical precisions to tensors in a network based on their specific needs, for a reduced memory footprint and improved latency while preserving model accuracy.

Quantization

Paper
Add Code

MidMed: Towards Mixed-Type Dialogues for Medical Consultation

1 code implementation • 5 Jun 2023 • Xiaoming Shi, Zeming Liu, Chuan Wang, Haitao Leng, Kui Xue, Xiaofan Zhang, Shaoting Zhang

To mitigate this challenge, we propose a novel task and create a human-to-human mixed-type medical consultation dialogue corpus, termed MidMed, covering five dialogue types: task-oriented dialogue for diagnosis, recommendation, knowledge-grounded dialogue, QA, and chitchat.

Dialogue Generation

Paper
Code

Mixed Precision Post Training Quantization of Neural Networks with Sensitivity Guided Search

no code implementations • 2 Feb 2023 • Clemens JS Schaefer, Elfie Guo, Caitlin Stanton, Xiaofan Zhang, Tom Jablin, Navid Lambert-Shirzad, Jian Li, Chiachen Chou, Siddharth Joshi, Yu Emma Wang

In this paper, we propose a method to efficiently determine quantization configurations of different tensors in ML models using post-training mixed precision quantization.

Quantization

Paper
Add Code

Contrastive Semi-supervised Learning for Domain Adaptive Segmentation Across Similar Anatomical Structures

1 code implementation • 18 Aug 2022 • Ran Gu, Jingyang Zhang, Guotai Wang, Wenhui Lei, Tao Song, Xiaofan Zhang, Kang Li, Shaoting Zhang

To solve this problem, we propose Contrastive Semi-supervised learning for Cross Anatomy Domain Adaptation (CS-CADA) that adapts a model to segment similar structures in a target domain, which requires only limited annotations in the target domain by leveraging a set of existing annotated images of similar structures in a source domain.

Anatomy Contrastive Learning +4

Paper
Code

Compilation and Optimizations for Efficient Machine Learning on Embedded Systems

no code implementations • 6 Jun 2022 • Xiaofan Zhang, Yao Chen, Cong Hao, Sitao Huang, Yuhong Li, Deming Chen

Deep Neural Networks (DNNs) have achieved great success in a variety of machine learning (ML) applications, delivering high-quality inferencing solutions in computer vision, natural language processing, and virtual reality, etc.

BIG-bench Machine Learning

Paper
Add Code

Contrastive Domain Disentanglement for Generalizable Medical Image Segmentation

1 code implementation • 13 May 2022 • Ran Gu, Jiangshan Lu, Jingyang Zhang, Wenhui Lei, Xiaofan Zhang, Guotai Wang, Shaoting Zhang

To tackle this deficiency, we propose Contrastive Domain Disentangle (CDD) network for generalizable medical image segmentation.

Disentanglement Domain Generalization +4

Paper
Code

AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models

no code implementations • 21 Jan 2022 • Xiaofan Zhang, Zongwei Zhou, Deming Chen, Yu Emma Wang

By evaluating on SQuAD, a model found by AutoDistill achieves an 88. 4% F1 score with 22. 8M parameters, which reduces parameters by more than 62% while maintaining higher accuracy than DistillBERT, TinyBERT, and NAS-BERT.

Bayesian Optimization Knowledge Distillation +2

Paper
Add Code

EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search

1 code implementation • 24 Nov 2021 • Qian Jiang, Xiaofan Zhang, Deming Chen, Minh N. Do, Raymond A. Yeh

In this work, we propose End-to-end Hardware-aware DNAS (EH-DNAS), a seamless integration of end-to-end hardware benchmarking, and fully automated DNAS to deliver hardware-efficient deep neural networks on various platforms, including Edge GPUs, Edge TPUs, Mobile CPUs, and customized accelerators.

Benchmarking Neural Architecture Search

Paper
Code

One-shot Weakly-Supervised Segmentation in Medical Images

1 code implementation • 21 Nov 2021 • Wenhui Lei, Qi Su, Ran Gu, Na Wang, Xinglong Liu, Guotai Wang, Xiaofan Zhang, Shaoting Zhang

Deep neural networks usually require accurate and a large number of annotations to achieve outstanding performance in medical image segmentation.

Denoising Image Segmentation +5

Paper
Code

WORD: A large scale dataset, benchmark and clinical applicable study for abdominal organ segmentation from CT image

3 code implementations • 3 Nov 2021 • Xiangde Luo, Wenjun Liao, Jianghong Xiao, Jieneng Chen, Tao Song, Xiaofan Zhang, Kang Li, Dimitris N. Metaxas, Guotai Wang, Shaoting Zhang

Deep learning-based medical image segmentation has shown the potential to reduce manual delineation efforts, but it still requires a large-scale fine annotated dataset for training, and there is a lack of large-scale datasets covering the whole abdomen region with accurate and detailed annotations for the whole abdominal organ segmentation.

Image Segmentation Medical Image Segmentation +4

186

Paper
Code

F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding

no code implementations • 8 Mar 2021 • Xiaofan Zhang, Dawei Wang, Pierce Chuang, Shugao Ma, Deming Chen, Yuecheng Li

Creating virtual avatars with realistic rendering is one of the most essential and challenging tasks to provide highly immersive virtual reality (VR) experiences.

Decoder

Paper
Add Code

Improving Random-Sampling Neural Architecture Search by Evolving the Proxy Search Space

1 code implementation • 1 Jan 2021 • Yuhong Li, Cong Hao, Xiaofan Zhang, JinJun Xiong, Wen-mei Hwu, Deming Chen

This raises the question of whether we can find an effective proxy search space (PS) that is only a small subset of GS to dramatically improve RandomNAS’s search efficiency while at the same time keeping a good correlation for the top-performing architectures.

Image Classification Neural Architecture Search

Paper
Code

Effective Algorithm-Accelerator Co-design for AI Solutions on Edge Devices

no code implementations • 14 Oct 2020 • Cong Hao, Yao Chen, Xiaofan Zhang, Yuhong Li, JinJun Xiong, Wen-mei Hwu, Deming Chen

High quality AI solutions require joint optimization of AI algorithms, such as deep neural networks (DNNs), and their hardware accelerators.

Paper
Add Code

EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions

no code implementations • 6 May 2020 • Yuhong Li, Cong Hao, Xiaofan Zhang, Xinheng Liu, Yao Chen, JinJun Xiong, Wen-mei Hwu, Deming Chen

We formulate the co-search problem by fusing DNN search variables and hardware implementation variables into one solution space, and maximize both algorithm accuracy and hardware implementation quality.

Neural Architecture Search

Paper
Add Code

HybridDNN: A Framework for High-Performance Hybrid DNN Accelerator Design and Implementation

no code implementations • 8 Apr 2020 • Hanchen Ye, Xiaofan Zhang, Zhize Huang, Gengsheng Chen, Deming Chen

To speedup Deep Neural Networks (DNN) accelerator design and enable effective implementation, we propose HybridDNN, a framework for building high-performance hybrid DNN accelerators and delivering FPGA-based hardware implementations.

Paper
Add Code

AutoDNNchip: An Automated DNN Chip Predictor and Builder for Both FPGAs and ASICs

1 code implementation • 6 Jan 2020 • Pengfei Xu, Xiaofan Zhang, Cong Hao, Yang Zhao, Yongan Zhang, Yue Wang, Chaojian Li, Zetong Guan, Deming Chen, Yingyan Lin

Specifically, AutoDNNchip consists of two integrated enablers: (1) a Chip Predictor, built on top of a graph-based accelerator representation, which can accurately and efficiently predict a DNN accelerator's energy, throughput, and area based on the DNN model parameters, hardware configuration, technology-based IPs, and platform constraints; and (2) a Chip Builder, which can automatically explore the design space of DNN chips (including IP selection, block configuration, resource balancing, etc.

Paper
Code

Large-scale Gastric Cancer Screening and Localization Using Multi-task Deep Neural Network

no code implementations • 9 Oct 2019 • Hong Yu, Xiaofan Zhang, Lingjun Song, Liren Jiang, Xiaodi Huang, Wen Chen, Chenbin Zhang, Jiahui Li, Jiji Yang, Zhiqiang Hu, Qi Duan, Wanyuan Chen, Xianglei He, Jinshuang Fan, Weihai Jiang, Li Zhang, Chengmin Qiu, Minmin Gu, Weiwei Sun, Yangqiong Zhang, Guangyin Peng, Weiwei Shen, Guohui Fu

Gastric cancer is one of the most common cancers, which ranks third among the leading causes of cancer death.

Specificity whole slide images

Paper
Add Code

SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems

2 code implementations • 20 Sep 2019 • Xiaofan Zhang, Haoming Lu, Cong Hao, Jiachen Li, Bowen Cheng, Yuhong Li, Kyle Rupnow, JinJun Xiong, Thomas Huang, Honghui Shi, Wen-mei Hwu, Deming Chen

Object detection and tracking are challenging tasks for resource-constrained embedded systems.

Efficient Neural Network Object +3

231

Paper
Code

SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection

1 code implementation • 25 Jun 2019 • Xiaofan Zhang, Cong Hao, Haoming Lu, Jiachen Li, Yuhong Li, Yuchen Fan, Kyle Rupnow, JinJun Xiong, Thomas Huang, Honghui Shi, Wen-mei Hwu, Deming Chen

Developing artificial intelligence (AI) at the edge is always challenging, since edge devices have limited computation capability and memory resources but need to meet demanding requirements, such as real-time processing, high throughput performance, and high inference accuracy.

object-detection Object Detection

231

Paper
Code

A Bi-Directional Co-Design Approach to Enable Deep Learning on IoT Devices

2 code implementations • 20 May 2019 • Xiaofan Zhang, Cong Hao, Yuhong Li, Yao Chen, JinJun Xiong, Wen-mei Hwu, Deming Chen

Developing deep learning models for resource-constrained Internet-of-Things (IoT) devices is challenging, as it is difficult to achieve both good quality of results (QoR), such as DNN model inference accuracy, and quality of service (QoS), such as inference latency, throughput, and power consumption.

object-detection Object Detection

231

Paper
Code

FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge

2 code implementations • 9 Apr 2019 • Cong Hao, Xiaofan Zhang, Yuhong Li, Sitao Huang, JinJun Xiong, Kyle Rupnow, Wen-mei Hwu, Deming Chen

While embedded FPGAs are attractive platforms for DNN acceleration on edge-devices due to their low latency and high energy efficiency, the scarcity of resources of edge-scale FPGA devices also makes it challenging for DNN deployment.

C++ code object-detection +1

231

Paper
Code

Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering

1 code implementation • CVPR 2019 • Chenyou Fan, Xiaofan Zhang, Shu Zhang, Wensheng Wang, Chi Zhang, Heng Huang

In this paper, we propose a novel end-to-end trainable Video Question Answering (VideoQA) framework with three major components: 1) a new heterogeneous memory which can effectively learn global context information from appearance and motion features; 2) a redesigned question memory which helps understand the complex semantics of question and highlights queried subjects; and 3) a new multimodal fusion layer which performs multi-step reasoning by attending to relevant visual and textual hints with self-updated attention.

Ranked #27 on Visual Question Answering (VQA) on MSRVTT-QA

Question Answering Video Question Answering +1

Paper
Code

SiamVGG: Visual Tracking using Deeper Siamese Networks

4 code implementations • 7 Feb 2019 • Yuhong Li, Xiaofan Zhang, Deming Chen

It combines a Convolutional Neural Network (CNN) backbone and a cross-correlation operator, and takes advantage of the features from exemplary images for more accurate object tracking.

Ranked #1 on Visual Object Tracking on OTB-50

Visual Object Tracking Visual Tracking

437

Paper
Code

Design Flow of Accelerating Hybrid Extremely Low Bit-width Neural Network in Embedded FPGA

no code implementations • 31 Jul 2018 • Junsong Wang, Qiuwen Lou, Xiaofan Zhang, Chao Zhu, Yonghua Lin, Deming Chen

To create such accelerators, we propose a design flow for accelerating the extremely low bit-width neural network (ELB-NN) in embedded FPGAs with hybrid quantization schemes.

Edge-computing Quantization

Paper
Add Code

Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs

no code implementations • 23 Mar 2018 • Chuanhao Zhuge, Xinheng Liu, Xiaofan Zhang, Sudeep Gummadi, JinJun Xiong, Deming Chen

Deep Convolutional Neural Networks have become a Swiss knife in solving critical artificial intelligence tasks.

Face Recognition

Paper
Add Code

CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes

13 code implementations • CVPR 2018 • Yuhong Li, Xiaofan Zhang, Deming Chen

We demonstrate CSRNet on four datasets (ShanghaiTech dataset, the UCF_CC_50 dataset, the WorldEXPO'10 dataset, and the UCSD dataset) and we deliver the state-of-the-art performance.

Ranked #3 on Crowd Counting on Venice

Crowd Counting Scene Recognition

637

Paper
Code

Embedding Label Structures for Fine-Grained Feature Representation

no code implementations • CVPR 2016 • Xiaofan Zhang, Feng Zhou, Yuanqing Lin, Shaoting Zhang

However, previous studies have rarely focused on learning a fined-grained and structured feature representation that is able to locate similar images at different levels of relevance, e. g., discovering cars from the same make or the same model, both of which require high precision.

Fine-Grained Image Classification General Classification +3

Paper
Add Code

Fine-Grained Histopathological Image Analysis via Robust Segmentation and Large-Scale Retrieval

no code implementations • CVPR 2015 • Xiaofan Zhang, Hai Su, Lin Yang, Shaoting Zhang

Computer-aided diagnosis of medical images requires thorough analysis of image details.

Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.