Search Results for author: Lu Huang

Found 8 papers, 1 papers with code

Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer

no code implementations • 15 Nov 2023 • Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma

Deep biasing for the Transducer can improve the recognition performance of rare words or contextual entities, which is essential in practical applications, especially for streaming Automatic Speech Recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer

no code implementations • 7 Jun 2023 • Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma

Domain adaptation using text-only corpus is challenging in end-to-end(E2E) speech recognition.

Domain Adaptation Language Modelling +2

Paper
Add Code

HMM-Free Encoder Pre-Training for Streaming RNN Transducer

no code implementations • 2 Apr 2021 • Lu Huang, Jingyu Sun, Yufeng Tang, JunFeng Hou, Jinkun Chen, Jun Zhang, Zejun Ma

This work describes an encoder pre-training procedure using frame-wise label to improve the training of streaming recurrent neural network transducer (RNN-T) model.

Speech Recognition

Paper
Add Code

Deep Convolutional Sparse Coding Network for Pansharpening with Guidance of Side Information

1 code implementation • 10 Mar 2021 • Shuang Xu, Jiangshe Zhang, Kai Sun, Zixiang Zhao, Lu Huang, Junmin Liu, Chunxia Zhang

Pansharpening is a fundamental issue in remote sensing field.

Pansharpening Rolling Shutter Correction

Paper
Code

FGF-GAN: A Lightweight Generative Adversarial Network for Pansharpening via Fast Guided Filter

no code implementations • 31 Dec 2020 • Zixiang Zhao, Jiangshe Zhang, Shuang Xu, Kai Sun, Lu Huang, Junmin Liu, Chunxia Zhang

In addition, the latent information of features can be preserved effectively through adversarial training.

Generative Adversarial Network Image Enhancement +1

Paper
Add Code

Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation

no code implementations • 25 Dec 2019 • Lu Huang, Gaofeng Cheng, Pengyuan Zhang, Yi Yang, Shumin Xu, Jiasong Sun

The experimental results show that uPIT outperforms cPIT when LC-BLSTM is used during inference.

Speech Separation

Paper
Add Code

The Domain Shift Problem of Medical Image Segmentation and Vendor-Adaptation by Unet-GAN

no code implementations • 30 Oct 2019 • Wenjun Yan, Yuanyuan Wang, Shengjia Gu, Lu Huang, Fuhua Yan, Liming Xia, Qian Tao

In this work, we proposed a generic framework to address this problem, consisting of (1) an unpaired generative adversarial network (GAN) for vendor-adaptation, and (2) a Unet for object segmentation.

Generative Adversarial Network Image Segmentation +3

Paper
Add Code

An Improved Residual LSTM Architecture for Acoustic Modeling

no code implementations • 17 Aug 2017 • Lu Huang, Jiasong Sun, Ji Xu, Yi Yang

Long Short-Term Memory (LSTM) is the primary recurrent neural networks architecture for acoustic modeling in automatic speech recognition systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.