Loading [MathJax]/extensions/MathMenu.js
Qing Wang - IEEE Xplore Author Profile

Showing 1-18 of 18 results

Filter Results

Show

Results

Continuous Super-Resolution (CSR) has garnered considerable popularity for its capability to reconstruct high-resolution (HR) images from low-resolution (LR) inputs at various scales, thereby holding significant practical value in real-world applications. However, the existing studies have relied solely on synthetic datasets due to the scarcity of real-world continuous datasets. In this paper, we ...Show More
Learning to recognize novel concepts from just a few image samples is very challenging as the learned model is easily overfitted on the few data and results in poor generalizability. One promising but underexplored solution is to compensate for the novel classes by generating plausible samples. However, most existing works of this line exploit visual information only, rendering the generated data ...Show More
Existing unsupervised disentanglement methods in latent space of the Generative Adversarial Networks (GANs) rely on the analysis and decomposition of pre-trained weight matrix. However, they only consider the weight matrix of the fully connected layers, ignoring the convolutional layers which are indispensable for image processing in modern generative models. This results in the learned latent sem...Show More
Deep learning(DL) based on image classification, segmentation, detection methods have been providing state-of-the-art performance in recent years. Particularly, these techniques have been successfully applied to medical image to Help doctors have a more convenient and clear understanding of the patient's physical condition, and among them, one deep learning technique, U-Net model has become one of...Show More
Parallel end-to-end driving aims to improve the performance of end-to-end driving models using both simulated- and real-world data. However, how to efficiently utilize the data from both the simulated world and the real world remains a difficult issue, since these data are usually not well aligned. In this article, we build a data set called the parallel end-to-end driving data set (PED) for paral...Show More
The main challenge in video salient object detection is how to model object motion and dramatic changes in appearance contrast. In this work, we propose an attention embedded spatio-temporal network (ASTN) to adaptively exploit diverse factors that influence dynamic saliency prediction within a unified framework. To compensate for object movement, we introduce a flow-guided spatial learning (FGSL)...Show More
Crowd counting, aiming at estimating the total number of people in unconstrained crowded scenes, has increasingly received attention. But it is greatly challenged by the huge variation in people scale. In this paper, we propose a novel Multi-View Scale Aggregation Network (MVSAN), which handle the scale variation from feature, input and criterion view comprehensively. Firstly, we design a simple b...Show More
Taxi demand prediction has recently attracted increasing research interest due to its huge potential application in large-scale intelligent transportation systems. However, most of the previous methods only considered the taxi demand prediction in origin regions, while ignoring the modeling of the specific situation of the destination passengers. In this paper, we present a more challenging task, ...Show More
Current methods for image captioning tend to generate sentences that are generally overly rigid and composed of some most frequent words/phrases, leading to inaccurate and indistinguishable descriptions. This is primarily due to the uneven word distribution of the ground truth captions that encourages to generate high frequent words/phrases while suppressing the less frequent but more concrete one...Show More
Imitation learning for the end-to-end autonomous driving has drawn renewed attention from academic communities. Current methods either only use images as the input, which will yield ambiguities when a vehicle approaches an intersection, or use additional command information to navigate the vehicle but inefficiently. Focusing on making the vehicle automatically drive along the given path, we propos...Show More
Taxi demand prediction has recently attracted increasing research interest due to its huge potential application in large-scale intelligent transportation systems. However, most of the previous methods only considered the taxi demand prediction in origin regions, but neglected the modeling of the specific situation of the destination passengers. We believe it is suboptimal to preallocate the taxi ...Show More
Facial landmark localization plays a critical role in face recognition and analysis. In this paper, we propose a novel cascaded backbone-branches fully convolutional neural network (BB-FCN) for rapidly and accurately localizing facial landmarks in unconstrained and cluttered settings. Our proposed BB-FCN generates facial landmark response maps directly from raw images without any preprocessing. BB...Show More
Person re-identification (re-id) which resolves to recognize a person from the non-overlapped cameras has received increasing research. In this paper, we addressed a new problem of person re-id, i.e., image-to-video (ImtoV) person re-id, in which the probe is an image and the gallery consists of videos from nonoverlapping cameras with different views of probe image as shown in Fig. 1. It is differ...Show More
This article investigates a data-driven approach for semantic scene understanding, without pixelwise annotation or classifier training. The proposed framework parses a target image in two steps: first, retrieving its exemplars (that is, references) from an image database, where all images are unsegmented but annotated with tags; second, recovering its pixel labels by propagating semantics from the...Show More
This article studies a data-driven approach for semantically scene understanding, without pixelwise annotation and classifier pre-training. Our framework parses a target image with two steps: (i) retrieving its exemplars (i.e. references) from an image database, where all images are unsegmented but annotated with tags; (ii) recovering its pixel labels by propagating semantics from the references. ...Show More
In this paper, we propose a patch-based object tracking algorithm which provides both good enough robustness and computational efficiency. Our algorithm learns and maintains Composite Patch-based Templates (CPT) of the tracking target. Each composite template employs HOG, CS-LBP, and color histogram to represent the local statistics of edges, texture and flatness. The CPT model is initially establ...Show More
Building ERP system with reusable components brings many advantages. In this paper we discuss how to use component library to support development process of ERP system. First the necessity of researching component library-based software development (CLBSD) is introduced. Then, development methodology and development process based on component library are analyzed at a macro level. After that, we i...Show More
On-demand broadcast is an efficient wireless data dissemination method for mobile services.As the rapid growth of the mobile application,there is an increasing need for system to support large data dissemination.This paper investigates wireless on-demand broadcast algorithms.We propose a new scheduling algorithm called FMT(Fluent Media Transfer) which takes the flux of program,the program size,req...Show More