Computer-Vision

Connecting Computer Vision and Natural Language

27 November 2018·Updated: 31 January 2025·8 mins

I combined my previous posts on image captioning and visual question answering and extended them to a wider topic - connecting computer vision and natural language.

Understanding Fully Convolutional Networks

15 September 2018·Updated: 31 January 2025·31 mins

Deep-Learning Computer-Vision

I will start from the problem of semantic segmentation, introduce how to use CNNs to solve it, and talk about fully convolutional networks, a widely used framework for semantic segmentation, in great details. Moreover, I will analyze the MXNet implementation of FCNs.

A Dive Into Visual Question Answering

27 August 2018·Updated: 31 January 2025·6 mins

Deep-Learning Computer-Vision NLP

I read some papers on VQA and summarized its state-of-the-art, bottlenecks and possible solutions.

Playing With Image Captioning

8 August 2018·Updated: 31 January 2025·5 mins

Deep-Learning Computer-Vision NLP

I played with image captioning using neuraltalk2 written by Andrej Karpathy.