I used Captum to interpret the output of a MobileNetV2, visualizing the regions of the input image that most strongly drove the model's prediction.
I combined my previous posts on image captioning and visual question answering and extended them into a broader topic: connecting computer vision and natural language.
Recently, while working on PyTorch multi-GPU training, I ran into a nightmarish GPU memory problem. After some expensive trial and error, I finally found a solution.
I will start from the problem of semantic segmentation, introduce how CNNs can solve it, and discuss fully convolutional networks (FCNs), a widely used framework for semantic segmentation, in great detail. I will also analyze the MXNet implementation of FCNs.
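The core FCN idea can be sketched in a few lines: replace the fully connected classifier with 1x1 convolutions so the network emits a coarse per-pixel score map, then upsample it back to the input resolution. The toy backbone below is an assumption for brevity (the post works with VGG/ResNet backbones in MXNet; this sketch uses PyTorch):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyFCN(nn.Module):
    """Minimal FCN sketch: conv backbone + 1x1 classifier + upsampling."""

    def __init__(self, num_classes=21):
        super().__init__()
        self.backbone = nn.Sequential(  # downsamples spatially by 4x
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
        )
        # 1x1 conv plays the role of the fully connected classifier,
        # producing one score channel per class at every spatial location.
        self.classifier = nn.Conv2d(32, num_classes, 1)

    def forward(self, x):
        h, w = x.shape[-2:]
        score = self.classifier(self.backbone(x))
        # FCN learns this upsampling with a transposed convolution;
        # bilinear interpolation is the simplest stand-in here.
        return F.interpolate(score, size=(h, w), mode="bilinear",
                             align_corners=False)

x = torch.rand(2, 3, 64, 64)          # a batch of stand-in images
out = TinyFCN()(x)                    # per-pixel class scores
print(out.shape)                      # (batch, classes, height, width)
```

Taking an `argmax` over the class dimension of the output yields a dense segmentation mask at full input resolution.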