Posts

I used Captum to interpret the output of a MobileNetV2, visualizing the main regions in the input image that drove the model to …

I converted a PyTorch model to Core ML and ran it on an iPhone.

I combined my previous posts on image captioning and visual question answering and extended them to a wider topic - connecting computer …

Recently I was working on PyTorch multi-GPU training and ran into a nightmarish GPU memory problem. After some expensive trial and …

I will start from the problem of semantic segmentation, introduce how to use CNNs to solve it, and talk about fully convolutional …

I read some papers on VQA and summarized its state of the art, bottlenecks, and possible solutions.

I have been fascinated by image captioning for some time but still have not played with it. I gave it a try today using the open source …