VGG16 class probabilities and audio forced alignments for the Flickr8k dataset

Generated by Mark Hasegawa-Johnson, 3/22/2017, using Davi Frossard's code and data by Hodosh, Young and Hockenmaier.



The following code chooses an image file at random, shows the top five ImageNet classes that the VGG16 network assigns to that image, and then lists the five text transcriptions that Turkers provided for it in the Flickr8k dataset:
$ python
>>> import showprobs
>>> showprobs.showrandom()

To see all 1000 class probabilities for a randomly selected image, pass 1000 as the argument:
>>> showprobs.showrandom(1000)
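
As a rough illustration of what "top five" means above, the sketch below picks the k most probable classes from a vector of class probabilities. It is not the code in showprobs; the function name top_k, the variables probs and labels, and the toy class names are all hypothetical, and the real VGG16 output has 1000 entries rather than four.

```python
import heapq

def top_k(probs, labels, k=5):
    """Return the k (label, probability) pairs with the highest probability.

    Hypothetical helper for illustration; not part of showprobs.
    """
    if len(probs) != len(labels):
        raise ValueError("probs and labels must have the same length")
    # Indices of the k largest probabilities, in descending order.
    best = heapq.nlargest(k, range(len(probs)), key=probs.__getitem__)
    return [(labels[i], probs[i]) for i in best]

# Toy example with four made-up classes (assumed data, not from the dataset).
labels = ["dog", "cat", "frisbee", "beach"]
probs = [0.60, 0.05, 0.30, 0.05]
for label, p in top_k(probs, labels, k=2):
    print(f"{p:.2f}  {label}")
```

Asking for more classes than exist simply returns all of them, sorted by probability, which mirrors the idea of requesting all 1000 classes.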