show_and_tell.tensorflow

  • Owner: jazzsaxmafia/show_and_tell.tensorflow
  • Platform:
  • License:: BSD 2-Clause "Simplified" License
  • Category::
  • Topic:
  • Like:
    0
      Compare:

Github stars Tracking Chart

Neural Caption Generator

  • Tensorflow implementation of "Show and Tell" http://arxiv.org/abs/1411.4555
  • Borrowed some code and ideas from Andrej Karpathy's NeuralTalk.
  • You need flickr30k data (images and annotations)

Code

  • make_flickr_dataset.py : Extracting feats of flickr30k images, and save them in './data/feats.npy'
  • model.py : TensorFlow Version

Usage

  • Flickr30k Dataset Download
  • Extract VGG Featues of Flicker30k images (make_flickr_dataset.py)
  • Train: run train() in model.py
  • Test: run test() or test_tf() in model.py
  • parameters: VGG FC7 feature of test image, trained model path
  • Once you download Tensorflow VGG Net (one of the links below), you don't need Caffe when testing.

Downloading data/trained model

  • Extraced FC7 data: download
  • This is used in train() function in model.py. You can skip feature extraction part by using this.
  • Pretrained model download
  • This is used in test() and test_tf() in model.py. If you do not have time for training, or if you just want to check out captioning, download and test the model.
  • Tensorflow VGG net download
  • This file is used in test_tf() in model.py
  • Along with the files above, you might want to download flickr30k annotation data from link

alt tag

License

  • BSD license

Main metrics

Overview
Name With Ownerjazzsaxmafia/show_and_tell.tensorflow
Primary LanguageJupyter Notebook
Program languagePython (Language Count: 2)
Platform
License:BSD 2-Clause "Simplified" License
所有者活动
Created At2015-11-11 15:42:10
Pushed At2016-10-24 12:56:20
Last Commit At2016-03-01 19:50:08
Release Count0
用户参与
Stargazers Count290
Watchers Count22
Fork Count92
Commits Count45
Has Issues Enabled
Issues Count26
Issue Open Count23
Pull Requests Count0
Pull Requests Open Count1
Pull Requests Close Count0
项目设置
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private