RNN captioning

The visual features can be used with sequence learning to form the output.

This is an architecture for generating captions.

For details, refer: Donahue et al., in the paper https://arxiv.org/pdf/1411.4389.pdf, proposed Long-term recurrent convolutional architectures (LRCN) for the task of image captioning.

Get TensorFlow Deep Learning Projects now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.