End-to-end Generative Pre-training for Multimodal Video Captioning

Posted by Paul Hongsuck Seo and Arsha Nagrani, Research Scientists, Google Research, Perception Team
You must Subscribe to read our archived content. Already subscribed? log in here.

Leave a Reply

Your email address will not be published.

Shared by: Google AI Technology

Tags: ,