Oscar: Objects are the secret key to link between language and vision
In vision-and-language pretraining (VLP), objects can serve as anchor points that make it easier to align the semantics of image-text pairs. Learn how Oscar, a novel VLP framework that uses objects this way, sets a new state of the art on six vision-and-language tasks.
May 22, 2020