Objects are the secret key to link between language and vision

In vision-and-language pretraining (VLP), objects can serve as anchor points that make it easier to align semantics between image-text pairs. Learn how Oscar, a novel VLP framework that uses objects as anchors, sets a new state of the art on six vision-and-language tasks.

Source: www.microsoft.com/en-us/research/blog/objects-are-the-secret-key-to-revealing-the-world-between-vision-and-language/
