Abstract: Image-text matching is a fundamental task in bridging the semantics between vision and language. The key challenge lies in establishing accurate alignment between two heterogeneous ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results