MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment

Published in: ArXiv preprint