FRESH Hacker News
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment
21 points by gmays