Leopard: A Vision Language Model for Text-Rich Multi-Image Tasks

5 points | by PaulHoule 2 days ago

No comments yet.