Preview - rendered architecture diagrams; full validated atlas + dataset coming.
Back to models

blip-2:vision-to-text

multimodal blip 2 validation pending

Architecture diagram

Rendered with TorchLens / Graphviz

Open SVG
blip-2:vision-to-text architecture diagram