Preview - rendered architecture diagrams; full validated atlas + dataset coming.
Back to models

vilt:base-vision-language-fusion

multimodal vilt base validation pending

Architecture diagram

Rendered with TorchLens / Graphviz

Open SVG
vilt:base-vision-language-fusion architecture diagram