Back to models
MDT (Masked Diffusion Transformer: adaLN-Zero DiT blocks + side-interpolater for masked latent tokens)
history mdt masked validation pending
Architecture diagram
Rendered with TorchLens / Graphviz
Rendered with TorchLens / Graphviz