Department of Computer Science, Rensselaer Polytechnic Institute (RPI)
Language diffusion models aren't just generative; they're associative memories that reveal a sharp memorization-to-generalization transition detectable via conditional entropy.
Decoder-based Sense Knowledge Distillation (DSKD) lets generative models learn structured semantics without the inference-time overhead of dictionary lookups.