Recovering type information from untyped GPU register files is key to effective binary analysis, enabling reverse engineering and security analysis of proprietary GPU code.
Asynchronous GPU features like NVIDIA's TMA can unlock up to 6x speedups in sparse matrix multiplication, but only with careful kernel co-design.