Search papers, labs, and topics across Lattice.
1
0
3
2
Current LLM watermarks are surprisingly easy to spoof: a small 4B model, trained on just 100 examples using reinforcement learning, can fool them over 60% of the time.