Kensho Technologies, MIT Cambridge
Subword tokenization just got more efficient: ToaST cuts token counts by 11% and improves language model performance by up to 7.6% compared to standard tokenization methods.