Search papers, labs, and topics across Lattice.
Text understanding, generation, summarization, translation, information extraction, and linguistic analysis.
#21 of 24
2
Transformers can be explicitly designed to perform nonlinear regression in-context by leveraging attention as a featurizer, offering a theoretical understanding of how these models learn complex relationships from prompts.
Synthetic data augmentation and per-language threshold tuning can significantly boost the performance of LLMs on multilingual tasks, outperforming alternative architectures that showed promise on the development set.
AI co-mentorship lets high schoolers build real-world financial models, skipping the classroom grind and diving straight into problem-solving.
Hallucination detection can be reframed as a dynamical systems problem, enabling a surprisingly effective and efficient black-box approach that avoids expensive sampling or external knowledge retrieval.
Anomaly detection in EHR data can pinpoint potentially erroneous clinical decisions with surprisingly low false alarm rates, suggesting a practical pathway to improve patient safety.
GMD algorithms, previously seen as a novel generative framework, can be understood as directly targeting fixed points of Wasserstein Gradient Flows, offering a new perspective on their optimization process.
Modeling 10,000+ correlated outputs is now tractable: T-LVMOGP offers a scalable alternative to restrictive low-rank MOGPs by learning a flexible deep kernel in a shared embedding space.
LLMs can now impute missing healthcare data well enough to improve causal treatment effect estimation from real-world EHRs, even with 80% missingness.
Forget rigid memory structures: Memini lets your LLM's external knowledge evolve organically, learning and forgetting like a brain.
Discovering spatial regions and their temporal signatures in massive time series data just got much faster and easier, thanks to a new method that scales log-linearly with the number of time series.
Training data order matters more than you think: reordering your data can significantly improve unsupervised domain adaptation by reducing variance in domain discrepancy estimates.
Forget fine-tuning: "skill neologisms"—new soft tokens—let you inject skills into LLMs without weight updates, composing them zero-shot for flexible knowledge expansion.
Steering LLMs with conceptors—soft projection matrices capturing the full semantic subspace—yields more robust control and enables Boolean logic for composing concepts, moving beyond the limitations of single-vector steering.
Conformal prediction for graph time series doesn't have to break down: by conditioning on low-frequency trends, you can restore exchangeability and get valid uncertainty estimates.
Forget retraining: this model learns interpretable logical rules from data in a zero-shot manner by encoding literals with domain-agnostic statistical properties.
Tabular data synthesis no longer needs to sacrifice privacy for quality: pretraining on diverse datasets lets models generalize from limited context, breaking the traditional tradeoff.
Symmetric spectral analysis of attention is fundamentally blind to information flow direction, but a simple asymmetry coefficient can restore the signal.
Standard multimodal fusion can hurt performance in emotion recognition, but this new approach knows when to drop modalities, leading to state-of-the-art results.
GNN uncertainty just got a whole lot easier: QpiGNN delivers better coverage and tighter intervals without the quantile gymnastics.
Overcome limitations in capturing complex user-service dependencies with a novel tensor decomposition method that significantly boosts QoS prediction accuracy.
LLMs can construct interpretable, multi-layered models of individual student cognition from journal entries, opening new possibilities for personalized education.
Forget opaque transformers: Gyan offers SOTA language modeling with full interpretability, lower compute, and human-like compositional understanding.
Incentivizing honest participation in federated learning is now possible without ground truth labels, even when some participants are trying to game the system.
Carbery's conjectured improvement to the triangle inequality in Lp spaces is false for p > 2, but a weaker version holds true with a sharp exponent.
Hallucination detection can be nearly as effective with a single forward pass as with expensive multi-sample methods.
Forget relying on LLMs to judge themselves: this "Concept Field" approach uses vector math on text corpora to detect hallucinations and novelty, offering a fast, interpretable, and black-box alternative.
Think-Aloud data doesn't just improve cognitive model fit; it fundamentally reshapes the discovered model structure, revealing cognitive mechanisms undetectable from behavior alone.
Interventions on LLMs, like knowledge editing or unlearning, can have surprising side effects that this automated pipeline can now surface and validate.
Shuffling activations, a popular defense in secure Transformer inference, crumbles under a new alignment attack that recovers model weights for just $1.
Forget expert intuition – language trends in patent filings can foresee technological breakthroughs years before they happen.
L2 learners' struggles with idioms, captured in a new eye-tracking dataset, offer a cognitively-grounded benchmark for evaluating how well LLMs truly "understand" figurative language.
Current reward models are surprisingly bad at judging story quality, achieving only 66% accuracy in selecting human-preferred narratives – a gap closed by a new, purpose-built reward model.
Teachers can now scalably provide high-quality, personalized feedback to students by leveraging a multi-LLM system that synthesizes rubric data and qualitative observations, while retaining control through a teacher-in-the-loop workflow.
Forget stilted, unconvincing VR characters: EBM-RL's novel reward decomposition finally makes video-grounded role-playing dialogue feel immersive.
Automating rubric-based feedback on presentation slides is now feasible and perceived as useful, thanks to LLMs and learning analytics dashboards.
Identity-preserving video generation just got a whole lot more faithful: FaithfulFaces maintains identity even under extreme pose variations and occlusions, a feat previous methods struggled with.
LLM uncertainty can be efficiently estimated *without* sampling by measuring the stability of output distributions under semantically equivalent input perturbations.
AI-powered learning systems often fail adult learners because they're built for kids: here are 19 guidelines to fix that.
Unlock scalable, high-quality singing voice synthesis by directly generating structured musical scores from audio, outperforming existing systems on multiple datasets.
HeterSEED achieves state-of-the-art performance on heterophilic heterogeneous graphs by decoupling semantic and structural information, offering a more robust approach than relying on feature similarity alone.
Ditching diffusion's noise-denoising, RLFSeg uses Rectified Flow to directly predict segmentation masks from text prompts, unlocking zero-shot performance gains.
LLMs can get up to 6x more logically consistent without human feedback, simply by fusing NLI scores into the DPO training loop.
A judge-orchestrated ensemble of diverse LLMs trounces single models in multi-turn response generation, proving that strategic model selection beats brute force scaling.
LMs encode grammaticality as a distinct feature in their hidden representations, separable from raw string probability and generalizable across languages.
LLMs ace MRI multiple-choice tests, but can't actually recall basic facts about GE scanners, revealing a dangerous gap between perceived and actual competence.
Overconfident predictions plague mental health prediction models, but this new framework leverages evidential learning to provide more trustworthy uncertainty estimates and human-understandable reasoning signals.
LLMs differ most not in personality, but in how they represent themselves as having (or not having) rich internal experience.
Attention heads hold the key to detecting LLM hallucinations, offering a lightweight, white-box alternative to expensive sampling or external models.
TabEmbed leapfrogs existing text embedding models to achieve SOTA performance on tabular data by reformulating tasks as semantic matching problems and using contrastive learning.
Forget full fine-tuning: QLoRA on 7B models can match the perplexity of fully fine-tuned smaller models for low-resource languages, while slashing the parameter count by 40x.