This paper introduces BlindTrade, an anonymization-first framework that validates the legitimacy of LLM trading-agent signals by removing ticker and company-name identifiers. Four LLM agents generate trading scores and reasoning; the framework builds a graph from the reasoning embeddings for a GNN and trades using a PPO-DSR policy. On 2025 YTD data (through 2025-08-01), it achieves a Sharpe ratio of 1.40 +/- 0.22 across 20 seeds, suggesting that the agents identify meaningful market dynamics beyond memorization, although performance depends on the market regime.
LLM-powered trading agents can still achieve a Sharpe ratio of 1.40 even when completely blindfolded to ticker symbols and company names, suggesting genuine understanding of market dynamics.
For LLM trading agents to be genuinely trustworthy, they must demonstrate understanding of market dynamics rather than exploitation of memorized ticker associations. Building responsible multi-agent systems demands rigorous signal validation: proving that predictions reflect legitimate patterns, not pre-trained recall. We address two sources of spurious performance: memorization bias from ticker-specific pre-training, and survivorship bias from flawed backtesting. Our approach is to blindfold the agents by anonymizing all identifiers, then verify whether meaningful signals persist. BlindTrade anonymizes tickers and company names, and four LLM agents output scores along with reasoning. We construct a graph from the reasoning embeddings for a GNN and trade using a PPO-DSR policy. On 2025 YTD (through 2025-08-01), we achieve a Sharpe ratio of 1.40 +/- 0.22 across 20 seeds and validate signal legitimacy through negative control experiments. To assess robustness beyond a single out-of-sample (OOS) window, we additionally evaluate an extended period (2024-2025), which reveals market-regime dependency: the policy excels in volatile conditions but shows reduced alpha in trending bull markets.
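The anonymization step can be illustrated with a minimal sketch. This is not the authors' implementation: the `ASSET_NNN` alias scheme, the `companies` lookup table, and the function names `build_alias_map` and `anonymize` are all assumptions made for illustration. The idea is only that every ticker and every name variant of a company collapses to one opaque alias before the text reaches an agent.

```python
import re


def build_alias_map(companies):
    """Map every ticker and company name to one opaque alias per company.

    `companies` is a hypothetical lookup: ticker -> list of name variants.
    """
    alias_map = {}
    for index, (ticker, names) in enumerate(sorted(companies.items()), start=1):
        alias = f"ASSET_{index:03d}"  # assumed alias format, not from the paper
        for identifier in [ticker, *names]:
            alias_map[identifier.lower()] = alias
    return alias_map


def anonymize(text, alias_map):
    """Replace whole-token, case-insensitive matches of known identifiers."""
    # Try longer identifiers first so "Apple Inc." is matched before "Apple".
    ordered = sorted(alias_map, key=len, reverse=True)
    pattern = re.compile(
        r"(?<!\w)(" + "|".join(re.escape(s) for s in ordered) + r")(?!\w)",
        flags=re.IGNORECASE,
    )
    return pattern.sub(lambda m: alias_map[m.group(1).lower()], text)


companies = {"AAPL": ["Apple Inc.", "Apple"], "MSFT": ["Microsoft"]}
alias_map = build_alias_map(companies)
masked = anonymize("Apple Inc. (AAPL) outperformed Microsoft this week.", alias_map)
print(masked)
```

Case-insensitive matching is a simplification: tickers that are also ordinary words (for example `ALL` or `A`) would be over-masked, so a real pipeline would likely need stricter matching rules for short tickers.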