Strategy Hallucination
The model emits plausible trading logic that looks quantitative but has no market rationale, no statistical grounding, or invalid assumptions.
It creates code that passes syntax checks while smuggling in false causality.Perspectiva estructurada
Una comparación estructurada de cuatro generaciones de software de trading: plataformas de la era de indicadores, frameworks de estrategias, pipelines de sistemas y flujos de trabajo LLM de prompt a código.
El cambio clave no es solo el lenguaje o la experiencia de usuario. Cada generación cambió la abstracción central que los traders usaban para expresar, probar y operacionalizar ideas de estrategias.
| Dimensión | Gen 1: Indicator Era~2005-2012 | Gen 2: Strategy Era~2012-2018 | Gen 3: System Era~2018-2023 | Gen 4: LLM Era2023-present |
|---|---|---|---|---|
| Time period | ~2005-2012 | ~2012-2018 | ~2018-2023 | 2023-present |
| Representative products | MT4 / MQL4 | Backtrader, freqtrade, vnpy | QuantConnect, WorldQuant BRAIN | ChatGPT + brokerage API |
| Core abstraction | Modular indicators | Packaged strategy logic: entry, exit, sizing | Feature engineering pipeline | Natural language to code |
| Typical workflow | Drag indicators, set conditions, backtest | Write strategy code, optimize parameters, backtest | Build feature pipeline, train, validate, execute | Prompt, generate strategy, backtest |
| Main progress | Indicators moved from books and forums into reusable components | Complete strategies became packageable, shareable, and reproducible | The pipeline became the product, with standardized validation | Generation is fast and the entry barrier is low |
| Typical trap | Indicator worship and holy-grail thinking | Parameter overfitting: genetic search finds coincidence, not robustness | Data leakage and survivorship bias | Strategy hallucination and backtest overfitting blindness |
| Failure root cause | Rules are manual and lack systematic validation | The optimization target is backtest return, not robustness | Model complexity hides data-quality problems | The model skips infrastructure accumulated across the first three generations |
Los LLM reducen el costo de producir código de estrategias, pero no eliminan la necesidad de infraestructura de validación.
The model emits plausible trading logic that looks quantitative but has no market rationale, no statistical grounding, or invalid assumptions.
It creates code that passes syntax checks while smuggling in false causality.The model treats a profitable backtest as validation and misses leakage, parameter mining, unstable regimes, or survivorship bias.
It accelerates curve fitting because generation speed multiplies untested variants.spread = asset_a.close - hedge_ratio * asset_b.close
z_score = (spread - spread.mean()) / spread.std()
if z_score > 2:
short(asset_a)
long(asset_b)
elif z_score < -2:
long(asset_a)
short(asset_b)Modelo + Herramientas
Los LLM son útiles cuando están envueltos en herramientas. Un stack de trading serio incluye validación de datos, motores de precios, lógica de ejecución, controles de riesgo, monitoreo y disciplina de despliegue.
Los sistemas al estilo Jane Street muestran el patrón: el modelo es una capa dentro de una cadena de herramientas más grande, no el producto completo.
Posicionamiento de StratCraft
StratCraft no es una afirmación Gen 5. Ofrece a los desarrolladores de frameworks de estrategias y pipelines de sistemas rendimiento local de backtesting de nivel C++, aislamiento de plugins y flujos de trabajo de validación repetibles.
Usa el motor de backtest local y el ecosistema de plugins para pasar de ideas generadas a sistemas validados.