Strategy Hallucination
The model emits plausible trading logic that looks quantitative but has no market rationale, no statistical grounding, or invalid assumptions.
It creates code that passes syntax checks while smuggling in false causality.Analyse structuree
Une comparaison structuree de quatre generations de logiciels de trading : plateformes a indicateurs, frameworks de strategies, pipelines systeme et workflows LLM prompt-vers-code.
Le changement cle n'est pas seulement le langage ou l'UX. Chaque generation a modifie l'abstraction fondamentale que les traders utilisaient pour exprimer, tester et operationnaliser leurs idees de strategies.
| Dimension | Gen 1: Indicator Era~2005-2012 | Gen 2: Strategy Era~2012-2018 | Gen 3: System Era~2018-2023 | Gen 4: LLM Era2023-present |
|---|---|---|---|---|
| Time period | ~2005-2012 | ~2012-2018 | ~2018-2023 | 2023-present |
| Representative products | MT4 / MQL4 | Backtrader, freqtrade, vnpy | QuantConnect, WorldQuant BRAIN | ChatGPT + brokerage API |
| Core abstraction | Modular indicators | Packaged strategy logic: entry, exit, sizing | Feature engineering pipeline | Natural language to code |
| Typical workflow | Drag indicators, set conditions, backtest | Write strategy code, optimize parameters, backtest | Build feature pipeline, train, validate, execute | Prompt, generate strategy, backtest |
| Main progress | Indicators moved from books and forums into reusable components | Complete strategies became packageable, shareable, and reproducible | The pipeline became the product, with standardized validation | Generation is fast and the entry barrier is low |
| Typical trap | Indicator worship and holy-grail thinking | Parameter overfitting: genetic search finds coincidence, not robustness | Data leakage and survivorship bias | Strategy hallucination and backtest overfitting blindness |
| Failure root cause | Rules are manual and lack systematic validation | The optimization target is backtest return, not robustness | Model complexity hides data-quality problems | The model skips infrastructure accumulated across the first three generations |
Les LLM reduisent le cout de production du code de strategie, mais ne suppriment pas le besoin d'une infrastructure de validation.
The model emits plausible trading logic that looks quantitative but has no market rationale, no statistical grounding, or invalid assumptions.
It creates code that passes syntax checks while smuggling in false causality.The model treats a profitable backtest as validation and misses leakage, parameter mining, unstable regimes, or survivorship bias.
It accelerates curve fitting because generation speed multiplies untested variants.spread = asset_a.close - hedge_ratio * asset_b.close
z_score = (spread - spread.mean()) / spread.std()
if z_score > 2:
short(asset_a)
long(asset_b)
elif z_score < -2:
long(asset_a)
short(asset_b)Modele + Outils
Les LLM sont utiles lorsqu'ils sont encapsules dans des outils. Un stack de trading serieux comprend la validation des donnees, les moteurs de tarification, la logique d'execution, les controles de risque, la surveillance et la discipline de deploiement.
Les systemes de type Jane Street montrent le schema : le modele est une couche a l'interieur d'une chaine d'outils plus large, pas le produit entier.
Positionnement StratCraft
StratCraft ne pretend pas etre Gen 5. Il offre aux developpeurs de frameworks de strategies et de pipelines systeme des performances de backtesting locales de niveau C++, l'isolation des plugins et des workflows de validation reproductibles.
Utilisez le moteur de backtest local et l'ecosysteme de plugins pour passer d'idees generees a des systemes valides.