thrux

AI Trading Research

Machine learning meets markets - algorithms, strategies, and real research

AI Trading Reality Check

Renaissance Technologies (Medallion Fund): 66% annual returns over 30 years
Retail algorithmic traders profitable: ~5-10%
HFT firms' market share: 50-70% of US equity volume
Typical ML model decay: 3-6 months

Foundational Knowledge

Essential Book

Advances in Financial Machine Learning

Marcos López de Prado

The bible of ML trading. Covers feature engineering, backtesting pitfalls, and production systems. Written by AQR/Guggenheim quant.

Book Info → Code Examples →
Academic Course

Machine Learning for Trading

Georgia Tech (Coursera)

Complete course covering ML algorithms, market mechanics, and portfolio optimization. Includes Python implementations.

Take Course →
Research Hub

SSRN Quantitative Finance

Academic papers on algorithmic trading, market microstructure, and ML applications. Where quants publish.

Browse Papers →
Classic Text

Algorithmic Trading

Ernest Chan

Practical guide from ex-Morgan Stanley quant. Covers strategy development, backtesting, and risk management.

Author's Blog →

Machine Learning Strategies

Deep Learning

Deep Learning for Finance

Yves Hilpisch

Neural networks, RNNs, and transformer models for trading. Python code with TensorFlow/PyTorch examples.

Code Repository → Research Paper →
Reinforcement Learning

FinRL: Deep RL for Trading

Open-source library for developing RL trading agents. Includes DQN, A2C, PPO implementations.

GitHub → Paper →
Time Series

Prophet & Neural Prophet

Facebook/Meta

Advanced time series forecasting with seasonality. Used by quant funds for price prediction.

Prophet Docs → Neural Prophet →
Feature Engineering

TA-Lib & Technical Indicators

200+ technical indicators for feature engineering. Essential for building ML trading features.

Python TA-Lib → Modern Alternative →

Backtesting & Infrastructure

Backtesting Framework

Backtrader

Python backtesting library with live trading support. Handles complex strategies and portfolio management.

Documentation → GitHub →
Professional Platform

QuantConnect

Cloud-based algorithmic trading platform. Free tier with historical data. Deploy strategies to live trading.

Platform → LEAN Engine →
Risk Analytics

PyFolio

Quantopian

Portfolio and risk analytics. Tear sheets, drawdown analysis, and performance attribution.

GitHub →
Data Pipeline

Arctic - Tick Database

High-performance datastore for time-series data. Used by Man AHL for tick data storage.

Arctic GitHub →

Cryptocurrency & DeFi Trading

Crypto ML

Freqtrade

Open-source crypto trading bot with ML support. Backtesting, hyperopt, and live trading on 100+ exchanges.

Documentation → GitHub →
On-Chain Analytics

Nansen & Glassnode

On-chain data for ML features. Wallet tracking, smart money flows, and network metrics.

Nansen → Glassnode →
DeFi Arbitrage

Flashbots & MEV

Maximum Extractable Value research. Arbitrage, liquidations, and sandwich attacks in DeFi.

Flashbots Docs → MEV Inspect →
Sentiment Analysis

Crypto Social Sentiment

NLP on crypto Twitter, Reddit, and Telegram. Real-time sentiment indicators for trading signals.

Example Code →

High-Frequency & Market Making

HFT Education

High-Frequency Trading

Irene Aldridge

Comprehensive guide to HFT strategies, technology, and regulations. Industry standard reference.

Book Info →
Market Making

Hummingbot

Open-source market making bot. Connects to centralized and decentralized exchanges.

Platform → GitHub →
Low Latency

C++ for Quantitative Finance

Building ultra-low latency trading systems. Memory management, lock-free programming, and FPGA.

C++ Jupyter →

Research Papers & Cutting Edge

Transformer Models

Temporal Fusion Transformers

Google Research

State-of-the-art time series forecasting. Beats traditional methods on financial data.

Paper → Code →
Graph Neural Networks

GNNs for Stock Prediction

Using graph structures to model market relationships. Captures sector correlations and supply chains.

Research →
Alternative Data

Satellite Data Trading

Using satellite imagery for commodity trading and economic nowcasting. Parking lots, ship tracking, crop yields.

Provider List →

Communities & Resources

Reddit Communities

Algo Trading Communities

Active communities for strategy discussion and code sharing.

r/algotrading → r/quant →
YouTube Channels

Educational Content

Video tutorials on algorithmic trading and quantitative finance.

Sentdex → QuantPy →
Data Sources

Free Financial Data

APIs for historical and real-time market data.

Alpha Vantage → Polygon.io →

Getting Started Path

1. Learn Python: NumPy, Pandas, Scikit-learn basics

2. Understand Markets: Order types, bid/ask, market microstructure

3. Simple Strategies: Moving average crossovers, mean reversion

4. Backtest Everything: Paper trade for 6 months minimum

5. Start Small: $100-500 to test infrastructure

6. Continuous Learning: Markets evolve, strategies decay

# Example: Simple ML trading strategy skeleton import pandas as pd from sklearn.ensemble import RandomForestClassifier def prepare_features(df): # Add technical indicators df['SMA_20'] = df['close'].rolling(20).mean() df['RSI'] = calculate_rsi(df['close']) return df def train_model(X_train, y_train): model = RandomForestClassifier(n_estimators=100) model.fit(X_train, y_train) return model # Remember: This is just the beginning...