fintech | AI, Tech & Life

It works to copy custom HTML into post.

Posted on October 26, 2022 by sheng gao Leave a comment

The post is to test whether custom HTML can work in wordpress. Unfortunately, only table display and figure cannot. Share strategy backtest report and predicted signal. Not so correct, but ok as reference.

TQQQ	predict open	predict close	predict high	predict low	Signal
2022-10-25	21.36	22.08	22.07	21.06	buy
2022-10-26	22.46	22.55	23.14	22.06	buy

Key Performance Metrics

Metric	Strategy
Risk-Free Rate	0.0%
Time in Market	74.0%

Cumulative Return	48.62%
CAGR﹪	17.1%

Sharpe	0.98
Prob. Sharpe Ratio	93.65%
Smart Sharpe	0.96
Sortino	1.37
Smart Sortino	1.35
Sortino/√2	0.97
Smart Sortino/√2	0.95
Omega	1.22

Max Drawdown	-13.71%
Longest DD Days	209
Volatility (ann.)	17.71%
Calmar	1.25
Skew	-0.46
Kurtosis	4.98

Expected Daily	0.06%
Expected Monthly	1.29%
Expected Yearly	14.12%
Kelly Criterion	10.26%
Risk of Ruin	0.0%
Daily Value-at-Risk	-1.77%
Expected Shortfall (cVaR)	-1.77%

Max Consecutive Wins	7
Max Consecutive Losses	5
Gain/Pain Ratio	0.22
Gain/Pain (1M)	1.28

Payoff Ratio	0.93
Profit Factor	1.22
Common Sense Ratio	1.26
CPC Index	0.64
Tail Ratio	1.03
Outlier Win Ratio	4.78
Outlier Loss Ratio	3.66

MTD	-1.04%
3M	-1.24%
6M	-5.57%
YTD	-0.92%
1Y	6.45%
3Y (ann.)	17.1%
5Y (ann.)	17.1%
10Y (ann.)	17.1%
All-time (ann.)	17.1%

Best Day	6.16%
Worst Day	-5.28%
Best Month	13.99%
Worst Month	-7.78%
Best Year	24.58%
Worst Year	-0.92%

Avg. Drawdown	-2.9%
Avg. Drawdown Days	24
Recovery Factor	3.55
Ulcer Index	0.06
Serenity Index	0.85

Avg. Up Month	4.33%
Avg. Down Month	-2.68%
Win Days	56.84%
Win Month	58.06%
Win Quarter	63.64%
Win Year	66.67%

EOY Returns

Year	Return	Cumulative
2020	23.01%	24.58%
2021	20.02%	20.41%
2022	0.56%	-0.92%

Worst 10 Drawdowns

Started	Recovered	Drawdown	Days
2020-09-03	2021-01-21	-13.71	140
2022-03-30	2022-10-25	-12.64	209
2021-12-28	2022-02-28	-10.51	62
2021-07-08	2021-10-21	-8.82	105
2021-04-22	2021-06-17	-7.73	56
2021-02-16	2021-04-01	-5.41	44
2021-12-09	2021-12-27	-4.87	18
2021-12-01	2021-12-07	-4.68	6
2021-01-27	2021-02-12	-4.64	16
2020-07-23	2020-08-03	-2.63	11

How to create long-short strategy using backtrader

Posted on October 2, 2022 by sheng gao Leave a comment

In the last post How to build machine learning model to generate trading signal, I share the general processing flow to apply machine learning based signal prediction to create a long strategy. Now I will share how to create a long-short strategy using backtrader to improve the trading performance. In the following, I will discuss four topics:

How to use personalized data format (CSV) in backtrader
How to add personalized signal line in plot
How to set a long-short strategy
Performance comparison between only-long and long-short stragtegy

How to use personalized CSV format

Backtrader provides API to read Yahoo-format OHLC CSV file. When new columns in CSV, you need write your own CSV reader. For example, in machine learning based trading, I predict buy/sell/hold signal and write OHLC together with predicted signal in CSV, like

Example of ML-based signal generation for trading (column predict is signal, 0=hold,1=buy,2=sell)

The CSV cannot read using Yahoo CSV reader in backtrader. In order to read the format, write a new reader based on base class, bt.feeds.PandasData, in backtrader.

class MLPredictCsv(bt.feeds.PandasData):
    '''
    Desc: for customized CSV format
    '''

    # What new data will be availible in the Stratagies line object
    lines = ('predict',)

    # Which columns go to which variable
    params = (
        ('open', 'Open'),
        ('high', 'High'),
        ('low', 'Low'),
        ('close', 'Close'),
        ('Adj Close','Adj Close'),
        ('volume', 'Volume'),
        ('openinterest', 'openinterest'),
        ('predict', 'signal'),
    )

In the above personalized CSV reader, each element in params is a mapping, first value being backtrader internal variable name which is actually used in backtrade test and second being the column name in CSV. Then you can read your CSV like, and use data as normal.

mlpredicted_signal = pd.read_csv(ml_predict_csv_file,
                                parse_dates=True,
                                index_col=0,                             
                            )
 data = MLPredictCsv(dataname=mlpredicted_signal)

How to personalize signal line

Because signal is predicted by third-party tool, the buy/sell/hold signal will not be automatically display in plot. In order to display personal lines, you need write another class based on bt.Indicator.

 class MLSignal(bt.Indicator):

    lines = ('predict',)

    def __init__(self):
        self.lines.predict = self.data0.predict

Then add it in __init__() of strategy class, like

class MLStrategy(bt.Strategy):
    params = dict(
        onlylong = False
    )
    
    def log(self, txt, dt=None):
        ''' Logging function fot this strategy'''
        dt = dt or self.datas[0].datetime.date(0)
        print('%s, %s' % (dt.isoformat(), txt))

    def __init__(self):
        # Keep a reference to the "close" line in the data[0] dataseries
        self.dataclose = self.datas[0].close
        self.dataopen = self.datas[0].open
        self.ml_signal = MLSignal(self.data)
        # To keep track of pending orders
        self.order = None

Then in plot, MLSignal predict line will be shown. Otherwise, it is not.

How to use long-short strategy

In the backtrader source code, there is an example file, LongShortStrategy.py, to show how to use long-short. In the following, I will share my complete code for long-short & long strategy test.

#*-* coding: utf-8 *-*
#!/usr/bin/env python3

'''
long-short strategy evaluation
'''
from __future__ import (absolute_import, division, print_function,
                        unicode_literals)

import datetime  # For datetime objects
import os.path  # To manage paths
import sys  # To find out the script name (in argv[0])
import backtrader as bt
import backtrader.indicators as btind
from backtrader.feeds import GenericCSVData

import itertools
import backtrader as bt
import pandas as pd
import quantstats
from backtrader.analyzers import (SQN, AnnualReturn, TimeReturn, SharpeRatio,
                                  TradeAnalyzer)

# onlylong = False

class MLSignal(bt.Indicator):

    lines = ('predict',)

    def __init__(self):
        self.lines.predict = self.data0.predict

class MLPredictCsv(bt.feeds.PandasData):
    '''
    Desc: for customized CSV format
    '''

    # What new data will be availible in the Stratagies line object
    lines = ('predict',)

    # Which columns go to which variable
    params = (
        ('open', 'Open'),
        ('high', 'High'),
        ('low', 'Low'),
        ('close', 'Close'),
        ('Adj Close','Adj Close'),
        ('volume', 'volume'),
        ('openinterest', 'openinterest'),
        ('predict', 'signal'),
    )

# Create a Stratey
class MLStrategy(bt.Strategy):
    params = dict(
        onlylong = False
    )
    
    def log(self, txt, dt=None):
        ''' Logging function fot this strategy'''
        dt = dt or self.datas[0].datetime.date(0)
        print('%s, %s' % (dt.isoformat(), txt))

    def __init__(self):
        # Keep a reference to the "close" line in the data[0] dataseries
        self.dataclose = self.datas[0].close
        self.dataopen = self.datas[0].open
        self.ml_signal = MLSignal(self.data)
        # To keep track of pending orders
        self.order = None

    def notify_order(self, order):
        if order.status in [order.Submitted, order.Accepted]:
            # Buy/Sell order submitted/accepted to/by broker - Nothing to do
            return

        # Check if an order has been completed
        # Attention: broker could reject order if not enough cash
        if order.status in [order.Completed]:
            if order.isbuy():
                self.log('BUY EXECUTED, %.2f' % order.executed.price)
            elif order.issell():
                self.log('SELL EXECUTED, %.2f' % order.executed.price)

            self.bar_executed = len(self)

        elif order.status in [order.Canceled, order.Margin, order.Rejected]:
            self.log('Order Canceled/Margin/Rejected')

        # Write down: no pending order
        self.order = None

    def next(self):
        #if order is active, no new order allow
        if self.order:
            return

        # Check if we are in the market
        if self.ml_signal.lines.predict > 0:
            if self.position:
                self.log('CLOSE SHORT , %.2f' % self.data.close[0])
                self.close()
            # Buy
            self.log('BUY CREATE, %.2f' % self.dataclose[0])
            self.order = self.buy()
        elif self.ml_signal.lines.predict < 0:
            if self.position:
                self.log('CLOSE LONG , %.2f' % self.data.close[0])
                self.close()
            
            if not self.p.onlylong:
                self.log('SELL CREATE , %.2f' % self.data.close[0])
                self.sell()

def strategyEvaluate(tick_symbol, ml_predict_csv, strategy_log_file, quant_output, quant_output_html, n_stake = 40, cash_capital = 1000, is_onlylong = False):
    '''
    Desc: ML predict signal based strategy evaluation

    '''
    mlpredicted_signal = pd.read_csv(ml_predict_csv,
                                parse_dates=True,
                                index_col=0,                             
                            )
    def mapsignal(x):
        if x == 0:
            return 0 #bt.SIGNAL_NONE
        elif x == 1:
            return 1 #bt.SIGNAL_LONG
        elif x == 2:
            return -1 #bt.SIGNAL_SHORT
    
    mlpredicted_signal['signal'] = mlpredicted_signal['signal'].transform(mapsignal)

    yr1, mth1, day1 = list(map(lambda x:int(x),str(mlpredicted_signal.index[0]).split(' ')[0].split('-')))
    yr2, mth2, day2 = list(map(lambda x:int(x),str(mlpredicted_signal.index[-1]).split(' ')[0].split('-')))
    
    mlpredicted_signal['openinterest'] = 0
    data = MLPredictCsv(dataname=mlpredicted_signal)

    # create Cerebro instance and attach data to it
    cerebro = bt.Cerebro()
    cerebro.adddata(data)
    # Add a strategy
    cerebro.addstrategy(MLStrategy, onlylong = is_onlylong)
    # Set our desired cash start
    cerebro.broker.setcash(cash_capital)

#     cerebro.broker.setcommission(commission=0)

    #Add strategy to Cerebro
    cerebro.addanalyzer(bt.analyzers.SharpeRatio, _name='sharpe_ratio')
    cerebro.addanalyzer(bt.analyzers.PyFolio, _name='PyFolio')
    cerebro.addanalyzer(TradeAnalyzer)

    # better net liquidation value view
    cerebro.addobserver(bt.observers.Value)

    # Default position size
    cerebro.addsizer(bt.sizers.SizerFix, stake=n_stake)    
    
    #add output log file 
    cerebro.addwriter(bt.WriterFile, csv=True, out=strategy_log_file)    


    # Print out the starting conditions
    print('Starting Portfolio Value: %.2f' % cerebro.broker.getvalue())

    # Run over everything
    results = cerebro.run()

    # Print out the final result
    print('Final Portfolio Value: %.2f' % cerebro.broker.getvalue())

    #Get strategy stats
    strat = results[0]
    portfolio_stats = strat.analyzers.getbyname('PyFolio')
    returns, positions, transactions, gross_lev = portfolio_stats.get_pf_items()
    returns.index = returns.index.tz_convert(None)
    
    # print(returns)
    # print(positions)
    # print(portfolio_stats)
    quantstats.reports.html(returns, output = quant_output, download_filename = quant_output_html, title = tick_symbol)
    
    import webbrowser
    webbrowser.open(quant_output_html)

    cerebro.plot(iplot=False)

if __name__ == '__main__':
    tick_symbol = 'TQQQ'
    signal_file = f'{tick_symbol}_predict_eval_test.csv'
    o_strategy_log_file = f'{tick_symbol}_predict_eval_strategy.log'
    o_quant_output_file = f'{tick_symbol}_predict_eval_strategy.stats'
    o_quant_output_html = f'{tick_symbol}_predict_eval_strategy.html'
    n_stake = 10
    cash_capital = 1000
    is_onlylong = False #True, False=long-short, True=only long without short

    strategyEvaluate(tick_symbol, signal_file, o_strategy_log_file, o_quant_output_file, o_quant_output_html, n_stake, cash_capital, is_onlylong)

Performance comparison between longshort & long only

The obvious difference between long & longshort in that in 2022, longshort is quite better than long only.

Summary

Issues in ML predicted signal:

Prediction is still not robust enough. Some parameters changes will significantly affect performance
Predicted signal based on daily, it is too many in/out operation. Need further post-optimize the signal
Short is still very dangerous. As a naive, it is just for paper trading and test. Complete strategy need including many factors like signal pruning, risk control and capital management. Currently have no knowledge.

Bokeh – A Python package to create figure & UI

Posted on March 7, 2022 by sheng gao Leave a comment

To plot figure in Python, matplotlib is an native module. Now there is another good option, Bokeh, https://docs.bokeh.org/en/latest/index.html. Bokeh is Python lib, which core module is written by JS.

Using Bokeh, you can create more interactive & beautiful user interface using build-in models, or even using native html script (I like). This is a snapshot I use Bokeh to display candlestick using selected date range, and also update information in a table (integrating native html script).

AI, Tech & Life

涓涓细流，汇成江河

Tag Archives: fintech