Authors: William Toner and Luke Darlow
This is the official repository to replicate the Ordinary Least Squares models from "An Analysis of Linear Time Series Forecasting Models".
We chose to keep this repository minimal for ease of use, thereby refraining from including any baseline comparitor methods. For the paper we ensured that all comparison methods were not affected by the 'drop_last' problem. See the FITS repo, where we worked with those authors to highlight and remedy this issue.
This repository requires only Pandas, Numpy and SKLearn (no deep learning frameworks), yet produces models that are comparable or better (see Table 3 in the paper) than state-of-the-art.
Commands to replicate the main performance results in Table 2 can be found in run_main.sh, while commands for regularised models in Table 3 can be found in run_regularised.sh.
By setting the dataset to 'custom' and providing the correct root, csv filename, and train/test percentages, you can run OLS solutions for custom csv datasets, as follows:
python main.py --dataset custom --context_length 720 --horizon 96 --root data/ --custom_csv_filename path_to_data.csv --custom_train_percentage 0.6 --custom_test_percentage 0.2