Modeling Real-Time Stocks Using Monte Carlo Simulations

Presented by:

Keyur Chaudhari

Prabhas Reddy Onteru

Shruti Goswami

Problem Statement

The stock market is a dynamic system influenced by numerous factors.
Accurate predictions can reduce investment risk by helping investors make informed decisions.

Objective: Model and forecast stock prices using Monte Carlo simulations.
Goal: Provide insights to aid in risk assessment and strategic investment.

Related Work

ARIMA:
- Best for short-term; limited in capturing long-term trends.
- Assumes linearity, struggles with stock market volatility.
Markov Chains:
- Assumes future depends only on the current state, ignoring past events.
- Can't model long-term patterns and sudden drifts in stocks effectively.

Traditional Methods:

Deep Learning Methods (e.g., LSTM, Neural Networks):

Interpretability Issues : Often a “black box,” limiting transparency for financial strategy and risk assessment.
Hyperparameter Sensitivity : Demands time-consuming tuning for optimal performance.
Computationally Expensive : Requires high-performance hardware and large datasets, which increases cost.

Methodology: ARIMA Model

If the series is non-stationary, differencing (using Integrated terms) can be applied to make the series stationary.
Autoregressive (AR) terms: Models the relationship between an observation and its previous values.
This can be expressed as:
- \(Y_t = c + \phi_1 Y_{t-1} + \phi_2 Y_{t-2} + \cdots + \phi_p Y_{t-p} + \epsilon_t\)
where \(Y_{t}\) is the current value, \(\phi_1, \ldots, \phi_p\)
are the autoregressive coefficients, and \(ϵ_{t}\) is white noise
Similarly Moving Average terms models relationship between an observation and error terms.
We have used statsmodels.tsa.arima.model to automatically find ARIMA parameters that best fit the data.

Figure 1 : Original Data Plot

reference : https://neptune.ai/blog/arima-sarima-real-world-time-series-forecasting-guide

Figure 2 : Plot With d=1(differencing factor)

Methodology: Markov Chain Modeling

Methodology: LSTM Network

Training:
- LSTM can capture both short-term and long-term dependencies.
- Unlike RNNs which use only hidden states to pass information from previous states, LSTM utilizes Forget, Input, and Output gates for optimal data flow over time
Prediction:
- Given some history of prices, the prices will be passed through learned parameters to get output predictions.

reference : https://thorirmar.com/post/insight_into_lstm/

Figure 3 : LSTM Cell

Methodology: Monte Carlo Simulation

Monte Carlo Simulation generates multiple future price paths where each one is random walk and uses these simulations to get predictions for the random variable associated.
Geometric Brownian Motion (GBM): Geometric Brownian Motion (GBM) is a stochastic process widely used for modeling stock prices. It assumes that the logarithm of the stock price follows a Brownian motion with drift.

Figure 4 : 100K Simulations on APPL Dataset

Algorithm for GBM

Results

We have considered AAPL dataset obtained from Yahoo Finance (yfinance) which represents the APPLE stocks

Dataset	Training Period	Testing period
Dataset-A	2015-2020	2020-2021
Dataset-B	2015 to Oct 2020	Nov 2020 to March 2021

Table 2 : Simulation Results For Different Sample Sizes On DATASET-A

Simulation Size	MSE	MAE	RMSE
100	249.83	16.33	18.70
100k	150.66	9.29	12.27
50k	340.26	18.12	20.98

Table 1 : Different Datasets considered

The simulation results indicate that there need not be positive correlation between number of simulations and performance. So we have looped through all number of simulations with step size of 100 and reported the best obtained.

Model	MSE	MAE	RMSE	Total Time
ARIMA	1310.78	29.08	36.20	10 sec
Markov	187.09	12.05	13.93	40 Sec
LSTM	61.40	4.99	5.89	1 Hour 10 Min
GBM	150.66	9.29	12.27	1 Min

Model	MSE	MAE	RMSE
ARIMA	156.43	9.94	12.51
Markov	283.57	14.68	16.84
LSTM	25.72	4.17	5.07
GBM	73.64	6.52	8.58

Table 3 : Dataset-A results

Table 4 : Dataset-B results

Results

Comparison

Figure 5 : Comparison of predictions of multiple methods over Dataset-A

Figure 6 : Comparison of predictions of multiple methods over Dataset-B

Conclusions

ARIMA tends to perform worst as it cannot handle non linearities present in stock data.
Monte Carlo simulations have performed well compared to other methods like ARIMA and Markov models.
However, LSTM models show better results when more parameters are used, making them more effective at capturing complex stock price patterns in certain cases.
But we can see Monte Carlo Simulations as a trade off between the performance and Computation.

Modeling_ppt

By PRABHAS ONTERU

Modeling Real-Time Stocks Using Monte Carlo Simulations​

Problem Statement

Related Work

Methodology: ARIMA Model

Methodology: Markov Chain Modeling

Methodology: LSTM Network​

Methodology: Monte Carlo Simulation​

Algorithm for GBM

Results

Results

Comparison

Conclusions

Modeling_ppt

More from PRABHAS ONTERU

Modeling Real-Time Stocks Using Monte Carlo Simulations

Methodology: LSTM Network

Methodology: Monte Carlo Simulation