Discrete-Time Mean-Variance Strategy Based on Reinforcement LearningCui, Xiangyu;Li, Xun;Shi, Yun;Zhao, Si
doi: 10.48550/arxiv.2312.15385pmid: N/A
Abstract:This paper studies a discrete-time mean-variance model based on reinforcement learning. Compared with its continuous-time counterpart in \cite{zhou2020mv}, the discrete-time model makes more general assumptions about the asset's return distribution. Using entropy to measure the cost of exploration, we derive the optimal investment strategy, whose density function is also Gaussian type. Additionally, we design the corresponding reinforcement learning algorithm. Both simulation experiments and empirical analysis indicate that our discrete-time model exhibits better applicability when analyzing real-world data than the continuous-time model.
Managing ESG Ratings Disagreement in Sustainable Portfolio SelectionCesarone, Francesco;Martino, Manuel Luis;Ricca, Federica;Scozzari, Andrea
doi: 10.48550/arxiv.2312.10739pmid: N/A
Abstract:Sustainable Investing identifies the approach of investors whose aim is twofold: on the one hand, they want to achieve the best compromise between portfolio risk and return, but they also want to take into account the sustainability of their investment, assessed through some Environmental, Social, and Governance (ESG) criteria. The inclusion of sustainable goals in the portfolio selection process may have an actual impact on financial portfolio performance. ESG indices provided by the rating agencies are generally considered good proxies for the performance in sustainability of an investment, as well as, appropriate measures for Socially Responsible Investments (SRI) in the market. In this framework of analysis, the lack of alignment between ratings provided by different agencies is a crucial issue that inevitably undermines the robustness and reliability of these evaluation measures. In fact, the ESG rating disagreement may produce conflicting information, implying a difficulty for the investor in the portfolio ESG evaluation. This may cause underestimation or overestimation of the market opportunities for a sustainable investment. In this paper, we deal with a multi-criteria portfolio selection problem taking into account risk, return, and ESG criteria. For the ESG evaluation of the securities in the market, we consider more than one agency and propose a new approach to overcome the problem related to the disagreement between the ESG ratings by different agencies. We propose a nonlinear optimization model for our three-criteria portfolio selection problem. We show that it can be reformulated as an equivalent convex quadratic program by exploiting a technique known in the literature as the k-sum optimization strategy. An extensive empirical analysis of the performance of this model is provided on real-world financial data sets.
Discounting the distant future: What do historical bond prices imply about the long term discount rate?Farmer, J. Doyne;Geanakoplos, John;Richiardi, Matteo G.;Montero, Miquel;Perelló, Josep;Masoliver, Jaume
doi: N/Apmid: N/A
Abstract:We present a thorough empirical study on real interest rates by also including risk aversion through the introduction of the market price of risk. With the view of complex systems science and its multidisciplinary approach, we use the theory of bond pricing to study the long term discount rate. Century-long historical records of 3 month bonds, 10 year bonds, and inflation allow us to estimate real interest rates for the UK and the US. Real interest rates are negative about a third of the time and the real yield curves are inverted more than a third of the time, sometimes by substantial amounts. This rules out most of the standard bond pricing models, which are designed for nominal rates that are assumed to be positive. We therefore use the Ornstein-Uhlenbeck model which allows negative rates and gives a good match to inversions of the yield curve. We derive the discount function using the method of Fourier transforms and fit it to the historical data. The estimated long term discount rate is $1.7$ \% for the UK and $2.2$ \% for the US. The value of $1.4$ \% used by Stern is less than a standard deviation from our estimated long run return rate for the UK, and less than two standard deviations of the estimated value for the US. All of this once more reinforces the support for immediate and substantial spending to combat climate change.
Increasing Profitability and Confidence by using Interpretable Model for Investment DecisionsArshad, Sahar;Latif, Seemab;Salman, Ahmad;Irfan, Saadia
doi: N/Apmid: N/A
Abstract:Financial forecasting plays an important role in making informed decisions for financial stakeholders, specifically in the stock exchange market. In a traditional setting, investors commonly rely on the equity research department for valuable reports on market insights and investment recommendations. The equity research department, however, faces challenges in effectuating decision-making due to the demanding cognitive effort required for analyzing the inherently volatile nature of market dynamics. Furthermore, financial forecasting systems employed by analysts pose potential risks in terms of interpretability and gaining the trust of all stakeholders. This paper presents an interpretable decision-making model leveraging the SHAP-based explainability technique to forecast investment recommendations. The proposed solution not only provides valuable insights into the factors that influence forecasted recommendations but also caters to investors of varying types, including those interested in daily and short-term investment opportunities. To ascertain the efficacy of the proposed model, a case study is devised that demonstrates a notable enhancement in investor's portfolio value, employing our trading strategies. The results highlight the significance of incorporating interpretability in forecasting models to boost stakeholders' confidence and foster transparency in the stock exchange domain.
Optimal insurance with mean-deviation measuresBoonen, Tim J.;Han, Xia
doi: 10.48550/arxiv.2312.01813pmid: N/A
Abstract:This paper studies an optimal insurance contracting problem in which the preferences of the decision maker given by the sum of the expected loss and a convex, increasing function of a deviation measure. As for the deviation measure, our focus is on convex signed Choquet integrals (such as the Gini coefficient and a convex distortion risk measure minus the expected value) and on the standard deviation. We find that if the expected value premium principle is used, then stop-loss indemnities are optimal, and we provide a precise characterization of the corresponding deductible. Moreover, if the premium principle is based on Value-at-Risk or Expected Shortfall, then a particular layer-type indemnity is optimal, in which there is coverage for small losses up to a limit, and additionally for losses beyond another deductible. The structure of these optimal indemnities remains unchanged if there is a limit on the insurance premium budget. If the unconstrained solution is not feasible, then the deductible is increased to make the budget constraint binding. We provide several examples of these results based on the Gini coefficient and the standard deviation.
European Football Player Valuation: Integrating Financial Models and Network TheoryCohen, Albert;Risk, Jimmy
doi: N/Apmid: N/A
Abstract:This paper presents a new framework for player valuation in European football by fusing principles from financial mathematics and network theory. The valuation model leverages a "passing matrix" to encapsulate player interactions on the field, utilizing centrality measures to quantify individual influence. Unlike traditional approaches, this model is both metric-driven and cohort-free, providing a dynamic and individualized framework for ascertaining a player's fair market value. The methodology is empirically validated through a case study in European football, employing real-world match and financial data. The paper advances the disciplines of sports analytics and financial mathematics by offering a cross-disciplinary mechanism for player valuation, and also links together two well-known econometric methods in marginal revenue product and expected present valuation.
A novel scaling approach for unbiased adjustment of risk estimatorsPitera, Marcin;Schmidt, Thorsten;Stettner, Łukasz
doi: 10.48550/arxiv.2312.05655pmid: N/A
Abstract:The assessment of risk based on historical data faces many challenges, in particular due to the limited amount of available data, lack of stationarity, and heavy tails. While estimation on a short-term horizon for less extreme percentiles tends to be reasonably accurate, extending it to longer time horizons or extreme percentiles poses significant difficulties. The application of theoretical risk scaling laws to address this issue has been extensively explored in the literature. This paper presents a novel approach to scaling a given risk estimator, ensuring that the estimated capital reserve is robust and conservatively estimates the risk. We develop a simple statistical framework that allows efficient risk scaling and has a direct link to backtesting performance. Our method allows time scaling beyond the conventional square-root-of-time rule, enables risk transfers, such as those involved in economic capital allocation, and could be used for unbiased risk estimation in small sample settings. To demonstrate the effectiveness of our approach, we provide various examples related to the estimation of value-at-risk and expected shortfall together with a short empirical study analysing the impact of our method.
Deep Reinforcement Learning for Quantitative TradingXu, Maochun;Lan, Zixun;Tao, Zheng;Du, Jiawei;Ye, Zongao
doi: 10.48550/arxiv.2312.15730pmid: N/A
Abstract:Artificial Intelligence (AI) and Machine Learning (ML) are transforming the domain of Quantitative Trading (QT) through the deployment of advanced algorithms capable of sifting through extensive financial datasets to pinpoint lucrative investment openings. AI-driven models, particularly those employing ML techniques such as deep learning and reinforcement learning, have shown great prowess in predicting market trends and executing trades at a speed and accuracy that far surpass human capabilities. Its capacity to automate critical tasks, such as discerning market conditions and executing trading strategies, has been pivotal. However, persistent challenges exist in current QT methods, especially in effectively handling noisy and high-frequency financial data. Striking a balance between exploration and exploitation poses another challenge for AI-driven trading agents. To surmount these hurdles, our proposed solution, QTNet, introduces an adaptive trading model that autonomously formulates QT strategies through an intelligent trading agent. Incorporating deep reinforcement learning (DRL) with imitative learning methodologies, we bolster the proficiency of our model. To tackle the challenges posed by volatile financial datasets, we conceptualize the QT mechanism within the framework of a Partially Observable Markov Decision Process (POMDP). Moreover, by embedding imitative learning, the model can capitalize on traditional trading tactics, nurturing a balanced synergy between discovery and utilization. For a more realistic simulation, our trading agent undergoes training using minute-frequency data sourced from the live financial market. Experimental findings underscore the model's proficiency in extracting robust market features and its adaptability to diverse market conditions.
CVA Hedging by Risk-Averse Stochastic-Horizon Reinforcement LearningDaluiso, Roberto;Pinciroli, Marco;Trapletti, Michele;Vittori, Edoardo
doi: 10.48550/arxiv.2312.14044pmid: N/A
Abstract:This work studies the dynamic risk management of the risk-neutral value of the potential credit losses on a portfolio of derivatives. Sensitivities-based hedging of such liability is sub-optimal because of bid-ask costs, pricing models which cannot be completely realistic, and a discontinuity at default time. We leverage recent advances on risk-averse Reinforcement Learning developed specifically for option hedging with an ad hoc practice-aligned objective function aware of pathwise volatility, generalizing them to stochastic horizons. We formalize accurately the evolution of the hedger's portfolio stressing such aspects. We showcase the efficacy of our approach by a numerical study for a portfolio composed of a single FX forward contract.