Długookresowa optymalizacja portfelowa z kosztami transakcyjnymi: zastosowania uczenia ze wzmocnieniem dla kryterium wrażliwego na ryzyko.

Chmura, Michał

Simple view

Full metadata view

Authors

Statistics

Długookresowa optymalizacja portfelowa z kosztami transakcyjnymi: zastosowania uczenia ze wzmocnieniem dla kryterium wrażliwego na ryzyko.

master

Alternative title

Long-term portfolio optimization with transaction costs: applications of reinforcement learning for risk-sensitive criterion.

Author

Chmura Michał

Reviewer

Pitera Marcin

Peszat Szymon

Advisor

Pitera Marcin

Date of defence

2023-09-25

Keywords in Polish

Optymalizacja portfela, Koszty transakcyjne, Ryzyko, Uczenie ze wzmocnieniem, Kryterium wrażliwe na ryzyko, Operator Bellmana, Proces decyzyjny Markowa, Entropiczna funkcja użyteczności, Strategie inwestycyjne, Efektywność obliczeniowa

Keywords in English

Portfolio optimization, Transaction costs, Risk, Reinforcement learning, Risk-sensitive criterion, Bellman operator, Markov decision process, Entropic utility function, Investment strategies, Computational efficiency

Language

Polish

Abstract in Polish

Praca koncentruje się na problemie optymalizacji portfela finansowego na długim okresie czasu, uwzględniając proporcjonalne koszty transakcyjne i ryzyko. Celem jest minimalizacja ryzyka i maksymalizacja zysku na jednostkę czasu poprzez dobór odpowiednich strategii inwestycyjnych. Autor rozważa kryterium wrażliwe na ryzyko w formie entropicznej funkcji użyteczności. Korzystamy z metod iteracji operatora Bellmana i metod uczenia ze wzmocnieniem wrażliwego na ryzyko do znalezienia optymalnej strategii. Praca składa się z trzech głównych części: wprowadzenie do pojęć portfela finansowego, kryterium wrażliwego na ryzyko i operatora Bellmana; omówienie metod uczenia ze wzmocnieniem i procesu decyzyjnego Markowa; oraz porównanie implementacji obu metod. Wyniki sugerują, że metody uczenia ze wzmocnieniem mogą być skuteczniejsze niż proste strategie typu "kup i trzymaj", porównywalne ze strategią optymalną i gorsze od metody iteracyjnej. Niemniej jednak metody te wymagają istotnie mniejszej ilości założeń, a także są efektywniejsze obliczeniowo od metody iteracyjnej. Na końcu pracy wymieniono niedociągnięcia metod uczenia ze wzmocnieniem wynikające z implementacji, oraz potencjalne usprawnienia.

Abstract in English

The paper focuses on the problem of financial portfolio optimization over a long period of time, taking into account proportional transaction costs and risk. The goal is to minimize risk and maximize profit per unit of time by selecting appropriate investment strategies. The author considers a risk-sensitive criterion in the form of an entropic utility function.We use the methods of Bellman operator iteration and risk-sensitive reinforcement learning to find the optimal strategy. The work consists of three main parts: an introduction to the concepts of financial portfolio, risk-sensitive criterion, and Bellman operator; a discussion of reinforcement learning methods and the Markov decision process; and a comparison of the implementation of both methods. The results suggest that reinforcement learning methods may be more effective than simple "buy and hold" strategies, comparable to the optimal strategy, and worse than the iterative method. Nevertheless, these methods require significantly fewer assumptions and are also more computationally efficient than the iterative method. At the end of the paper, the shortcomings of reinforcement learning methods resulting from the implementation, as well as potential improvements, are listed.

dc.abstract.en	The paper focuses on the problem of financial portfolio optimization over a long period of time, taking into account proportional transaction costs and risk. The goal is to minimize risk and maximize profit per unit of time by selecting appropriate investment strategies. The author considers a risk-sensitive criterion in the form of an entropic utility function.We use the methods of Bellman operator iteration and risk-sensitive reinforcement learning to find the optimal strategy. The work consists of three main parts: an introduction to the concepts of financial portfolio, risk-sensitive criterion, and Bellman operator; a discussion of reinforcement learning methods and the Markov decision process; and a comparison of the implementation of both methods. The results suggest that reinforcement learning methods may be more effective than simple "buy and hold" strategies, comparable to the optimal strategy, and worse than the iterative method. Nevertheless, these methods require significantly fewer assumptions and are also more computationally efficient than the iterative method. At the end of the paper, the shortcomings of reinforcement learning methods resulting from the implementation, as well as potential improvements, are listed.	pl
dc.abstract.pl	Praca koncentruje się na problemie optymalizacji portfela finansowego na długim okresie czasu, uwzględniając proporcjonalne koszty transakcyjne i ryzyko. Celem jest minimalizacja ryzyka i maksymalizacja zysku na jednostkę czasu poprzez dobór odpowiednich strategii inwestycyjnych. Autor rozważa kryterium wrażliwe na ryzyko w formie entropicznej funkcji użyteczności. Korzystamy z metod iteracji operatora Bellmana i metod uczenia ze wzmocnieniem wrażliwego na ryzyko do znalezienia optymalnej strategii. Praca składa się z trzech głównych części: wprowadzenie do pojęć portfela finansowego, kryterium wrażliwego na ryzyko i operatora Bellmana; omówienie metod uczenia ze wzmocnieniem i procesu decyzyjnego Markowa; oraz porównanie implementacji obu metod. Wyniki sugerują, że metody uczenia ze wzmocnieniem mogą być skuteczniejsze niż proste strategie typu "kup i trzymaj", porównywalne ze strategią optymalną i gorsze od metody iteracyjnej. Niemniej jednak metody te wymagają istotnie mniejszej ilości założeń, a także są efektywniejsze obliczeniowo od metody iteracyjnej. Na końcu pracy wymieniono niedociągnięcia metod uczenia ze wzmocnieniem wynikające z implementacji, oraz potencjalne usprawnienia.	pl
dc.affiliation	Wydział Matematyki i Informatyki	pl
dc.area	obszar nauk ścisłych	pl
dc.contributor.advisor	Pitera, Marcin	pl
dc.contributor.author	Chmura, Michał	pl
dc.contributor.departmentbycode	UJK/WMI2	pl
dc.contributor.reviewer	Pitera, Marcin	pl
dc.contributor.reviewer	Peszat, Szymon	pl
dc.date.accessioned	2023-09-26T21:47:45Z
dc.date.available	2023-09-26T21:47:45Z
dc.date.submitted	2023-09-25	pl
dc.fieldofstudy	matematyka	pl
dc.identifier.apd	diploma-170448-258927	pl
dc.identifier.uri	https://ruj.uj.edu.pl/xmlui/handle/item/319869
dc.language	pol	pl
dc.subject.en	Portfolio optimization, Transaction costs, Risk, Reinforcement learning, Risk-sensitive criterion, Bellman operator, Markov decision process, Entropic utility function, Investment strategies, Computational efficiency	pl
dc.subject.pl	Optymalizacja portfela, Koszty transakcyjne, Ryzyko, Uczenie ze wzmocnieniem, Kryterium wrażliwe na ryzyko, Operator Bellmana, Proces decyzyjny Markowa, Entropiczna funkcja użyteczności, Strategie inwestycyjne, Efektywność obliczeniowa	pl
dc.title	Długookresowa optymalizacja portfelowa z kosztami transakcyjnymi: zastosowania uczenia ze wzmocnieniem dla kryterium wrażliwego na ryzyko.	pl
dc.title.alternative	Long-term portfolio optimization with transaction costs: applications of reinforcement learning for risk-sensitive criterion.	pl
dc.type	master	pl
dspace.entity.type	Publication

dc.abstract.enpl

The paper focuses on the problem of financial portfolio optimization over a long period of time, taking into account proportional transaction costs and risk. The goal is to minimize risk and maximize profit per unit of time by selecting appropriate investment strategies. The author considers a risk-sensitive criterion in the form of an entropic utility function.We use the methods of Bellman operator iteration and risk-sensitive reinforcement learning to find the optimal strategy. The work consists of three main parts: an introduction to the concepts of financial portfolio, risk-sensitive criterion, and Bellman operator; a discussion of reinforcement learning methods and the Markov decision process; and a comparison of the implementation of both methods. The results suggest that reinforcement learning methods may be more effective than simple "buy and hold" strategies, comparable to the optimal strategy, and worse than the iterative method. Nevertheless, these methods require significantly fewer assumptions and are also more computationally efficient than the iterative method. At the end of the paper, the shortcomings of reinforcement learning methods resulting from the implementation, as well as potential improvements, are listed.

dc.abstract.plpl

Praca koncentruje się na problemie optymalizacji portfela finansowego na długim okresie czasu, uwzględniając proporcjonalne koszty transakcyjne i ryzyko. Celem jest minimalizacja ryzyka i maksymalizacja zysku na jednostkę czasu poprzez dobór odpowiednich strategii inwestycyjnych. Autor rozważa kryterium wrażliwe na ryzyko w formie entropicznej funkcji użyteczności. Korzystamy z metod iteracji operatora Bellmana i metod uczenia ze wzmocnieniem wrażliwego na ryzyko do znalezienia optymalnej strategii. Praca składa się z trzech głównych części: wprowadzenie do pojęć portfela finansowego, kryterium wrażliwego na ryzyko i operatora Bellmana; omówienie metod uczenia ze wzmocnieniem i procesu decyzyjnego Markowa; oraz porównanie implementacji obu metod. Wyniki sugerują, że metody uczenia ze wzmocnieniem mogą być skuteczniejsze niż proste strategie typu "kup i trzymaj", porównywalne ze strategią optymalną i gorsze od metody iteracyjnej. Niemniej jednak metody te wymagają istotnie mniejszej ilości założeń, a także są efektywniejsze obliczeniowo od metody iteracyjnej. Na końcu pracy wymieniono niedociągnięcia metod uczenia ze wzmocnieniem wynikające z implementacji, oraz potencjalne usprawnienia.

dc.affiliationpl

Wydział Matematyki i Informatyki

dc.areapl

obszar nauk ścisłych

dc.contributor.advisorpl

Pitera, Marcin

dc.contributor.authorpl

Chmura, Michał

dc.contributor.departmentbycodepl

UJK/WMI2

dc.contributor.reviewerpl

Pitera, Marcin

dc.contributor.reviewerpl

Peszat, Szymon

dc.date.accessioned

2023-09-26T21:47:45Z

dc.date.available

2023-09-26T21:47:45Z

dc.date.submittedpl

2023-09-25

dc.fieldofstudypl

matematyka

dc.identifier.apdpl

diploma-170448-258927

dc.identifier.uri

https://ruj.uj.edu.pl/xmlui/handle/item/319869

dc.languagepl

pol

dc.subject.enpl

Portfolio optimization, Transaction costs, Risk, Reinforcement learning, Risk-sensitive criterion, Bellman operator, Markov decision process, Entropic utility function, Investment strategies, Computational efficiency

dc.subject.plpl

Optymalizacja portfela, Koszty transakcyjne, Ryzyko, Uczenie ze wzmocnieniem, Kryterium wrażliwe na ryzyko, Operator Bellmana, Proces decyzyjny Markowa, Entropiczna funkcja użyteczności, Strategie inwestycyjne, Efektywność obliczeniowa

dc.titlepl

Długookresowa optymalizacja portfelowa z kosztami transakcyjnymi: zastosowania uczenia ze wzmocnieniem dla kryterium wrażliwego na ryzyko.

dc.title.alternativepl

Long-term portfolio optimization with transaction costs: applications of reinforcement learning for risk-sensitive criterion.

dc.typepl

master

dspace.entity.type

Publication

Affiliations

No affiliation

Chmura, Michał

Pitera, Marcin

Peszat, Szymon

* The migration of download and view statistics prior to the date of April 8, 2024 is in progress.

Views

10 Views per month

Views per city

Warsaw

5

Krakow

3

Radomyśl Wielki

1

Wroclaw

1

No access

Collections

Masters theses