====== Projektgruppe 642 (Sommersemester 2021) ======

===  Verteiltes Deep Reinforcement Learning System zum Trainieren von Game AI === 

  * Veranstalter:  [[https://ls11-www.cs.tu-dortmund.de/people/rudolph/|Prof. Dr. Günter Rudolph]] [[https://ls11-www.cs.tu-dortmund.de/staff/pleines|M.Sc. Marco Pleines]] [[https://ls11-www.cs.tu-dortmund.de/staff/kalkreuth|M.Sc. Roman Kalkreuth]]

=== Regelmäßiger betreuter Zeitraum ===

Mittwochs 11 - 12 Uhr

Discord

Ansonsten je nach Bedarf Termine mit den Betreuern vereinbaren.

=== Deadnline Abschlussbericht ===

31.03.2022

=== PG Richtlinien und Modulhandbuch ===

[[https://www.cs.tu-dortmund.de/nps/de/Studium/Ordnungen_Handbuecher_Beschluesse/Modulhandbuecher/Master_Inf/Pflichtveranstaltungen/INF-MSc-101.pdf|Modulhandbuch]]

[[https://www.cs.tu-dortmund.de/nps/de/Studium/besondere_Lehrveranstaltungen/Projektgruppen/Rechtliches/PGRichtlinien/PGR_2012__2012_10_24.pdf|PG Richtlinien]]


=== Seminartermine ===

^ Thema ^  Datum  ^  Uhrzeit  ^  Platform  ^
| Rainbow DQN |  15.04.21 | 14:00 Uhr | Zoom |
| Soft Actor-Critic |  22.04.21 | 14:00 Uhr | ::: |
| Proximal Policy Optimization |  29.04.21 | 14:00 Uhr | ::: |
| Never Give Up |  06.05.21 | 14:00 Uhr | ::: |


=== Folien ===

^  Nr.  ^  Thema  ^  Download  ^  
| 1 | Kick-Off Meeting | {{ :teaching:pg642-kick-off.pdf |PG642-Kick-Off.pdf}} | 
| 2 | Projektmanagement Workshop Teil I | {{ :teaching:pg642:pg642-projektmanagement-workshop-1.pdf |PG642-Projektmanagement-Workshop-1.pdf}} | 
| 2 | Projektmanagement Workshop Teil II | {{ :teaching:pg642:projektmanagement_workshop_teil2.pdf |PG642-Projektmanagement-Workshop-2.pdf}} | 
| 3 | Projektmanagement Workshop Teil III | {{ :teaching:pg642:projektmanagement_workshop_teil3.pdf |PG642-Projektmanagement-Workshop-3.pdf}} | 


=== Praktikum ===

^  Nr.  ^  Thema  ^  Download  ^  
| 1 | DRL Umwelten |  {{ :teaching:pg642:praktikum_01_folien.pdf |Praktikum_01_Folien.pdf}} {{ :teaching:pg642:praktikum_01_aufgaben.pdf |Praktikum_01_Aufgaben.pdf}} | 
| 2 & 3 | Experimente, LiDo 3 |  {{ :teaching:pg642:praktikum_02_folien.pdf |Praktikum_02_Folien.pdf}} {{ :teaching:pg642:praktikum_02_aufgaben.pdf |Praktikum_02_Aufgaben.pdf}} {{https://drive.google.com/file/d/1QTX4DUYCqP2AN6BfejIukoO80qUerWbq/view?usp=sharing|Datenpaket}} {{https://colab.research.google.com/drive/1K5Pq29oTyoHgJp2rsZ7rNN0N9kbn7Bq9?usp=sharing|Google Colab}}|
| 4 | Testing, Debugging | {{ :teaching:pg642:praktikum_03.pdf |}}|


=== Literatur ===

== Bücher ==

  *  R. S. Sutton and A. G. Barto :  "Reinforcement Learning: An Introduction", MIT Press, Cambridge, 2018 (ISBN:  9780262039246)
  *  G. N. Yannakakis and J. Togelius :  "Artificial Intelligence and Games", Springer, 2018 (ISBN: 9783319635194) 
  *  M. Lapan:  "Deep Reinforcement Learning Hands-On", Packt Publishing, 2018 (ISBN: 9781788834247) 
  *  L. Graesser and W. L. Keng: "Foundations of Deep Reinforcement Learning",  Addison-Wesley Professional, 2019 (ISBN: 9780135172490) 
  * M. Morales: "grokking Deep Reinforcement Learning", Manning Publications Co., 2020 (ISBN: 9781617295454)

== Paper/Tutorials ==

  *  V. Mnih et al., “Human-level control through deep reinforcement learning”, Nat., vol. 518, no. 7540, pp. 529-533, 2015 K.Cobbe et al., “Leveraging procedural generation to benchmark reinforcement learning”, CoRR,vol. abs/1912.01588, 2019.
  * A. Juliani et al., “Obstacle tower: A generalization challenge in vision, control, and planning”, in Proceedings IJCAI 2019, Macao, China (S. Kraus, ed.), pp. 2684-2691, ijcai.org, 2019.
  * O. Vinyals et al., “Grandmaster level in starcraft II using multi-agent reinforcement learning”, Nat., vol. 575, no. 7782, pp. 350-354, 2019.
  * C. Berner et al., “Dota 2 with large scale deep reinforcement learning”, CoRR, vol. abs/ 1912.06680, 2019.
  * C. Hidber, “Reinforcement Learning: a gentle Introduction and industrial Application”, aufgerufen über https://youtu.b/3RjSanoNIlk am 14.12.2020, 2019.
  * M. Andrychowicz et al., “Learning dexterous in-hand manipulation”, Int. J. Robotics Res., vol. 39, no. 1, 2020.
  * M. G. Bellemare et al., “Autonomous navigation of stratospheric balloons using reinforcement learning”, Nat. vol. 588, pp.77-82, 2020.