Reinforcement Learning Theory Book (rus)

Last update: Nov 27, 2022

Full book on Arxiv: https://arxiv.org/abs/2201.09746

Ch. 1: Introduction
Ch. 2: Meta-heuristics
- NEAT, WANN
- CEM, OpenAI-ES, CMA-ES
Ch. 3: Classic theory
- Bellman equations
- RPI, policy improvement theorem
- Value Iteration, Generalized Policy Iteration
- Temporal Difference, Q-learning, SARSA
- Eligibility Traces, TD-lambda, Retrace
Ch. 4: Value-based
- DQN
- Double DQN, Dueling DQN, PER, Noisy DQN, Multi-step DQN
- C51, QR-DQN, IQN, Rainbow DQN
Ch. 5: Policy Gradient
- REINFORCE, A2C, GAE
- TRPO, PPO
Ch. 6: Continuous Control
- DDPG, TD3
- SAC
Ch. 7: Model-based
- Bandits
- MCTS, AlphaZero, MuZero
- LQR
Ch. 8: Next Stage
- Imitation Learning / Inverse Reinforcement Learning
- Intrinsic Motivation
- Multi-Task and Hindsight
- Hierarchical RL
- Partial observability
- Multi-Agent RL
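
To give a flavor of the Chapter 3 material (Bellman equations, Value Iteration), here is a minimal Python sketch of value iteration. It is not taken from the book: the three-state chain MDP, the reward of 1 for reaching the terminal state, the discount factor of 0.9, and all names (`TRANSITIONS`, `q_value`, `value_iteration`) are assumptions made up for this illustration.

```python
# Hypothetical toy MDP (an assumption for illustration, not from the book):
# states 0 and 1 are non-terminal, state 2 is terminal.
# Action 0 moves left (or stays in state 0), action 1 moves right.
GAMMA = 0.9
TERMINAL = 2

# TRANSITIONS[state][action] = (next_state, reward), deterministic dynamics.
TRANSITIONS = {
    0: {0: (0, 0.0), 1: (1, 0.0)},
    1: {0: (0, 0.0), 1: (2, 1.0)},
}


def q_value(values, state, action):
    """One-step lookahead: r + gamma * V(s')."""
    next_state, reward = TRANSITIONS[state][action]
    return reward + GAMMA * values[next_state]


def value_iteration(tol=1e-10, max_sweeps=1000):
    """Apply the Bellman optimality backup until the values stop changing."""
    values = {0: 0.0, 1: 0.0, TERMINAL: 0.0}  # terminal value stays 0
    for _ in range(max_sweeps):
        delta = 0.0
        for state in TRANSITIONS:
            best = max(q_value(values, state, a) for a in TRANSITIONS[state])
            delta = max(delta, abs(best - values[state]))
            values[state] = best
        if delta < tol:
            break
    # Greedy policy with respect to the converged values.
    policy = {
        s: max(TRANSITIONS[s], key=lambda a: q_value(values, s, a))
        for s in TRANSITIONS
    }
    return values, policy


values, policy = value_iteration()
print(values, policy)
```

In this toy problem the greedy policy moves right in both non-terminal states, and the value of state 0 is the value of state 1 discounted by one step.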

Owner

qbrick

