The Proximal Policy Optimization (PPO) algorithm is the ML-Agents Toolkit's default reinforcement learning algorithm. It alternates between sampling data through interaction with the environment and optimizing a "surrogate" objective function with stochastic gradient descent. When creating a new machine learning model, it is difficult to know the optimal model architecture for a given project up front. In most cases, we can either use the algorithm's default values or let the machine undertake this exploration and select the best model architecture automatically. Because hyperparameters define the model architecture, searching for the best model is called hyperparameter tuning. We focus on comparing four hyperparameters of the PPO algorithm, beta, epsilon, lambd, and num_epoch, in solving a maze. The results obtained during training show that performance differs depending on the hyperparameters selected, and that the appropriate settings depend on the maze's complexity and the complexity of the agent's actions. This thesis aims to help make appropriate hyperparameter choices in concrete, practical projects. Code is available at hungpt17102k/Maze-Solving-ML-Agent (github.com).
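To show where these four hyperparameters appear in an ML-Agents project, the sketch below enumerates PPO trainer configurations over candidate values of beta, epsilon, lambd, and num_epoch and writes one YAML file per combination. This is a minimal illustration, not the thesis's actual setup: the behavior name "MazeAgent", the candidate values, and the remaining trainer settings are assumptions.

```python
# Minimal sketch: generate ML-Agents-style PPO trainer configs for a
# hyperparameter sweep over beta, epsilon, lambd, and num_epoch.
# All concrete values and the behavior name are illustrative assumptions.
import itertools
import yaml  # PyYAML

beta_values = [1e-3, 5e-3, 1e-2]      # entropy regularization strength
epsilon_values = [0.1, 0.2, 0.3]      # PPO clipping range
lambd_values = [0.9, 0.95, 0.99]      # GAE lambda
num_epoch_values = [3, 5, 8]          # gradient passes per experience buffer

combos = itertools.product(beta_values, epsilon_values, lambd_values, num_epoch_values)
for i, (beta, epsilon, lambd, num_epoch) in enumerate(combos):
    config = {
        "behaviors": {
            "MazeAgent": {                 # assumed behavior name
                "trainer_type": "ppo",
                "hyperparameters": {
                    "batch_size": 128,
                    "buffer_size": 2048,
                    "learning_rate": 3.0e-4,
                    "beta": beta,
                    "epsilon": epsilon,
                    "lambd": lambd,
                    "num_epoch": num_epoch,
                },
                "max_steps": 500000,
            }
        }
    }
    with open(f"maze_ppo_{i:03d}.yaml", "w") as f:
        yaml.safe_dump(config, f)
```

Each generated file can then be passed to the trainer, e.g. `mlagents-learn maze_ppo_000.yaml --run-id=maze_000`, and the resulting training curves compared across combinations.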