'Reinforcement Learning' 태그의 글 목록

Notice

Recent Posts

Recent Comments

Link

« 2026/06 »
일	월	화	수	목	금	토
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

Tags more

Archives

Today

Total

관리 메뉴

목록Reinforcement Learning (2)

김태오

[Borg-Orchestrator 07] Reward Tuning with Optuna, Ray, and RLlib

Once the live loop existed, I started tuning the control behavior instead of only tuning model metrics.This was where Optuna and Ray/RLlib entered the project. I did not want tuning to be invisible. If the reward weights changed, I wanted the dashboard to show the trials. If the RL policy was disabled for a fast run, I wanted the dashboard to say so. If a PPO bootstrap completed, I wanted the ch..

ML 2026. 5. 21. 02:13

[Borg-Orchestrator 05] Designing the Six-Layer Orchestrator Stack

The six-layer orchestrator was the point where the project stopped being only a data/model pipeline and became an actual control-plane experiment.I built it because I was tired of looking at model scores in isolation. A risk score is useful only if something can consume it. A demand estimate is useful only if it can affect efficiency behavior. Queue pressure is useful only if admission control c..

ML 2026. 5. 21. 02:10

이전 Prev 1 Next 다음

목록Reinforcement Learning (2)

김태오

티스토리툴바