Optimal Frame RTB Algorithm Using Deep Reinforcement Learning
Technical Lead | CNAI, KAIST | Mar. 2020–Mar. 2021
- Independently formulated the optimization problem for customized real-time bidding of online ad frames as a Markov decision process.
- Designed an offline deep reinforcement learning method based on counterfactual learning to address the data scarcity inherent in the target settings.
- Led a team of engineers and researchers from algorithm design through software implementation and delivery.