-
公开(公告)号:US11748647B1
公开(公告)日:2023-09-05
申请号:US16708341
申请日:2019-12-09
Applicant: Amazon Technologies, Inc.
Inventor: Wenjun Zeng , Yi Liu , Zachary Wake Austin , Hau Wing Calvin Kwok
IPC: G06F16/958 , G06N20/20 , G06N7/01 , G06N3/047 , G06N5/01 , G06Q30/0241 , G06Q20/12
CPC classification number: G06N7/01 , G06F16/958 , G06N3/047 , G06N5/01 , G06N20/20 , G06Q20/127 , G06Q30/0277
Abstract: Technologies are provided for the generating of optimal policies for bidding in auctions having unknown dynamics. In some embodiments, a computing system can configure many multi-armed bandit (MAB) models defining candidate directed contents for a sequence of pages. A particular MAB model of the many MAB models defines candidate directed contents for a particular page in the sequence of pages, where each arm in the particular MAB model corresponds to a candidate impression on the particular page. The computing system can then determine a solution to an optimization problem with respect to an objective function based on an expected long-term reward for a defined impression on the first page, a defined impression on the second page, and a defined impression on the third page. The solution results in respective directed content for presentation on the first, second, and third pages.