Using Reinforcement Learning Methods to Price a Perishable Product, Case Study: Orange

Publish Year: 1400
نوع سند: مقاله ژورنالی
زبان: English
View: 140

This Paper With 18 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:


تاریخ نمایه سازی: 17 فروردین 1400


‎Determining the optimal selling price for different commodities has always been one of the main topics of scientific and industrial research‎. ‎Perishable products have a short life and due to their deterioration over time‎, ‎they cause great damage if not managed‎. ‎Many industries‎, ‎retailers‎, ‎and service providers have the opportunity to increase their revenue through optimal pricing of perishable products that must be sold within a certain period‎. ‎In the pricing issue‎, ‎a seller must determine the price of several units of a perishable or seasonal product to be sold for a limited time‎. ‎This article examines pricing policies that increase revenue for the sale of a given inventory with an expiration date‎. ‎Booster learning algorithms are used to analyze how companies can simultaneously learn and optimize pricing strategy in response to buyers‎. ‎It is also shown that using reinforcement learning we can model a demand-dependent problem‎. ‎This paper presents an optimization method in a model-independent environment in which demand is learned and pricing decisions are updated at the moment‎. ‎We compare the performance of learning algorithms using Monte Carlo simulations‎.


Abbas Shekari Firouzjaie

Industrial Engineering Department, Science and Technology of Behshahr, Mazandran, Iran.

Navid Sahebjamnia

Department of industrial engineering, University of Science and Technology of Mazandaran, Behshahr, Iran

Hadi Abdollahzade

Industrial Engineering Department, Science and Technology of Behshahr, Mazandran, Iran