Workshop’s Topic: Decision making with limited statistical characterization and limited data is challenging. The typical statistical-machine-learning approaches would call for migrating the experience of a related system with ample data through transfer learning or leveraging the similarity of multiple systems with limited data through data pooling. We, instead, develop new solution concepts to learn across related systems by adapting the parametric Operational Data Analytics (ODA) framework, which is known to produce uniformly optimal data-integrated decisions in the corresponding parametric settings for non-parametric decision making. We demonstrate, through the application of newsvendor systems, that transfer learning can, indeed, improve decision performance in the focal system by utilizing a model pre-trained using the ample data in a related system. However, in the lens of the ODA framework, the best transfer-learning decision falls in a subclass of operational statistics, limiting the ultimate optimality. In contrast, the ODA cross-learning approach utilizes the ample data from the related system to mimic the stochastic environment of the focal system. When the data from the old system are sufficiently large, the cross-learning solutions derived outperforms any transfer learning solution, and they are shown to asymptotically approach the parametric ODA solutions. When there are multiple related systems with limited data, we aggregate the data from different systems to create a generic stochastic environment for the decision-making problem, which facilitates the implementation of the parametric ODA solutions. We show that the derived co-learning solutions are asymptotically optimal for the aggregate system and for each sub-system. This approach outperforms the existing data-pooling techniques in the sense that the latter focuses only on the aggregated performance, and the chosen solution may be (asymptotically) suboptimal for individual sub-systems. Our results underscore the roles of domain knowledge and the structural relationships between the data and the decision in designing efficient learning solutions with limited data. Though we demonstrate our development through the application of newsvendor systems, the solutions developed in this study applies to a much wider class of operational decision-making problems that exhibit certain homogenous properties.
Time and Location: 10:00 AM (GMT+8), Room A523 (School of Management)
Language: Bilingual (Chinese and English)