Optimal Learning
by Powell, Warren B.; Ryzhov, Ilya O.-
Free Shipping On Orders Over $59
Free standard shipping on order over $59 to your home address. Marketplace purchases through third-party sellers are excluded from free shipping promotions.
Buy New
Rent Textbook
Used Textbook
We're Sorry
Sold Out
eTextbook
We're Sorry
Not Available
How Marketplace Works:
- This item is offered by an independent seller and not shipped from our warehouse
- Item details like edition and cover design may differ from our description; see seller's comments before ordering.
- Sellers much confirm and ship within two business days; otherwise, the order will be cancelled and refunded.
- Marketplace purchases cannot be returned to eCampus.com. Contact the seller directly for inquiries; if no response within two days, contact customer service.
- Additional shipping costs apply to Marketplace purchases. Review shipping costs at checkout.
Summary
Author Biography
Table of Contents
| Preface | p. xv |
| Acknowledgments | p. xix |
| The Challenges of Learning | p. 1 |
| Learning the Best Path | p. 2 |
| Areas of Application | p. 4 |
| Major Problem Classes | p. 12 |
| The Different Types of Learning | p. 13 |
| Learning from Different Communities | p. 16 |
| Information Collection Using Decision Trees | p. 18 |
| A Basic Decision Tree | p. 18 |
| Decision Tree for Offline Learning | p. 20 |
| Decision Tree for Online Learning | p. 21 |
| Discussion | p. 25 |
| Website and Downloadable Software | p. 26 |
| Goals of this Book | p. 26 |
| Problems | p. 27 |
| Adaptive Learning | p. 31 |
| The Frequentist View | p. 32 |
| The Bayesian View | p. 33 |
| The Updating Equations for Independent Beliefs | p. 34 |
| The Expected Value of Information | p. 36 |
| Updating for Correlated Normal Priors | p. 38 |
| Bayesian Updating with an Uninformative Prior | p. 41 |
| Updating for Non-Gaussian Priors | p. 42 |
| The Gamma-Exponential Model | p. 43 |
| The Gamma-Poisson Model | p. 44 |
| The Pareto-Uniform Model | p. 45 |
| Models for Learning Probabilities* | p. 46 |
| Learning an Unknown Variance* | p. 49 |
| Monte Carlo Simulation | p. 51 |
| Why Does It Work?* | p. 54 |
| Derivation of ¿ | p. 54 |
| Derivation of Bayesian Updating Equations for Independent Beliefs | p. 55 |
| Bibliographic Notes | p. 57 |
| Problems | p. 57 |
| The Economics of Information | p. 61 |
| An Elementary Information Problem | p. 61 |
| The Marginal Value of Information | p. 65 |
| An information Acquisition Problem | p. 68 |
| Bibliographic Notes | p. 70 |
| Problems | p. 70 |
| Ranking and Selection | p. 71 |
| The Model | p. 72 |
| Measurement Policies | p. 75 |
| Deterministic Versus Sequential Policies | p. 75 |
| Optimal Sequential Policies | p. 76 |
| Heuristic Policies | p. 77 |
| Evaluating Policies | p. 81 |
| More Advanced Topics* | p. 83 |
| An Alternative Representation of the Probability Space | p. 83 |
| Equivalence of Using True Means and Sample Estimates | p. 84 |
| Bibliographic Notes | p. 85 |
| Problems | p. 85 |
| The Knowledge Gradient | p. 89 |
| The Knowledge Gradient for Independent Beliefs | p. 90 |
| Computation | p. 91 |
| Some Properties of the Knowledge Gradient | p. 93 |
| The Four Distributions of Learning | p. 94 |
| The Value of Information and the S-Curve Effect | p. 95 |
| Knowledge Gradient for Correlated Beliefs | p. 98 |
| Anticipatory Versus Experiential Learning | p. 103 |
| The Knowledge Gradient for Some Non-Gaussian Distributions | p. 105 |
| The Gamma-Exponential Model | p. 105 |
| The Gamma-Poisson Model | p. 108 |
| The Pareto-Uniform Model | p. 109 |
| The Beta-Bernoulli Model | p. 111 |
| Discussion | p. 113 |
| Relatives of the Knowledge Gradient | p. 114 |
| Expected Improvement | p. 114 |
| Linear Loss* | p. 115 |
| The Problem of Priors | p. 118 |
| Discussion | p. 120 |
| Why Does It Work?* | p. 120 |
| Derivation of the Knowledge Gradient Formula | p. 120 |
| Bibliographic Notes | p. 125 |
| Problems | p. 125 |
| Bandit Problems | p. 139 |
| The Theory and Practice of Gittins Indices | p. 141 |
| Gittins Indices in the Beta-Bernoulli Model | p. 142 |
| Gittins Indices in tie Normal-Normal Model | p. 145 |
| Approximating Gittins Indices | p. 147 |
| Variations of Bandit Problems | p. 148 |
| Upper Confidence Bounding | p. 149 |
| The Knowledge Gradient for Bandit Problems | p. 151 |
| The Basic Idea | p. 151 |
| Some Experimental Comparisons | p. 153 |
| Non-Normal Models | p. 156 |
| Bibliographic Notes | p. 157 |
| Problems | p. 157 |
| Elements of a Learning Problem | p. 163 |
| The States of our System | p. 164 |
| Types of Decisions | p. 166 |
| Exogenous Information | p. 167 |
| Transition Functions | p. 168 |
| Objective Functions | p. 168 |
| Designing Versus Controlling | p. 169 |
| Measurement Costs | p. 170 |
| Objectives | p. 170 |
| Evaluating Policies | p. 175 |
| Discussion | p. 177 |
| Bibliographic Notes | p. 178 |
| Problems | p. 178 |
| Linear Belief Models | p. 181 |
| Applications | p. 182 |
| Maximizing Ad Clicks | p. 182 |
| Dynamic Pricing | p. 184 |
| Housing Loans | p. 184 |
| Optimizing Dose Response | p. 185 |
| A Brief Review of Linear Regression | p. 186 |
| The Normal Equations | p. 186 |
| Recursive Least Squares | p. 187 |
| A Bayesian Interpretation | p. 188 |
| Generating a Prior | p. 189 |
| The Knowledge Gradient for a Linear Model | p. 191 |
| Application to Drug Discovery | p. 192 |
| Application to Dynamic Pricing | p. 196 |
| Bibliographic Notes | p. 200 |
| Problems | p. 200 |
| Subset Selection Problems | p. 203 |
| Applications | p. 205 |
| Choosing a Subset Using Ranking and Selection | p. 207 |
| Setting Prior Means and Variances | p. 207 |
| Two Strategies for Setting Prior Covariances | p. 208 |
| Larger Sets | p. 209 |
| Using Simulation to Reduce the Problem Size | p. 210 |
| Computational Issues | p. 212 |
| Experiments | p. 213 |
| Very Large Sets | p. 214 |
| Bibliographic Notes | p. 216 |
| Problems | p. 216 |
| Optimizing a Scalar Function | p. 219 |
| Deterministic Measurements | p. 219 |
| Stochastic Measurements | p. 223 |
| The Model | p. 223 |
| Finding the Posterior Distribution | p. 224 |
| Choosing the Measurement | p. 226 |
| Discussion | p. 229 |
| Bibliographic Notes | p. 229 |
| Problems | p. 229 |
| Optimal Bidding | p. 231 |
| Modeling Customer Demand | p. 233 |
| Some Valuation Models | p. 233 |
| The Logit Model | p. 234 |
| Bayesian Modeling for Dynamic Pricing | p. 237 |
| A Conjugate Prior for Choosing Between Two Demand Curves | p. 237 |
| Moment Matching for Nonconjugate Problems | p. 239 |
| An Approximation for the Logit Model | p. 242 |
| Bidding Strategies | p. 244 |
| An Idea From Multi-Armed Bandits | p. 245 |
| Bayes-Greedy Bidding | p. 245 |
| Numerical Illustrations | p. 247 |
| Why Does It Work?* | p. 251 |
| Moment Matching for Pareto Prior | p. 251 |
| Approximating the Logistic Expectation | p. 252 |
| Bibliographic Notes | p. 253 |
| Problems | p. 254 |
| Stopping Problems | p. 255 |
| Sequential Probability Ratio Test | p. 255 |
| The Secretary Problem | p. 261 |
| Setup | p. 261 |
| Solution | p. 262 |
| Bibliographic Notes | p. 266 |
| Problems | p. 266 |
| Active Learning in Statistics | p. 269 |
| Deterministic Policies | p. 270 |
| Sequential Policies for Classification | p. 274 |
| Uncertainty Sampling | p. 274 |
| Query by Committee | p. 275 |
| Expected Error Reduction | p. 277 |
| A Variance-Minimizing Policy | p. 277 |
| Mixtures of Gaussians | p. 280 |
| Estimating Parameters | p. 280 |
| Active Learning | p. 282 |
| Bibliographic Notes | p. 283 |
| Simulation Optimization | p. 285 |
| Indifference Zone Selection | p. 288 |
| Batch Procedures | p. 288 |
| Sequential Procedures | p. 290 |
| The 0-1 Procedure: Connection to Linear Loss | p. 292 |
| Optimal Computing Budget Allocation | p. 293 |
| Indifference-Zone Version | p. 293 |
| Linear Loss Version | p. 295 |
| When Does It Work? | p. 295 |
| Model-Based Simulated Annealing | p. 296 |
| Other Areas of Simulation Optimization | p. 298 |
| Bibliographic Notes | p. 299 |
| Learning in Mathematical Programming | p. 301 |
| Applications | p. 303 |
| Piloting a Hot Air Balloon | p. 303 |
| Optimizing a Portfolio | p. 308 |
| Network Problems | p. 309 |
| Discussion | p. 313 |
| Learning on Graphs | p. 313 |
| Alternative Edge Selection Policies | p. 317 |
| Learning Costs for Linear Programs* | p. 318 |
| Bibliographic Notes | p. 324 |
| Optimizing Over Continuous Measurements | p. 325 |
| The Belief Model | p. 327 |
| Updating Equations | p. 328 |
| Parameter Estimation | p. 330 |
| Sequential Kriging Optimization | p. 332 |
| The Knowledge Gradient for Continuous Parameters* | p. 334 |
| Maximizing the Knowledge Gradient | p. 334 |
| Approximating the Knowledge Gradient | p. 335 |
| The Gradient of the Knowledge Gradient | p. 336 |
| Maximizing the Knowledge Gradient | p. 338 |
| The KGCP Policy | p. 339 |
| Efficient Global Optimization | p. 340 |
| Experiments | p. 341 |
| Extension to Higher-Dimensional Problems | p. 342 |
| Bibliographic Notes | p. 343 |
| Learning With a Physical State | p. 345 |
| Introduction to Dynamic Programming | p. 347 |
| Approximate Dynamic Programming | p. 348 |
| The Exploration vs. Exploitation Problem | p. 350 |
| Discussion | p. 351 |
| Some Heuristic Learning Policies | p. 352 |
| The Local Bandit Approximation | p. 353 |
| The Knowledge Gradient in Dynamic Programming | p. 355 |
| Generalized Learning Using Basis Functions | p. 355 |
| The Knowledge Gradient | p. 358 |
| Experiments | p. 361 |
| An Expected Improvement Policy | p. 363 |
| Bibliographic Notes | p. 364 |
| Index | p. 381 |
| Table of Contents provided by Ingram. All Rights Reserved. |
An electronic version of this book is available through VitalSource.
This book is viewable on PC, Mac, iPhone, iPad, iPod Touch, and most smartphones.
By purchasing, you will be able to view this book online, as well as download it, for the chosen number of days.
Digital License
You are licensing a digital product for a set duration. Durations are set forth in the product description, with "Lifetime" typically meaning five (5) years of online access and permanent download to a supported device. All licenses are non-transferable.
More details can be found here.
A downloadable version of this book is available through the eCampus Reader or compatible Adobe readers.
Applications are available on iOS, Android, PC, Mac, and Windows Mobile platforms.
Please view the compatibility matrix prior to purchase.