The Sample Complexity of Bandit Multiclass Classification
In multiclass classification with bandit feedback, a learner observes only whether its predicted label was correct, not the true label itself. This simple change turns a basic supervised