IQL
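Implicit Q-Learning: an offline RL algorithm that never queries the Q-function on out-of-distribution actions. Instead of maximizing Q over all actions, it fits a state-value function V(s) to an upper expectile of Q(s, a) over the actions present in the dataset (expectile regression), uses V in the TD backup for Q, and extracts the policy with advantage-weighted regression. IQL achieves strong offline RL performance without the explicit Q-value penalty on unseen actions that CQL (Conservative Q-Learning) relies on, which generally makes it simpler to implement and tune. It has become popular for robot manipulation tasks trained on demonstration data.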
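
The expectile regression objective can be illustrated with a minimal sketch in plain Python. The function name `expectile_loss`, its argument names, and the default `tau=0.7` are assumptions chosen for illustration, not part of any particular IQL codebase; a real implementation would typically operate on batched tensors.

```python
def expectile_loss(q_values, v_values, tau=0.7):
    """Mean asymmetric squared error between Q(s, a) targets and V(s) predictions.

    Hypothetical helper for illustration: errors where Q exceeds V are weighted
    by tau, the rest by (1 - tau). With tau > 0.5 the minimizing V is pulled
    toward the larger Q-values seen in the data.
    """
    if not 0.0 < tau < 1.0:
        raise ValueError("tau must lie strictly between 0 and 1")
    if len(q_values) != len(v_values) or not q_values:
        raise ValueError("q_values and v_values must be non-empty and equal in length")

    total = 0.0
    for q, v in zip(q_values, v_values):
        diff = q - v
        # Asymmetric weight: under-estimates of Q are penalized more when tau > 0.5.
        weight = tau if diff > 0 else 1.0 - tau
        total += weight * diff * diff
    return total / len(q_values)


# Q-values for four dataset actions taken in the same state (made-up numbers).
samples = [0.0, 1.0, 2.0, 3.0]

# With tau = 0.9, a value estimate near the top of the samples fits better
# than the plain mean (1.5), so V approximates a soft maximum over dataset actions.
loss_at_mean = expectile_loss(samples, [1.5] * 4, tau=0.9)
loss_near_max = expectile_loss(samples, [2.5] * 4, tau=0.9)
```

Setting `tau = 0.5` reduces this to half the ordinary mean squared error, so V tracks the mean of Q; pushing `tau` toward 1 moves V toward the best action in the data without ever evaluating Q on an action outside it.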