Highest Rated Comments


SorrowInCoreOfWin · 30 karma

How would you deal with the states that are underrepresented in the dataset (especially in offline RL)? Any strategies to emphasize learning in those states instead of just throwing them away?