Highest Rated Comments


ombelicoInfinito · 11 karma

How good/bad do you think metrics for NLG (including summarization, translation, etc.) are? Can we trust them at this point? Do you use them in your work, or do you evaluate with humans or other methods?