Please, guys, are we using sentence BLEU or corpus BLEU for evaluation?
I don't quite get how the two differ in practice.
The name of the metric is just the "BLEU" score. That's all. You can look it up on Google.
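For what it's worth, the sentence/corpus distinction is about how the same metric gets aggregated: sentence BLEU scores each hypothesis on its own (and people then usually average those scores), while corpus BLEU pools the n-gram counts and lengths over all sentences before computing one score. Here is a rough sketch of that difference in plain Python. It is not the evaluation script for any particular task; the function names, the toy sentences, the single reference per hypothesis, the lack of smoothing, and `max_n=2` (the usual setting is 4) are all assumptions made to keep the example short.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def clipped_stats(hyp, ref, max_n):
    """Per-order (matched, total) n-gram counts for one hypothesis/reference pair."""
    stats = []
    for n in range(1, max_n + 1):
        hyp_counts = ngram_counts(hyp, n)
        ref_counts = ngram_counts(ref, n)
        # Clip each hypothesis n-gram count by its count in the reference.
        matched = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        stats.append((matched, sum(hyp_counts.values())))
    return stats


def bleu_from_stats(stats, hyp_len, ref_len):
    """Brevity penalty times the geometric mean of the n-gram precisions."""
    if any(matched == 0 for matched, _ in stats):
        return 0.0  # no smoothing in this sketch
    log_prec = sum(math.log(m / t) for m, t in stats) / len(stats)
    bp = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return bp * math.exp(log_prec)


def sentence_bleu(hyp, ref, max_n=2):
    """BLEU for a single sentence pair (hypothetical helper)."""
    return bleu_from_stats(clipped_stats(hyp, ref, max_n), len(hyp), len(ref))


def corpus_bleu(hyps, refs, max_n=2):
    """BLEU with counts and lengths pooled over the whole corpus (hypothetical helper)."""
    totals = [[0, 0] for _ in range(max_n)]
    for hyp, ref in zip(hyps, refs):
        for i, (matched, total) in enumerate(clipped_stats(hyp, ref, max_n)):
            totals[i][0] += matched
            totals[i][1] += total
    hyp_len = sum(len(h) for h in hyps)
    ref_len = sum(len(r) for r in refs)
    return bleu_from_stats([tuple(t) for t in totals], hyp_len, ref_len)


# Toy data (made up): one perfect hypothesis and one that is too short.
hyps = ["the cat sat on the mat".split(), "a dog".split()]
refs = ["the cat sat on the mat".split(), "a dog barked loudly".split()]

sentence_scores = [sentence_bleu(h, r) for h, r in zip(hyps, refs)]
avg_sentence = sum(sentence_scores) / len(sentence_scores)
corpus_score = corpus_bleu(hyps, refs)

print("sentence scores:", sentence_scores)
print("average of sentence scores:", avg_sentence)
print("corpus score:", corpus_score)
```

On this toy data the average of the sentence scores and the corpus score come out different, because the short second hypothesis is penalized on its own in the first case but only through the pooled lengths in the second. So when reporting a number it is worth checking which of the two the task expects.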