Hi,
In the example given on the pytorch website for calculating the bleu_score:
>>> from torchtext.data.metrics import bleu_score
>>> candidate_corpus = [['My', 'full', 'pytorch', 'test'], ['Another', 'Sentence']]
>>> references_corpus = [[['My', 'full', 'pytorch', 'test'], ['Completely', 'Different']], [['No', 'Match']]]
>>> bleu_score(candidate_corpus, references_corpus)
0.8408964276313782
My question is:
For the sentence ['My', 'full', 'pytorch', 'test']
, the Bleu is getting calculated against [['My', 'full', 'pytorch', 'test'], ['Completely', 'Different']]
and for the sentence ['Another', 'Sentence']
the Bleu is getting calculated against [['No', 'Match']]
, Right ?